Guideline rules on name and type conventions #9

jl-wynen · 2024-03-04T14:35:12Z

Fixes #7

Please flag any and all names / types that you don't think are appropriate! And suggest any that I missed.

SimonHeybrock · 2024-03-05T03:56:17Z

docs/user-guide/reduction-workflow-guidelines.md

@@ -43,6 +40,56 @@ We plan to include the following in future versions of the guidelines:

 - *Provider*: A callable step in a workflow writing with Sciline.

+## C: Convention


Please either remove the empty Naming below if you prefer Convention, or move this.

SimonHeybrock · 2024-03-05T04:00:16Z

docs/user-guide/reduction-workflow-guidelines.md

+| --- **Run IDs** ---           |             |                                                                           |
+| SampleRun, BackgroundRun, ... | Any         | Identifier for a run                                                      |
+| RunType                       | TypeVar     | Constrained to the run types used by the package, see above               |
+| RunTitle                      | str         | Extracted from NeXus or provided by user, can be used to find files       |
+| --- **Monitors** ---          |             |                                                                           |
+| IncidentMonitor \| *Monitor   | Any         | Identifier for a monitor                                                  |
+| MonitorType                   | TypeVar     | Constrained to the monitor types used by the package, see above           |


Here we assume that a workflow necessarily wants/needs to use generics for run types and monitor types. Is that a good assumption? Or should we consider formulating it as "if using generics for X, use this convention"?

Should we maybe split C.1 to discuss naming for generics (and their typevars) separately?

> Here we assume that a workflow necessarily wants/needs to use generics for run types and monitor types. Is that a good assumption? Or should we consider formulating it as "if using generics for X, use this convention"?

I don't understand what you are saying. When we don't use generics, SampleRun, ... and RunType simply don't exist. This table doesn't mandate using all entries; it is supposed to tell us how to name them if and when we need them.

> Should we maybe split C.1 to discuss naming for generics (and their typevars) separately?

Why? How would you structure that table?

> I don't understand what you are saying. When we don't use generics, the SampleRun, ... and RunType simply don't exist.

Say I want to write a workflow that processes a sample run and a vanadium run, without using generics. Which names do I use?

I am suggesting to make two guidelines (each with their own table), or have a separate guideline specifying conventions for naming typevars and identifiers.

In principle, this is already part of it. Simply ignore the Run IDs section and use the '*Filename'-type patterns. Do you think we need to be more explicit and list {Sample,Vanadium}Filename, {Sample,Vanadium}RawData, etc?

If the reader is not familiar with using generic providers in a workflow, then the current table is quite unclear. Please make a separate guideline. We do not need to make an explicit list.
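
For readers unfamiliar with generic providers, the two naming styles discussed in this thread can be sketched roughly as follows. This is a minimal illustration using only the standard `typing` module, not the actual definitions from any package: the `Filename` class and the concrete `NewType` declarations are assumptions made for the example, and a real workflow would more likely build on Sciline's own generic helpers.

```python
from typing import Generic, NewType, TypeVar

# Run identifiers: used only as type tags, so the underlying type is irrelevant.
SampleRun = NewType("SampleRun", int)
BackgroundRun = NewType("BackgroundRun", int)

# TypeVar constrained to the run types used by the package.
RunType = TypeVar("RunType", SampleRun, BackgroundRun)


class Filename(Generic[RunType]):
    """Hypothetical generic domain type: a file name tagged by run type."""

    def __init__(self, value: str) -> None:
        self.value = value


# Without generics there are no run tags and no TypeVar.
# Each quantity gets its own name following the '*Filename' pattern.
SampleFilename = NewType("SampleFilename", str)
VanadiumFilename = NewType("VanadiumFilename", str)

# Generic style: one type, parametrized by the run tag.
sample_file = Filename[SampleRun]("sample.nxs")
# Non-generic style: one distinct type per run.
vanadium_file = VanadiumFilename("vanadium.nxs")
```

In the generic style, a single provider can be written once for all run types; in the non-generic style, each name such as `SampleFilename` needs its own provider.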

SimonHeybrock · 2024-03-05T04:02:56Z

docs/user-guide/reduction-workflow-guidelines.md

+- Gracefully promote dtypes for small parameters.
+  E.g., `sc.scalar(2, unit='m')` and `sc.scalar(2.0, unit='m')` should be usable interchangeably.


👍
Should we require the same for parameters like bin edges, so that, e.g., passing arange works just like passing linspace?

Doesn't it work the same way?

Operations may raise when ints are passed instead of floats as coords (bin edges, for example). I am suggesting that we extend your rule on graceful handling to cover this case.
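
A minimal sketch of what graceful promotion of bin edges could look like. It uses NumPy arrays as a stand-in for Scipp variables, and the helper name `as_float_edges` is hypothetical; it is not part of the guidelines or of any library.

```python
import numpy as np


def as_float_edges(edges):
    """Return bin edges as floats, promoting integer input to float64."""
    edges = np.asarray(edges)
    if np.issubdtype(edges.dtype, np.integer):
        # Integer edges (e.g. from arange) are promoted to float64.
        return edges.astype(np.float64)
    # Floating-point edges (e.g. from linspace) are returned unchanged.
    return edges


int_edges = np.arange(0, 5)         # integer dtype
float_edges = np.linspace(0, 4, 5)  # float64 dtype

# After promotion, both inputs describe the same edges with the same dtype.
promoted = as_float_edges(int_edges)
```

A provider that calls such a helper on its inputs treats integer and floating-point edges interchangeably, which is the behavior requested above.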

SimonHeybrock · 2024-03-05T09:33:02Z

docs/user-guide/reduction-workflow-guidelines.md

+|-------------------------------------------|---------|-----------------------------------------------------------------|
+| --- **Run IDs** ---                       |         |                                                                 |
+| SampleRun, BackgroundRun, ...             | Any     | Identifier for a run, only used as a type tag                   |
+| RunType                                   | TypeVar | Constrained to the run types used by the package, see above     |


So is the rule to end typevars with Type?

Don't know in general. I've only seen those two in practice.

@SimonHeybrock Do we need to do anything about this?

Should we make this a rule? Otherwise, I think it is quite confusing to tell what is a typevar and what is an ID.

SimonHeybrock · 2024-03-05T10:42:53Z

docs/user-guide/reduction-workflow-guidelines.md

+| Name                        | Type        | Description                                                               |
+|-----------------------------|-------------|---------------------------------------------------------------------------|
+| --- **Files** ---           |             |                                                                           |
+| Filename \| *Filename       | str         | Simple name of a file, must be processed into FilePath                    |


Should Filename even exist? If the file is obtained from SciCat, we only have FilePath, I suppose. Otherwise, can we replace Filename by, e.g., an instrument identifier and a run number?

SciCat datasets can contain any number of files. So we need a pair of (dataset id, file id). And the file id can only really be a file name.
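
The (dataset id, file id) pair mentioned above could be represented along these lines. This is only an illustrative sketch: the class name `DatasetFile`, its field names, and the example values are all hypothetical and do not come from SciCat or from the guidelines.

```python
from typing import NamedTuple


class DatasetFile(NamedTuple):
    """Hypothetical identifier for a single file inside a SciCat dataset."""

    dataset_id: str  # identifies the dataset, which may hold many files
    filename: str    # the file id within the dataset; in practice a file name


# Made-up example values.
ref = DatasetFile(dataset_id="example-dataset", filename="run_0001.nxs")
```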

SimonHeybrock reviewed Mar 5, 2024


jl-wynen added 6 commits March 11, 2024 13:02

List name and type conventions

71e71e1

Add rule about flexible types

764d22c

Remove naming section

b19fff5

Mention arange vs linspace

ccc8804

Split off type vars table

b16ea24

Make Type suffix a rule

249c8c0

jl-wynen force-pushed the conventions branch from b310c37 to 249c8c0 March 11, 2024 12:02

SimonHeybrock approved these changes Mar 11, 2024


jl-wynen merged commit 8bf9dcf into main Mar 11, 2024
3 checks passed

jl-wynen deleted the conventions branch March 11, 2024 13:12
