Expose NeXus transformation chains in workflow steps #114

SimonHeybrock · 2024-10-02T13:35:05Z

Part of #96.

This is not completing this task, but does most of the prep work. The idea is that the to_transformation could accept additional arguments (or be replaced by one that does), which could:

Select a specific time interval, including averaging and threshold checking.
Update values of one or more transformations in the chain. This is mainly relevant for live-data reduction.

Neither 1.) nor 2.) is implemented right now. The initial focus will be on 1.). I would mainly like input from the reviewer on whether this new structure is adequate for this. The actual implementation of these features could be performed in follow-up work.

Depends on a release of scipp/scippnexus#243 (that is why CI is failing for now, otherwise this PR is ready for review).

Note that I ran into major (code) overhead/duplication from the current duplication (or near-duplication) of different code for different component types. Previously we had failed to find a way to generalize this. The reason was that monitors required one extra type-parameter (MyMonitor[RunType, MonitorType] vs MyDetector[RunType], MySample[RunType], ...). I believe I have now solved this via the new ComponentType. We can thus use MyComponent[Component, RunType] almost throughout, resulting in a significant deduplication. The "trick" is that Component takes general component names except for the monitors, where there are multiple named ones.

unused arg Compute component pos later Fix for non-nexus component "groups"

SimonHeybrock · 2024-10-02T13:38:27Z

src/ess/reduce/nexus/_nexus_loader.py

-    return compute_component_position(loaded)
+    return loaded


This is the heart of the change in this PR: We are no longer computing the position directly here. This is moved into an extra step.

Previously we introduced false dependencies of detector ids on these.

SimonHeybrock · 2024-10-03T13:44:53Z

src/ess/reduce/nexus/types.py

@@ -95,9 +91,9 @@ class TransmissionRun(Generic[ScatteringRunType]):
 """Identifier for an arbitrary monitor"""
 Monitor5 = NewType('Monitor5', int)
 """Identifier for an arbitrary monitor"""
-Incident = NewType('Incident', int)
+IncidentMonitor = NewType('IncidentMonitor', int)


Added the monitor suffix here since these now show up as MyComponent[IndicidentMonitor] and it may be confusing without.

nvaytet · 2024-10-04T11:49:00Z

src/ess/reduce/nexus/workflow.py

-) -> NeXusSource[RunType]:
-    """
-    Load a NeXus source group from a file.
+def nx_class_for_monitor() -> NeXusClass[MonitorType]:


Slightly annoying that we need to explicitly have nx_class_for_monitor, nx_class_for_detector, nx_class_for_source, nx_class_for_sample... but not sure what can be done?

One could set it as workflow parameters. Alternative would be to duplicate downstream providers, but as those have functionality it would mean code duplication, i.e., break more easily by running out of sync.

I think it's fine as is for now.

nvaytet · 2024-10-04T12:05:56Z

src/ess/reduce/nexus/workflow.py

-) -> NeXusMonitor[RunType, MonitorType]:
-    """
-    Load monitor from NeXus, but with event data replaced by placeholders.
+    When loading a detector or monitor, event data is replaced by placeholders.


Maybe I missed something, but is this for loading only detectors and monitors? I thought Component was also for sample and source?

Edit: I now see that the case for NXsample is overloaded by the load_nexus_sample function above. Are we missing a load_nexus_source function?

It is also used for the source. But this paragraph basically just applies to the groups that have event data.

nvaytet · 2024-10-04T12:15:04Z

src/ess/reduce/nexus/workflow.py

+    file_path_to_file_spec,
+    all_pulses,
+    component_spec_by_name,
+    unique_component_spec,  # after component_spec_by_name, partially overrides


Does the comment mean that we are relying on the behaviour of the pipeline; that it applies the providers in the order that they are given. Can we assume that the behaviour will never change?

We are doing that in many places now, not just here: Insertion order does matter.

There are explicit tests for this, e.g., here: https://github.com/scipp/sciline/blob/a90b73c50d4cef1fc8adb231dc2208cf99ef2f0a/tests/pipeline_test.py#L916. That is, this is intentional behavior and not likely to change. If it would, quite a few other workflows would also break, I think.

nvaytet · 2024-10-04T12:19:59Z

tests/nexus/workflow_test.py

-        nexus_detector,
-        offset=workflow.no_offset,
-        source_position=source_position,
-        sample_position=workflow.origin,
-        gravity=workflow.gravity_vector_neg_y(),
-        bank_sizes={},
-    )
-    assert_identical(
-        detector.drop_coords(('sample_position', 'source_position', 'gravity')),
-        nexus_detector['data'],


SimonHeybrock added 3 commits October 2, 2024 14:02

Do not auto-compute component position

81145a0

unused arg Compute component pos later Fix for non-nexus component "groups"

Add example of extrating position via separate TransformationChain

596cf26

Remove unnecessary added code

7c634ac

SimonHeybrock commented Oct 2, 2024

View reviewed changes

SimonHeybrock mentioned this pull request Oct 3, 2024

Loki gets lab-frame transform for solid angle from uncalibrated detector position scipp/esssans#174

Closed

SimonHeybrock added 12 commits October 3, 2024 11:16

Move source/sample position assign to separate later step

61d9e96

Previously we introduced false dependencies of detector ids on these.

Try to unify component handling code

33a2b5f

Unify some more code

bb3837c

More unification

ab4ffd9

Drop selection also for monitor

4ab8a23

Unify even more

99bde33

Cleanup

c7181c5

Rename TypeVar ComponentType

0f68b0c

Rename monitor comp names

3518616

Rename some more

54d39c4

Update tests

1aa06c9

docstring

f71405a

SimonHeybrock commented Oct 3, 2024

View reviewed changes

Cleanup unused subclass

3b80bc2

SimonHeybrock marked this pull request as ready for review October 3, 2024 13:49

SimonHeybrock added 2 commits October 3, 2024 15:53

Document reason for function split

f58b51f

Comment on offset

e161dc9

nvaytet self-assigned this Oct 4, 2024

nvaytet reviewed Oct 4, 2024

View reviewed changes

nvaytet approved these changes Oct 4, 2024

View reviewed changes

SimonHeybrock enabled auto-merge October 7, 2024 08:06

SimonHeybrock disabled auto-merge October 7, 2024 08:06

SimonHeybrock and others added 2 commits October 7, 2024 10:07

Bump to new minimum scippnexus

1618d6c

Merge branch 'main' into transformations

b202320

SimonHeybrock enabled auto-merge October 8, 2024 04:26

SimonHeybrock merged commit da717ec into main Oct 8, 2024
4 checks passed

SimonHeybrock deleted the transformations branch October 8, 2024 04:28

This was referenced Oct 8, 2024

Update for restructured NeXus base workflow from ess.reduce.nexus scipp/esssans#178

Merged

Update for restructured NeXus base workflow from ess.reduce.nexus scipp/essdiffraction#99

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expose NeXus transformation chains in workflow steps #114

Expose NeXus transformation chains in workflow steps #114

Uh oh!

SimonHeybrock commented Oct 2, 2024 •

edited

Loading

Uh oh!

SimonHeybrock Oct 2, 2024

Uh oh!

SimonHeybrock Oct 3, 2024

Uh oh!

nvaytet Oct 4, 2024 •

edited

Loading

Uh oh!

SimonHeybrock Oct 4, 2024

Uh oh!

nvaytet Oct 4, 2024

Uh oh!

nvaytet Oct 4, 2024

Uh oh!

SimonHeybrock Oct 4, 2024

Uh oh!

nvaytet Oct 4, 2024

Uh oh!

SimonHeybrock Oct 4, 2024

Uh oh!

SimonHeybrock Oct 4, 2024

Uh oh!

nvaytet Oct 4, 2024

Uh oh!

Uh oh!

Uh oh!

Expose NeXus transformation chains in workflow steps #114

Expose NeXus transformation chains in workflow steps #114

Uh oh!

Conversation

SimonHeybrock commented Oct 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nvaytet Oct 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

SimonHeybrock commented Oct 2, 2024 •

edited

Loading

nvaytet Oct 4, 2024 •

edited

Loading