Split loading of monitors and detector data #82

SimonHeybrock · 2024-02-12T12:21:36Z

There are a couple reasons for this:

Increase opportunities for task-graph level parallelism.
If monitors are large, it may be helpful to avoid issues with memory.
Simplify handling of transmission runs, where we only want to load monitors. This would avoid having to worry about whether the transmission runs have detector data (which we would want to exclude from the load), or data-runs where we want to avoid loading the transmission monitor (if there is a separate transmission monitor).

Should also be considered in relation to #83.

nvaytet · 2024-02-12T12:33:47Z

Note that a first attempt at this was done when implementing the direct beam iterations.
It was however reverted in favour of a load_nexus which loaded everything internally.

If I remember correctly, this was mostly for simplicity and clarity in the final task graph, but we did not rule out implementing this in the future.

SimonHeybrock · 2024-02-12T13:21:33Z

I have experiment with larger files today (up to about 1e9 in detector and monitor) and my conclusion was that we have to be more deliberate about dependencies in the graphs, such as avoiding keeping alive intermediates by using them in multiple places.

We actually have to also sit down and have a look at final meta data assembly, e.g., as implemented in scipp/essreflectometry#27, as this potentially introduces long term "global" dependencies that result in keeping alive raw data longer than we want. @jl-wynen has already ran into some problem, I think, so we really have to figure out some guidelines/lessons.

SimonHeybrock · 2024-02-15T04:34:27Z

After some more thought, I believe we should also split loading "the rest" of the file: We need information such as chopper logs to be able to process either monitors or data, and there are probably also other examples such as determining slicing. So my suggestion is something like:

Load everything but monitors and detectors
Some pre-processing
Load
- monitors
- detectors

Furthermore, if we need to resort to chunking the processing of the events due to memory limitations, it would be advantageous to load and process the "rest" only once.

Splitting this is probably also beneficial for reusing more code for the live-reduction, where we initially get the non-event data, and a series of updates containing events and log values.

Make sure to see also scipp/beamlime#134.

SimonHeybrock · 2024-02-26T12:17:40Z

Absorbed in #83 and #99.

SimonHeybrock added this to Development Board Feb 15, 2024

github-project-automation bot moved this to Triage in Development Board Feb 15, 2024

SimonHeybrock added this to the Essentials milestone Feb 19, 2024

SimonHeybrock mentioned this issue Feb 26, 2024

Load monitors separately from the remainder of the files #99

Closed

SimonHeybrock closed this as completed Feb 26, 2024

github-project-automation bot moved this from Triage to Done in Development Board Feb 26, 2024

SimonHeybrock removed this from the Essentials milestone Feb 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split loading of monitors and detector data #82

Split loading of monitors and detector data #82

SimonHeybrock commented Feb 12, 2024 •

edited

Loading

nvaytet commented Feb 12, 2024

SimonHeybrock commented Feb 12, 2024 •

edited

Loading

SimonHeybrock commented Feb 15, 2024 •

edited

Loading

SimonHeybrock commented Feb 26, 2024 •

edited

Loading

Split loading of monitors and detector data #82

Split loading of monitors and detector data #82

Comments

SimonHeybrock commented Feb 12, 2024 • edited Loading

nvaytet commented Feb 12, 2024

SimonHeybrock commented Feb 12, 2024 • edited Loading

SimonHeybrock commented Feb 15, 2024 • edited Loading

SimonHeybrock commented Feb 26, 2024 • edited Loading

SimonHeybrock commented Feb 12, 2024 •

edited

Loading

SimonHeybrock commented Feb 12, 2024 •

edited

Loading

SimonHeybrock commented Feb 15, 2024 •

edited

Loading

SimonHeybrock commented Feb 26, 2024 •

edited

Loading