T0 to produce skimmed RAW data through Repack Workflow #12298

LinaresToine · 2025-03-07T04:05:07Z

Status

Ready

Description

In order to support the new raw skim datasets, we need to allow two things:

The moduleLabel in Repack.py can't have - in its name, so we introduce the parameter parentDataset in Outputs. This way we can create a legal name for moduleLabel:

output['moduleLabel'] = "write_%s_RawSkim_%s_%s" % (output['parentDataset'],
                                                                                             output['rawSkim'],
                                                                                             output['dataTier'])

We then delete the new attribute similarly to what is done in setupProcessingTask.

The repack workflow needs to pass a global tag to CMSSW in order to use the desired trigger paths.

Is it backward compatible (if not, which system it affects?)

YES

Related PRs

T0 PR: dmwm/T0#5041
CMSSW PR:

External dependencies / deployment changes

NO

dmwm-bot · 2025-03-07T04:16:31Z

Jenkins results:

Python3 Unit tests: succeeded
- 1 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 5 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/449/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-07T08:28:35Z

Jenkins results:

Python3 Unit tests: failed
- 3 new failures
- 2 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 5 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/450/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-07T09:11:53Z

Jenkins results:

Python3 Unit tests: succeeded
- 2 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 5 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/451/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-07T10:12:10Z

Jenkins results:

Python3 Unit tests: succeeded
- 4 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 4 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/452/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-12T12:27:10Z

Jenkins results:

Python3 Unit tests: failed
- 3 new failures
- 1 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 4 comments to review
Pycodestyle check: succeeded
- 2 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/470/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-12T16:25:13Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 4 comments to review
Pycodestyle check: succeeded
- 2 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/474/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-12T16:51:42Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 3 warnings and errors that must be fixed
- 4 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/475/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-13T00:52:19Z

Jenkins results:

Python3 Unit tests: failed
- 2 new failures
- 3 changes in unstable tests
Python3 Pylint check: failed
- 10 warnings and errors that must be fixed
- 2 warnings
- 93 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/479/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-13T22:04:10Z

Jenkins results:

Python3 Unit tests: failed
- 144 new failures
- 13 changes in unstable tests
Python3 Pylint check: failed
- 14 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/488/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-13T22:41:04Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 11 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/489/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-13T22:49:55Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 11 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/490/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-14T11:27:34Z

Jenkins results:

Python3 Unit tests: succeeded
- 1 changes in unstable tests
Python3 Pylint check: failed
- 11 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/497/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-03-14T16:47:12Z

Jenkins results:

Python3 Unit tests: succeeded
- 1 changes in unstable tests
Python3 Pylint check: failed
- 11 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/498/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-07T15:40:59Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 11 warnings and errors that must be fixed
- 2 warnings
- 94 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/550/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-07T15:57:10Z

Jenkins results:

Python3 Unit tests: succeeded
- 2 changes in unstable tests
Python3 Pylint check: failed
- 10 warnings and errors that must be fixed
- 2 warnings
- 93 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/551/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T09:14:17Z

Jenkins results:

Python3 Unit tests: failed
- 1 new failures
- 2 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 126 comments to review
Pycodestyle check: succeeded
- 5 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/554/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T10:57:27Z

Jenkins results:

Python3 Unit tests: failed
- 2 new failures
- 1 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 125 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/556/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T12:22:14Z

Jenkins results:

Python3 Unit tests: failed
- 1 new failures
- 3 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 125 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/557/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T12:41:11Z

Jenkins results:

Python3 Unit tests: failed
- 1 new failures
- 3 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 126 comments to review
Pycodestyle check: succeeded

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/558/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T13:27:23Z

Jenkins results:

Python3 Unit tests: failed
- 1 new failures
- 1 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 127 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/559/artifact/artifacts/PullRequestReport.html

dmwm-bot · 2025-04-10T13:40:00Z

Jenkins results:

Python3 Unit tests: succeeded
- 3 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 127 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/560/artifact/artifacts/PullRequestReport.html

LinaresToine · 2025-04-10T16:52:16Z

Hello @anpicci @todor-ivanov I have modified the unit tests for the repack workflow, and seems like I got those to work. However I see many other tests that failed and I don't understand their relationship with the changes I made:

https://cmssdt.cern.ch/dmwm-jenkins/job/WMCore-PR-Report/560/#showFailuresLink

There I dont see any of the StdBase.py and any of the Repack.py tests failing. Could you please have a look and let me know what I can do to improve those test?

amaltaro

@LinaresToine the 2 failing unit tests are unstable, so there is nothing to worry with that.
For the Repack unit test, if there was no complain, that means the status of the unit test did not change (but to be on the safe side, perhaps we should ensure that it is succeeding in jenkins).

amaltaro · 2025-04-14T18:26:58Z

src/python/WMCore/WMSpec/StdSpecs/Repack.py

-                                                     output['dataTier'])
+            moduleLabel = "write_%s_%s" % (output['primaryDataset'],
+                                           output['dataTier'])
+            output['moduleLabel'] = moduleLabel.replace("-", "_")  # For T0 Raw Skims, PDs will contain a "-", so here we replace for "_" for the moduleLabel


Antonio, can you please move this comment to the line above instead of inline?

Being really honest, I fear that this change can cause us "hidden" problems in the future. Is there any strong reason not to create a PD named with underscore instead of a dash?

In addition, given that it is a skim dataset, why changing the primary dataset and not the processing string (what goes between primary dataset and datatier)?

Thank you for your comments @amaltaro. It all comes down to creating RAW data. The value of the Raw Skim datasets is that we are producing RAW data that can be Prompt Reconstructed if desired. This means that this feature takes effect in the Repack workflow. In short, we are creating two RAW outputs from the same Repack workflow, and we must distinguish between them.

Lets say then

/PD/Era-v1/RAW /PD/Era-RawSkim-v1/RAW

which has a few inconveniences:
a). It is not trivial at all for T0 to give the new output an independent prompt reco configuration. This has the additional limitation of sending both sets of RAW data to the same destinations, which is not wanted. Allowing us to configure all these details only for the skimmed RAW is really what makes this project possible. Treating it as a primary dataset gives us this freedom.
b). It does not save us from the module label problem, since those two outputs would have the same module label by definition.

May I please ask what your concern with the dash is?

I would like to add that we do produce PDs with a dash (the error PDs), and they are processed without a problem through the system (Tier-0/WMCore/Rucio/DBS). We don't think accepting dashes in the skimmed PDs would cause a problem.

I see. I guess the tape families is a good argument to make this change at the PD level instead of the PS.

My only concern is that this moduleLabel could be used downstream and no longer be consistent with the output module. However, as that naming conversion only happens at the Repack factory, the risk is much smaller.

Thank you for the follow up, it looks good to me.

LinaresToine · 2025-04-15T14:00:17Z

The repack tests were failing before my changes in the commit 0b888e3

Please see

However, it was not clear to me if my development required modification or addition of unit tests for the StdBase module. Existing tests were successful.

dmwm-bot · 2025-04-15T14:12:03Z

Jenkins results:

Python3 Unit tests: succeeded
- 1 changes in unstable tests
Python3 Pylint check: failed
- 12 warnings and errors that must be fixed
- 2 warnings
- 127 comments to review
Pycodestyle check: succeeded
- 1 comments to review

Details at https://cmssdt.cern.ch/dmwm-jenkins/view/All/job/WMCore-PR-Report/572/artifact/artifacts/PullRequestReport.html

amaltaro · 2025-04-15T20:27:45Z

These changes are looking good to me. Can you please squash these commits? See some information on this in "Step 10" at https://github.com/dmwm/WMCore/blob/master/CONTRIBUTING.rst#contributing

support raw skim

50d97a4

LinaresToine mentioned this pull request Mar 7, 2025

Repack raw skims cms-sw/cmssw#47525

Merged

LinaresToine force-pushed the raw-skim branch from a6933a4 to bc55861 Compare March 7, 2025 08:12

LinaresToine force-pushed the raw-skim branch from bc55861 to 0188efc Compare March 7, 2025 09:00

LinaresToine changed the title ~~Raw skim~~ T0 to produce skimmed RAW data through Repack Workflow Mar 7, 2025

support global tag for repack

eeaab93

LinaresToine force-pushed the raw-skim branch from 0188efc to eeaab93 Compare March 7, 2025 09:58

LinaresToine force-pushed the raw-skim branch from 021cc6e to dbdca6c Compare March 12, 2025 16:16

LinaresToine force-pushed the raw-skim branch from dbdca6c to 6d2adbb Compare March 12, 2025 16:39

LinaresToine force-pushed the raw-skim branch from 6d2adbb to 8185e51 Compare March 13, 2025 00:37

LinaresToine mentioned this pull request Mar 13, 2025

backport Raw Skims to CMSSW_14_1_X cms-sw/cmssw#47596

Merged

LinaresToine force-pushed the raw-skim branch from 8185e51 to d6967d9 Compare March 13, 2025 21:52

LinaresToine force-pushed the raw-skim branch 2 times, most recently from 98804d2 to bf3e6cf Compare March 13, 2025 22:40

LinaresToine force-pushed the raw-skim branch from bf3e6cf to 760cb95 Compare March 14, 2025 11:15

LinaresToine mentioned this pull request Mar 14, 2025

Allow for longer module labels in DBS dmwm/dbs2go#123

Open

LinaresToine mentioned this pull request Mar 19, 2025

Backport Raw Skims to CMSSW_15_0_X cms-sw/cmssw#47623

Merged

LinaresToine mentioned this pull request Mar 19, 2025

Backport Raw Skims CMSSW_14_2_X cms-sw/cmssw#47624

Merged

LinaresToine force-pushed the raw-skim branch from 4a52953 to 760cb95 Compare April 7, 2025 15:31

Using .replace

c8c9c58

LinaresToine force-pushed the raw-skim branch from 760cb95 to c8c9c58 Compare April 7, 2025 15:46

LinaresToine force-pushed the raw-skim branch from 8f02fd9 to 23b26a5 Compare April 10, 2025 10:44

LinaresToine force-pushed the raw-skim branch from 23b26a5 to 217e78d Compare April 10, 2025 12:10

LinaresToine force-pushed the raw-skim branch from 217e78d to 0808b7a Compare April 10, 2025 12:28

LinaresToine force-pushed the raw-skim branch from 0808b7a to 0282fd5 Compare April 10, 2025 13:13

including new feature in spec testing

0b888e3

LinaresToine force-pushed the raw-skim branch from 0282fd5 to 0b888e3 Compare April 10, 2025 13:30

amaltaro reviewed Apr 14, 2025

View reviewed changes

Update Repack.py

6956cf8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

T0 to produce skimmed RAW data through Repack Workflow #12298

T0 to produce skimmed RAW data through Repack Workflow #12298

LinaresToine commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 14, 2025

dmwm-bot commented Mar 14, 2025

dmwm-bot commented Apr 7, 2025

dmwm-bot commented Apr 7, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

LinaresToine commented Apr 10, 2025

amaltaro left a comment

amaltaro Apr 14, 2025

amaltaro Apr 14, 2025

LinaresToine Apr 15, 2025 •

edited

Loading

jeyserma Apr 15, 2025

amaltaro Apr 15, 2025

LinaresToine commented Apr 15, 2025 •

edited

Loading

dmwm-bot commented Apr 15, 2025

amaltaro commented Apr 15, 2025

T0 to produce skimmed RAW data through Repack Workflow #12298

Are you sure you want to change the base?

T0 to produce skimmed RAW data through Repack Workflow #12298

Conversation

LinaresToine commented Mar 7, 2025

Status

Description

Is it backward compatible (if not, which system it affects?)

Related PRs

External dependencies / deployment changes

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 7, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 12, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 13, 2025

dmwm-bot commented Mar 14, 2025

dmwm-bot commented Mar 14, 2025

dmwm-bot commented Apr 7, 2025

dmwm-bot commented Apr 7, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

dmwm-bot commented Apr 10, 2025

LinaresToine commented Apr 10, 2025

amaltaro left a comment

Choose a reason for hiding this comment

amaltaro Apr 14, 2025

Choose a reason for hiding this comment

amaltaro Apr 14, 2025

Choose a reason for hiding this comment

LinaresToine Apr 15, 2025 • edited Loading

Choose a reason for hiding this comment

jeyserma Apr 15, 2025

Choose a reason for hiding this comment

amaltaro Apr 15, 2025

Choose a reason for hiding this comment

LinaresToine commented Apr 15, 2025 • edited Loading

dmwm-bot commented Apr 15, 2025

amaltaro commented Apr 15, 2025

LinaresToine Apr 15, 2025 •

edited

Loading

LinaresToine commented Apr 15, 2025 •

edited

Loading