fix: time conversion for efficiency reports with jobs taking longer than 1 day #362

cmeesters · 2025-10-06T10:04:41Z

based on PR #360 and issue #361:

The current time conversion for the efficiency report does not consider long jobs, where sacct output may be in the D-HH:MM:SS format. Also, the conversion of fractional seconds was not taken into account. While the latter is actually ridiculously playing with accuracy, the HPC world sometimes is concerned about it.

Included test cases.

This PR is based upon #360 and might replace it.

Summary by CodeRabbit

Bug Fixes
- Improved SLURM time parsing in efficiency reports to robustly handle many sacct formats (days, hours, minutes, seconds), fractional seconds, varied HH:MM:SS variants and whitespace; treats invalid/empty/NA inputs as 0 to improve runtime and CPU metric accuracy.
Tests
- Added comprehensive tests validating diverse sacct time formats, fractional seconds, whitespace and NA handling, and integration with existing parsing checks.

… Cheshakoneko [email protected]

coderabbitai · 2025-10-06T10:04:49Z

Walkthrough

Replaces manual colon-splitting with iterative datetime-format parsing in time_to_seconds to handle SLURM sacct variants (days, HH:MM:SS, MM:SS, SS, fractional seconds), trimming whitespace and returning 0 for empty/NA/invalid inputs; adds tests covering these formats and edge cases.

Changes

Cohort / File(s)	Summary
Time parsing utility `snakemake_executor_plugin_slurm/efficiency_report.py`	Rewrote `time_to_seconds` to attempt multiple datetime formats (including `D-HH:MM:SS(.ffffff)`, `HH:MM:SS(.ffffff)`, `MM:SS(.ffffff)`, `SS(.ffffff)`), handle fractional seconds and day components, trim whitespace, return 0 for empty/NaN/invalid inputs; added `datetime` import and expanded docstring.
Tests `tests/tests.py`	Added `TestTimeToSeconds` test suite exercising Elapsed and TotalCPU-style SLURM sacct formats (days, hours, minutes, seconds, fractional seconds), whitespace handling, pandas NA values, and invalid inputs; imports updated to include `time_to_seconds`.

Sequence Diagram(s)

sequenceDiagram
  participant Test as tests/tests.py
  participant Eff as efficiency_report.time_to_seconds
  participant Parser as datetime.strptime

  Test->>Eff: call time_to_seconds(input_str)
  Eff->>Eff: trim & normalize input
  alt input empty / NA / invalid
    Eff-->>Test: return 0
  else
    loop try candidate formats
      Eff->>Parser: attempt parse with format (D-HH:MM:SS(.ffffff) / HH:MM:SS(.ffffff) / MM:SS(.ffffff) / SS(.ffffff))
      Parser-->>Eff: parsed components or error
    end
    alt parsed
      Eff->>Eff: compute total_seconds = days*86400 + h*3600 + m*60 + s + fractional
      Eff-->>Test: return total_seconds
    else
      Eff-->>Test: return 0
    end
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related issues

Efficiency report crashes when time is returned in D-HH:MM:SS format #361 — Implements robust parsing for D‑HH:MM:SS and related SLURM sacct formats, addressing the parsing gaps described.

Possibly related PRs

feat: measuring compute efficiency per job #221 — Modifies the same time_to_seconds implementation and adds tests for sacct time parsing; directly related.

Suggested reviewers

dlaehnemann
johanneskoester

Poem

I nibble on colons, seconds, and days,
Counting each tick in precise little ways.
Fractions? Delicious! NA? I hop past—
From SS to D-HH:MM:SS, I parse fast.
Tests like clover in morning light, I play—🐇✨

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The title succinctly describes that the pull request fixes time conversion logic for efficiency reports when jobs exceed one day, which directly matches the changes to handle D-HH:MM:SS formats and fractional seconds.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch cpuTime_fix

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

tests/tests.py (1)
156-162: Simplify NA test; avoid pd.NaType()

pd.NaType isn’t part of the public API and this branch is redundant with the prior pd.NA check.
-        assert (
-            time_to_seconds(pd.NaType()) == 0 if hasattr(pd, "NaType") else True
-        )  # Skip if not available
+        # Already covered by the pd.NA assertion above; no extra check needed.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 104d0f7 and 10ec1b2.

📒 Files selected for processing (2)

snakemake_executor_plugin_slurm/efficiency_report.py (1 hunks)
tests/tests.py (2 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-06-24T12:41:31.837Z

Learnt from: cmeesters
PR: snakemake/snakemake-executor-plugin-slurm#221
File: tests/tests.py:62-70
Timestamp: 2025-06-24T12:41:31.837Z
Learning: In the snakemake-executor-plugin-slurm efficiency report feature, the executor is responsible for creating the efficiency_report_path directory when generating efficiency reports. Tests should not manually check for directory existence but should rely on the executor to properly create necessary directories as part of its normal operation.

Applied to files:

tests/tests.py

🧬 Code graph analysis (1)

tests/tests.py (1)

snakemake_executor_plugin_slurm/efficiency_report.py (2)

parse_sacct_data (115-221)

time_to_seconds (11-59)

🪛 Ruff (0.13.3)

snakemake_executor_plugin_slurm/efficiency_report.py

57-57: Consider moving this statement to an else block

(TRY300)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: testing

🔇 Additional comments (3)

snakemake_executor_plugin_slurm/efficiency_report.py (1)

6-9: Imports look good

datetime and os are used; fine.

tests/tests.py (2)

12-15: Good import additions

Brings time_to_seconds into scope; pandas used in tests.

66-176: Excellent coverage for sacct time formats

Covers days, HH:MM:SS, MM:SS, seconds-only, fractional seconds, whitespace, invalid/NA inputs, and large-day edge cases. These assert the intended behavior and will pass with a parser that doesn’t rely on datetime limits.

Please re-run the suite after adopting the proposed parser to ensure all new cases pass.

snakemake_executor_plugin_slurm/efficiency_report.py

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

tests/tests.py (1)
165-167: Simplify the conditional logic.

The conditional expression is unnecessarily complex. pd.NaType() creates an instance of pd.NA's type, which is redundant with the test on line 164 that already tests pd.NA.

Consider simplifying to:
-        assert (
-            time_to_seconds(pd.NaType()) == 0 if hasattr(pd, "NaType") else True
-        )  # Skip if not available
+        # pd.NA is already tested above; NaType() creates same type instance
Or if you want to keep the test, make the logic clearer:
-        assert (
-            time_to_seconds(pd.NaType()) == 0 if hasattr(pd, "NaType") else True
-        )  # Skip if not available
+        if hasattr(pd, "NaType"):
+            assert time_to_seconds(pd.NaType()) == 0

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 10ec1b2 and 990ffc9.

📒 Files selected for processing (1)

tests/tests.py (2 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-06-24T12:41:31.837Z

Learnt from: cmeesters
PR: snakemake/snakemake-executor-plugin-slurm#221
File: tests/tests.py:62-70
Timestamp: 2025-06-24T12:41:31.837Z
Learning: In the snakemake-executor-plugin-slurm efficiency report feature, the executor is responsible for creating the efficiency_report_path directory when generating efficiency reports. Tests should not manually check for directory existence but should rely on the executor to properly create necessary directories as part of its normal operation.

Applied to files:

tests/tests.py

🧬 Code graph analysis (1)

tests/tests.py (1)

snakemake_executor_plugin_slurm/efficiency_report.py (2)

parse_sacct_data (115-221)

time_to_seconds (11-59)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: testing

🔇 Additional comments (4)

tests/tests.py (4)

12-15: LGTM!

The new imports are well-organized and necessary for the comprehensive testing of the time_to_seconds function and pandas NA value handling.

Also applies to: 19-19

66-181: Excellent comprehensive test coverage.

The TestTimeToSeconds class provides thorough coverage of SLURM sacct time formats including:

Multiple day formats (single/double digit)

Hours, minutes, seconds in various combinations

Fractional seconds for TotalCPU format

Edge cases (empty strings, None, pandas NA, invalid inputs)

Whitespace handling

Real-world examples from actual sacct output

The test structure is well-organized with clear method names and helpful docstrings.

175-176: datetime.strptime supports non-zero-padded times
Verified that both "1:2:3" and "01:02:03" parse correctly with "%H:%M:%S", so no changes needed.

106-107: Import error: tests cannot run due to missing snakemake module. Please install snakemake or adjust PYTHONPATH to include the plugin’s package, then rerun pytest tests/tests.py::TestTimeToSeconds::test_totalcpu_format_with_days to confirm whether the "1-30:45" format is supported.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

snakemake_executor_plugin_slurm/efficiency_report.py (1)
35-46: Remove duplicate format entry.

Line 43 duplicates line 41: both are "%M:%S.%f". Python's strptime handles both zero-padded ("01:30.5") and non-padded ("1:30.5") inputs with the same format string, so the duplicate entry is unnecessary.

Apply this diff to remove the redundant line:
         "%M:%S.%f",  # MM:SS.ffffff (with fractional seconds)
         "%M:%S",  # MM:SS
-        "%M:%S.%f",  # M:SS.ffffff (with fractional seconds)
         "%S.%f",  # SS.ffffff (with fractional seconds)

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 990ffc9 and 959c9d5.

📒 Files selected for processing (2)

snakemake_executor_plugin_slurm/efficiency_report.py (1 hunks)
tests/tests.py (2 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-06-24T12:41:31.837Z

Learnt from: cmeesters
PR: snakemake/snakemake-executor-plugin-slurm#221
File: tests/tests.py:62-70
Timestamp: 2025-06-24T12:41:31.837Z
Learning: In the snakemake-executor-plugin-slurm efficiency report feature, the executor is responsible for creating the efficiency_report_path directory when generating efficiency reports. Tests should not manually check for directory existence but should rely on the executor to properly create necessary directories as part of its normal operation.

Applied to files:

tests/tests.py

🧬 Code graph analysis (1)

tests/tests.py (1)

snakemake_executor_plugin_slurm/efficiency_report.py (2)

parse_sacct_data (122-228)

time_to_seconds (11-66)

🪛 Ruff (0.13.3)

snakemake_executor_plugin_slurm/efficiency_report.py

63-63: Consider moving this statement to an else block

(TRY300)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: testing

🔇 Additional comments (4)

tests/tests.py (2)

12-19: LGTM! Imports support the new test suite.

The additions of time_to_seconds and pandas imports are appropriate for the comprehensive testing of SLURM time format parsing.

66-175: Excellent test coverage for SLURM time parsing.

The test suite comprehensively covers:

Day formats (D-HH:MM:SS, DD-HH:MM:SS) for jobs longer than 1 day

Fractional seconds for TotalCPU accuracy

Various HH:MM:SS, MM:SS, and seconds-only formats

Real-world sacct examples from actual output

Edge cases (empty, NA, invalid, whitespace)

This thoroughly validates the fix for issue #361 and addresses the PR objectives.

snakemake_executor_plugin_slurm/efficiency_report.py (2)

6-8: LGTM! Import additions support the new implementation.

The datetime import enables robust time parsing with strptime, and os supports the debug logging at line 260.

11-66: Robust implementation correctly handles SLURM time formats.

The datetime-based parsing approach successfully addresses the PR objectives:

Handles jobs longer than 1 day (D-HH:MM:SS format)

Accurately parses fractional seconds (TotalCPU)

Falls back to 0 for invalid inputs

Supports various format combinations (days, hours, minutes, seconds)

The implementation is appropriate for the HPC use case where jobs run ≤31 days (per maintainer's confirmation in past review). The comprehensive test suite in tests/tests.py validates correctness across all supported formats.

Note: Ruff flags TRY300 (return in try block) as a style preference. This is informational only and was acknowledged in the past review.

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 959c9d5 and 69e8b2e.

📒 Files selected for processing (2)

snakemake_executor_plugin_slurm/efficiency_report.py (1 hunks)
tests/tests.py (2 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

tests/tests.py

🧰 Additional context used

🪛 Ruff (0.13.3)

snakemake_executor_plugin_slurm/efficiency_report.py

64-64: Consider moving this statement to an else block

(TRY300)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: testing

🔇 Additional comments (4)

snakemake_executor_plugin_slurm/efficiency_report.py (4)

6-6: LGTM! Necessary imports for the updated parsing logic.

The datetime import is required for the new strptime-based time parsing, and the os import repositioning maintains clean organization.

Also applies to: 8-8

11-24: LGTM! Comprehensive documentation with clear examples.

The expanded docstring effectively documents the SLURM time format variants and provides accurate examples that clarify the expected behavior.

25-30: LGTM! Robust input validation.

The validation correctly handles NA values, empty strings, and explicitly invalid strings, returning 0 as a safe default.

49-67: LGTM! Correct parsing logic for SLURM time formats.

The implementation correctly:

Tries multiple formats in appropriate order

Computes total seconds including fractional seconds

Handles day-prefixed formats by adding day * 86400

Returns 0 as fallback for unparseable inputs

Note: Ruff flags TRY300 (return inside try) as a style preference, but this was previously discussed and acknowledged as acceptable.

snakemake_executor_plugin_slurm/efficiency_report.py

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

snakemake_executor_plugin_slurm/efficiency_report.py (1)
11-67: Excellent implementation of SLURM time format parsing!

The datetime-based approach correctly handles all documented SLURM sacct variants (days, hours, minutes, seconds, fractional seconds). Logic verified for the docstring examples:

"1-12:30:45" → 131445s ✓

"23:59:59" → 86399s ✓

"45:30" → 2730s ✓

"30.5" → 30.5s ✓

Input validation (NA, empty, "invalid") and fallback to 0 for unparseable inputs are appropriate.

Optional: Address Ruff TRY300 style suggestion.

The return statement at line 63 inside the try block can be moved to an else block for cleaner exception handling. This is a style preference, not a correctness issue:
 for fmt in time_formats:
     try:
         time_obj = datetime.strptime(time_str, fmt)
+    except ValueError:
+        continue
+    else:
         total_seconds = (
             time_obj.hour * 3600
             + time_obj.minute * 60
             + time_obj.second
             + time_obj.microsecond / 1000000
         )
 
-        # Add days if present (datetime treats day 1 as the first day)
+        # Add days if present
         if fmt.startswith("%d-"):
             total_seconds += time_obj.day * 86400
 
         return total_seconds
-    except ValueError:
-        continue
 return 0
Minor clarity improvement: The comment "datetime treats day 1 as the first day" could be simplified to just "Add days if present" since the code correctly multiplies day value by 86400 regardless of datetime's internal month-day semantics.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4850016 and 1af12a3.

📒 Files selected for processing (1)

snakemake_executor_plugin_slurm/efficiency_report.py (1 hunks)

🧰 Additional context used

🪛 Ruff (0.13.3)

snakemake_executor_plugin_slurm/efficiency_report.py

63-63: Consider moving this statement to an else block

(TRY300)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: testing

🔇 Additional comments (1)

snakemake_executor_plugin_slurm/efficiency_report.py (1)

6-8: LGTM!

The datetime import is correctly added for the enhanced time parsing, and the import reordering has no functional impact.

FelixKnopp13 and others added 5 commits October 1, 2025 17:27

fix: time_to_seconds to include day format D-HH:MM:SS Co-authored-by:…

1f99ac6

… Cheshakoneko [email protected]

fix: formatting

ae7ead0

fix: removed unnecessary os import

6264212

feat: moved time conversion to python's datetime lib

320e55d

tests: added robust testing for time conversions

34aebed

fix: formatting

10ec1b2

cmeesters mentioned this pull request Oct 6, 2025

fix: time_to_seconds to include day format D-HH:MM:SS #360

Open

fix: formatting

990ffc9

coderabbitai bot reviewed Oct 6, 2025

View reviewed changes

snakemake_executor_plugin_slurm/efficiency_report.py Show resolved Hide resolved

coderabbitai bot reviewed Oct 6, 2025

View reviewed changes

cmeesters added 2 commits October 7, 2025 09:11

fix: test cases

e02cdf0

fix: additional time strings to parse - upon failure return 0

959c9d5

coderabbitai bot reviewed Oct 7, 2025

View reviewed changes

cmeesters added 2 commits October 7, 2025 09:50

fix: removed riduculous test case

0e24c41

fix: added another time format to parse

69e8b2e

coderabbitai bot reviewed Oct 7, 2025

View reviewed changes

snakemake_executor_plugin_slurm/efficiency_report.py Show resolved Hide resolved

cmeesters and others added 2 commits October 7, 2025 13:30

Update snakemake_executor_plugin_slurm/efficiency_report.py

4850016

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

fix: formatting

1af12a3

coderabbitai bot reviewed Oct 7, 2025

View reviewed changes

cmeesters requested a review from johanneskoester October 7, 2025 13:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: time conversion for efficiency reports with jobs taking longer than 1 day #362

fix: time conversion for efficiency reports with jobs taking longer than 1 day #362

Uh oh!

cmeesters commented Oct 6, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Oct 6, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: time conversion for efficiency reports with jobs taking longer than 1 day #362

Are you sure you want to change the base?

fix: time conversion for efficiency reports with jobs taking longer than 1 day #362

Uh oh!

Conversation

cmeesters commented Oct 6, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cmeesters commented Oct 6, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 6, 2025 •

edited

Loading