Skip to content

New module: modkit/extract/calls#11286

Open
sahuno wants to merge 1 commit intonf-core:masterfrom
sahuno:add-modkit-extract-calls
Open

New module: modkit/extract/calls#11286
sahuno wants to merge 1 commit intonf-core:masterfrom
sahuno:add-modkit-extract-calls

Conversation

@sahuno
Copy link
Copy Markdown

@sahuno sahuno commented Apr 24, 2026

PR checklist

  • This comment contains a description of changes (with reason).
  • Stub + real tests added.
  • New tool follows module conventions.
  • Versions broadcast via `topic: versions`.
  • Naming/parameter/I-O conventions followed.
  • Resource label set (`process_high`).
  • BioConda + BioContainers used.
  • `nf-core modules lint modkit/extract/calls` — 49/0/0.
  • `nf-test test --profile conda` — 2/2 passed.

Summary

Adds a new nf-core module wrapping `modkit extract calls`, which produces a per-read per-position table of base-modification calls (pass / fail / filtered, with the called base) using the same thresholding algorithm as `modkit pileup`.

Complementary to `modkit/extract/full`: `extract calls` emits the thresholded categorical decision per site per read, while `extract full` emits the underlying probabilities.

The module auto-detects `--bgzf` in `ext.args` and adjusts the output filename suffix accordingly.

Why

`modkit extract calls` is the go-to tool for per-read allele-specific methylation, methylation-aware phasing validation, and read-level QC where you want the same thresholded labels as the pileup output but at read level rather than site level.

Test data

Uses the existing `test.sorted.phased.bam` from nf-core/test-datasets (modules branch). No new test data required.

🤖 Generated with Claude Code

Add new nf-core module wrapping `modkit extract calls`, which emits a
per-read per-position table of base modification calls using the same
pass/fail thresholding as `modkit pileup`. Complementary to
`modkit/extract/full` (raw probabilities): this module emits the
thresholded categorical decisions. Useful for per-read downstream
analysis such as allele-specific methylation and methylation-aware
phasing.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@sahuno sahuno force-pushed the add-modkit-extract-calls branch from 5035e89 to fafc527 Compare April 24, 2026 02:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant