-
Notifications
You must be signed in to change notification settings - Fork 754
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
new module: FASTQDL #7506
base: master
Are you sure you want to change the base?
new module: FASTQDL #7506
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is generally fine, but the linting doesn't currently cope with using pip. I'm not sure when this will be resolved...
See here for the tools linting issue |
Co-authored-by: Simon Pearce <[email protected]>
One for the March hackathon! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few comments for the implementation of this module :)
""" | ||
|
||
mkdir ${prefix} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this needs to be a bit more specific:
echo "" | gzip > ${accession}.fastq.gz | |
echo "" | gzip > ${accession}_1.fastq.gz | |
echo "" | gzip > ${accession}_2.fastq.gz | |
touch ${prefix}-run-info.tsv |
tuple val(meta), val(accession) | ||
|
||
output: | ||
tuple val(meta), path("test"), emit: db |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tuple val(meta), path("test"), emit: db | |
tuple val(meta), path("*.fastq.gz"), emit: fastq | |
tuple val(meta), path("*-run-info.tsv"), emit: runinfo | |
tuple val(meta), path("*-run-mergers.tsv"), emit: runmergers, optional: true |
I would update this to emit each file that we expect (might be a bit complicated now but I think this will makle handling everything in the pipeline easier).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After changing this here we also need to adjust the meta.yml :)
$args \\ | ||
--accession $accession \\ | ||
--cpus $task.cpus \\ | ||
--outdir ${prefix} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
--outdir ${prefix} | |
--outdir . |
def prefix = task.ext.prefix ?: "${meta.id}" | ||
""" | ||
|
||
mkdir ${prefix} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
mkdir ${prefix} |
def prefix = task.ext.prefix ?: "${meta.id}" | ||
""" | ||
fastq-dl \\ | ||
$args \\ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
$args \\ | |
$args \\ | |
-- prefix ${prefix} \\ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The run you used belongs to Experiment ERX4876079
BioSample: SAMEA7787147
Study / Project: PRJEB37886
But I downloading a whole project or Sample will definetly be too many files :D
Maybe we can also test to use the Experiment Accession and then put a comment in the module that this is only tested for Runs & Experiments
(E|D|S)RR[0-9]{6,}: Run accession E = Ena, S = NCBI, D= DDBJ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The run you used belongs to Experiment ERX4876079
BioSample: SAMEA7787147
Study / Project: PRJEB37886
But I downloading a whole project or Sample will definetly be too many files :D
Maybe we can also test to use the Experiment Accession and then put a comment in the module that this is only tested for Runs & Experiments
PR checklist
Closes #7505 by adding new module: FASTQDL
versions.yml
file.label
nf-core modules test <MODULE> --profile docker
nf-core modules test <MODULE> --profile singularity
nf-core modules test <MODULE> --profile conda
nf-core subworkflows test <SUBWORKFLOW> --profile docker
nf-core subworkflows test <SUBWORKFLOW> --profile singularity
nf-core subworkflows test <SUBWORKFLOW> --profile conda