-
Notifications
You must be signed in to change notification settings - Fork 45
Pipeline fails with "Cannot open file x.tsv for writing. Too many open files" #158
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @imnuvi It appears that in This is strange because the tags files should not contain any transcript info at this point Is it possible that you have have supplied a transcriptome sequence instead of a genome sequence? What is the content of the following file?
|
Hi @nrhorner, Thanks for your reply! Attaching the first few lines from <refgenome_dir>/fasta/genome.fa `>ENST00000448914.1 cdna chromosome:GRCh38:14:22449113:22449125:1 gene:ENSG00000228985.1 gene_biotype:TR_D_gene transcript_biotype:TR_D_gene gene_symbol:TRDD3 description:T cell receptor delta diversity 3 [Source:HGNC Symbol;Acc:HGNC:12256]
|
Hi @imnuvi This is the issue. That file should be a genomic DNA sequence. |
Hi @nrhorner, I ran the pipeline with full genome sequence and all steps have run except the process matrix step. Attaching the error below
|
Hi @imnuvi please see the troubleshooting section in the Readme that relates to this. Thanks |
@imnuvi In v3.1.0 the numba cache is set to the process directory, which should fix this issue |
@imnuvi Did you get round to trying out the new version? |
Hi @nrhorner I haven't tried it out yet. Will try it out and update. |
Operating System
Other Linux (please specify below)
Other Linux
Red Hat Enterprise Linux 8.8 (Ootpa)
Workflow Version
v3.0.0
Workflow Execution
Command line (Cluster)
Other workflow execution
No response
EPI2ME Version
No response
CLI command run
nextflow run software_packages/wf-single-cell
--expected_cells 10000
--fastq gene_expression/library_2_reads_merged
--kit 3prime:v3
--ref_genome_dir data/RefGenome/
-profile standard
--out_dir runs/run5
Workflow Execution - CLI Execution Profile
singularity
What happened?
Setup:
kit: 3prime:v3
executor: singularity
environment: computing cluster
RAM: 128GB
cores: 12
Current Behaviour
Tried running the epi2me single cell workflow on our sequencing data and the pipeline fails at the cat_tags_by_chrom step. This command uses awk which seems to open a lot of files and exceeds the system limit for files.
Expected Behaviour
The pipeline runs fine without any issues and gives out the results matrix
Relevant log output
Application activity log entry
Were you able to successfully run the latest version of the workflow with the demo data?
yes
Other demo data information
The text was updated successfully, but these errors were encountered: