Skip to content

MultiOmeGEX on GCP=> no space left on device #456

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
nick-youngblut opened this issue Mar 25, 2025 · 14 comments
Open

MultiOmeGEX on GCP=> no space left on device #456

nick-youngblut opened this issue Mar 25, 2025 · 14 comments
Labels
bug Something isn't working

Comments

@nick-youngblut
Copy link
Contributor

nick-youngblut commented Mar 25, 2025

Description of the bug

At least for --aligner cellranger --protocol ARC-v1, the CellRanger job runs out of temp file space:

/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device

The job resources need to be modified to provide more disk space, at least for retried of the job.

Command used and terminal output

nextflow run main.nf \
    -profile docker \
    -process.executor "google-batch" \
    -work-dir gs://arc-genomics-nextflow/scrnaseq/work \
    --input ${INPUT_SAMPLES_FILE} \
    --outdir ${OUTPUT_DIR} \
    --cellranger_index ${REF_DATA_DIR} \
    --aligner cellranger \
    --protocol ARC-v1

Relevant files

The job log:

Martian Runtime - v4.0.12
Serving UI at http://f2b851f78adc:33617?auth=hqdPQDbJCmabH5DhlowPAsWLN8vUqNi9Jhi-Qu7f_CU

Running preflight checks (please wait)...
Checking sample info...
Checking FASTQ folder...
Checking reference...
Checking reference_path (/mnt/disks/arc-genomics-references/cellranger/refdata-gex-GRCh38-2024-A) on f2b851f78adc...
Checking optional arguments...
mro: v4.0.12
mrp: v4.0.12
Anaconda: Python 3.10.11
numpy: 1.24.3
scipy: 1.10.1
pysam: 0.21.0
h5py: 3.6.0
pandas: 2.1.4
STAR: 2.7.2a
samtools: samtools 1.16.1
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.WRITE_GENE_INDEX
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.WRITE_GENE_INDEX.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.CHECK_SINGLE_BEAM_MODE
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.SPLIT_VDJ_INPUTS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.FULL_COUNT_INPUTS.WRITE_GENE_INDEX
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.FULL_COUNT_INPUTS.WRITE_GENE_INDEX.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MAKE_FULL_CONFIG._MAKE_VDJ_CONFIG
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MAKE_FULL_CONFIG._MAKE_VDJ_CONFIG.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.COPY_CHEMISTRY_SPEC
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.COPY_CHEMISTRY_SPEC.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.DISABLE_CORRECTION_FACTOR
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.DISABLE_CORRECTION_FACTOR.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.COPY_VDJ_REFERENCE
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.SANITIZE_MAP_CALLS
2025-03-25 22:27:44 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.COPY_CHEMISTRY_SPEC
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.DETECT_COUNT_CHEMISTRY
2025-03-25 22:27:44 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.DETECT_COUNT_CHEMISTRY.fork0.chnk0.main
2025-03-25 22:27:44 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MAKE_FULL_CONFIG._MAKE_VDJ_CONFIG
2025-03-25 22:27:44 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.DISABLE_CORRECTION_FACTOR
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.COMPUTE_CORRECTION_FACTOR
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SETUP_VDJ_DEMUX
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SETUP_VDJ_DEMUX
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SETUP_VDJ_DEMUX
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MAKE_SHARD
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MAKE_SHARD
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MAKE_SHARD
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.BARCODE_CORRECTION
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.BARCODE_CORRECTION
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.RUST_BRIDGE
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.BARCODE_CORRECTION
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.RUST_BRIDGE
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.ASSEMBLE_VDJ
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.ASSEMBLE_VDJ
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.RUST_BRIDGE
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MERGE_METRICS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_B_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MERGE_METRICS
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.ASSEMBLE_VDJ
2025-03-25 22:27:44 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.VDJ_T_GD_GEM_WELL_PROCESSOR.SC_VDJ_CONTIG_ASSEMBLER.MERGE_METRICS
2025-03-25 22:27:48 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.DETECT_COUNT_CHEMISTRY
2025-03-25 22:27:48 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.CREATE_MULTI_GRAPH
2025-03-25 22:27:48 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.CHECK_BARCODES_COMPATIBILITY
2025-03-25 22:27:48 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.CHECK_BARCODES_COMPATIBILITY.fork0.chnk0.main
2025-03-25 22:27:48 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS
2025-03-25 22:27:48 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS.fork0.chnk0.main
2025-03-25 22:27:48 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_CHEMISTRY_DETECTOR.CHECK_BARCODES_COMPATIBILITY
2025-03-25 22:27:48 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.MULTI_SETUP_CHUNKS
2025-03-25 22:28:20 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.WRITE_GENE_INDEX
2025-03-25 22:28:20 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.FULL_COUNT_INPUTS.WRITE_GENE_INDEX
2025-03-25 22:28:20 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.PARSE_TARGET_FEATURES
2025-03-25 22:28:20 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.PARSE_TARGET_FEATURES.fork0.chnk0.main
2025-03-25 22:28:20 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR.PARSE_TARGET_FEATURES
2025-03-25 22:28:20 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD
2025-03-25 22:28:20 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD.fork0.split
2025-03-25 22:28:20 [runtime] (ready)           ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.MAKE_VDJ_FILTER_SWITCH
2025-03-25 22:28:20 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.MAKE_VDJ_FILTER_SWITCH.fork0.chnk0.main
2025-03-25 22:28:20 [runtime] (chunks_complete) ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.MAKE_VDJ_FILTER_SWITCH
2025-03-25 22:28:27 [runtime] (split_complete)  ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD
2025-03-25 22:28:27 [runtime] (run:local)       ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD.fork0.chnk0.main
2025-03-25 22:34:30 [runtime] (update)          ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD.fork0 chunks_running
2025-03-25 22:40:28 [runtime] (update)          ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD.fork0 chunks_running
2025-03-25 22:46:28 [runtime] (update)          ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD.fork0 chunks_running
2025-03-25 22:49:58 [runtime] Pipestance directory out of disk space.
          /tmp/nxf.wjMnap2FOX/LB_brain13_8863_2_AMG_MultiOmeGEX/SC_RNA_COUNTER_CS has only 32452kB remaining space available.
To ignore this error, set MRO_DISK_SPACE_CHECK=disable in your environment.
2025-03-25 22:50:01 [runtime] (failed)          ID.LB_brain13_8863_2_AMG_MultiOmeGEX.SC_RNA_COUNTER_CS.SC_MULTI_CORE.MULTI_GEM_WELL_PROCESSOR.COUNT_GEM_WELL_PROCESSOR._BASIC_SC_RNA_COUNTER._MATRIX_COMPUTER.MAKE_SHARD

[error] Pipestance failed. Error log at:
LB_brain13_8863_2_AMG_MultiOmeGEX/SC_RNA_COUNTER_CS/SC_MULTI_CORE/MULTI_GEM_WELL_PROCESSOR/COUNT_GEM_WELL_PROCESSOR/_BASIC_SC_RNA_COUNTER/_MATRIX_COMPUTER/MAKE_SHARD/fork0/chnk0-u0063e32e0b/_errors

Log message:
/tmp/nxf.wjMnap2FOX/LB_brain13_8863_2_AMG_MultiOmeGEX/SC_RNA_COUNTER_CS has only 32452kB remaining space available.
To ignore this error, set MRO_DISK_SPACE_CHECK=disable in your environment.

Waiting 6 seconds for UI to do final refresh.
Pipestance failed. Use --noexit option to keep UI running after failure.

2025-03-25 22:50:07 Shutting down.
/opt/cellranger-8.0.0/bin/tenkit/tarmri: line 21: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 59: echo: write error: No space left on device
date: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 62: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 24: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 26: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 27: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 42: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 43: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 24: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 26: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 27: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 42: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 43: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 24: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 26: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 27: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 42: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 43: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 24: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 26: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 27: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 42: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 43: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 24: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 25: echo: write error: No space left on device
/opt/cellranger-8.0.0/bin/tenkit/sitecheck: line 26: echo: write error: No space left on de

System information

  • nextflow 24.10.5
  • scrnseq 4.0.0
@nick-youngblut nick-youngblut added the bug Something isn't working label Mar 25, 2025
@nick-youngblut
Copy link
Contributor Author

AFAIK, the disk directive would just increase the size of the cloud job's working directory, and not for the directories within the container image (e.g., /opt/cellranger-8.0.0/bin/tenkit), so that "simple fix" doesn't appear to be an option .

@grst
Copy link
Member

grst commented Mar 26, 2025

Huh, but the container isn't writable anyway, is it?
I think the error should refer to the local disk size of the VM.

@nick-youngblut
Copy link
Contributor Author

nick-youngblut commented Mar 26, 2025

Docker overlays a writable layer on top of the base file system. So, software can write to the install directory e.g., /opt/cellranger-8.0.0, unless file permissions explicitly prevent it.

Bind-mounting a larger volume to /opt/cellranger-8.0.0/tenkit would work. It would be much better if 10X just changed CellRanger so that it does not write to the install directory, which is a bad idea due to issues such as these.

@nick-youngblut
Copy link
Contributor Author

I'm surprised that no one has run into this issue as of yet. It is reproducible on my end. Maybe it only occurs when using --protocol ARC-v1?

Maybe updating to CellRanger v9 would help? v8 is nearly a year old now. I've contacted 10X about the issue and am waiting to hear back.

@grst
Copy link
Member

grst commented Mar 27, 2025

Bind-mounting a larger volume to /opt/cellranger-8.0.0/tenkit would work. It would be much better if 10X just changed CellRanger so that it does not write to the install directory, which is a bad idea due to issues such as these.

I don't think it writes to the install directory. It certainly shouldn't. Maybe docker by default creates a writable overlay file system, singularity certainly does not unless you specify --writable-tmpfs, so it would fail the tests immediately if something got written to the container directory.

The error message says specifically

Log message:
/tmp/nxf.wjMnap2FOX/LB_brain13_8863_2_AMG_MultiOmeGEX/SC_RNA_COUNTER_CS has only 32452kB remaining space available.
To ignore this error, set MRO_DISK_SPACE_CHECK=disable in your environment.

So it's the /tmp partition that appears to run out of space.

So I think changing the disk directive should fix that and I believe the reason nobody ran into this so far could be that they either provision their cloud instances with a lot of disk space by default (e.g. ours have 3TB) or run on a HPC with a shared file system.

@nick-youngblut
Copy link
Contributor Author

Yeah, you're right. /opt/cellranger-8.0.0/bin/tenkit/sitecheck is just referring to the executable throwing the error. I should have more carefully considered the log.

So, this error shows that the cellranger_count module needs to be updated. I'm guessing that the disk directive should be added to at least provide a disk resource escalation during retries.

Maybe a good time to update the entire set of Cell Ranger modules to v9? I could try creating a PR, but getting a sign-off on an update to all Cell Ranger modules will likely be non-trivial for a new contributor.

@nick-youngblut
Copy link
Contributor Author

nick-youngblut commented Mar 28, 2025

It appears that adding disk to the process fixed the issue:

process CELLRANGER_COUNT {
    tag "$meta.id"
    label 'process_high'
    disk '200 GB'

I first tried 100 Gb, but that was not enough, at least for a sample with ~765 mil paired-end reads.

I have another sample that I have yet to process with ~3.5 bil paired-end reads, so I'll let you know how much disk space is needed in that case.

@nick-youngblut
Copy link
Contributor Author

In regards to scaling:

For larger datasets, NFCORE_SCRNASEQ:SCRNASEQ:H5AD_CONVERSION:CONCAT_H5AD fails due to a lack of memory on larger datasets. Changing label 'process_medium' to label 'process_high' or increasing default number of retries should allow scrnaseq to scale to larger datasets without errors.

@grst
Copy link
Member

grst commented Mar 31, 2025

Thanks for testing this!

I'll double check if there are any guidelines within nf-core about the disk directive. I don't think I've seen it a lot in the wild and I could imagine that it interferes with the fusion file system.

Regarding memory consumption, we could also look into the on-disk concatenation feature of AnnData. It is still experimental, though. For the time being, increasing memory sounds reasonable.

@grst
Copy link
Member

grst commented Mar 31, 2025

Ok, so apparently, there's nothing against using disk, see this slack discussion.

The point was made that ideally the disk size could be set dynamically based on the input size.

@nick-youngblut
Copy link
Contributor Author

Yeah, disk seems to fix the issue. However, I'm now running into #460, which has been quite hard to troubleshoot.

@nick-youngblut
Copy link
Contributor Author

disk '200 GB' isn't big enough for the ~3.5 bil sample. Dynamically setting the disk size based on the input file size would definitely be helpful.

@nick-youngblut
Copy link
Contributor Author

FYI: it appears that >800 Gb of disk space is needed when the sample includes ~3.5 billion reads. I'm still working on getting this job to actually finish successfully.

@nick-youngblut
Copy link
Contributor Author

The Cell Ranger job took ~21 hours and required somewhere between 0.8 and 1 Tb of disk space for the ~3.5 billion paired-end read input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants