-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dorado demux can't read custom barcodes fasta file, failed to extract sequences #1250
Comments
Hi @PhilliVanilli, There appears to be a typo in the docs - the custom barcode file must be a FASTA format. This is correctly shown in the example, but the format name has been mis-stated. |
Hi @malton-ont, Thanks so much for your quick reply. The custom barcode file is in fasta format (it is just the name that says fastq), but I've also tried with other barcode files (fasta in name and format, that's why the dorado cmd above says fasta) that work on other people's computer but somehow not on mine. To me there seems to be something deeper going on with v0.9 as the same files and command work for dorado 0.6/0.7/0.8 but suddenly stop working for 0.9. I feel it has something to do with permissions or something else on our computers (they are not managed by our institute but personal computers) but not sure what (even with chmod -R 777 it doesn't work). I'm a bit afraid that otherwise we will get stuck with dorado 0.8 forever :-) Thanks |
Dorado changed its custom file parsing between 0.8.3 and 0.9.0, so I suspect there's something in there that is triggering this - the new parser is possibly stricter than the old one. Things to check regarding your sequences file:
You should also check that the sequences file doesn't contain Windows-style line-endings since you're running on Linux. |
The file is in fasta format and works for all three previous dorados which also required fasta and I tried a barcode file from another computer where dorado 0.9 accepts this barcode file. So the issue can't be related to the actual file. The file has never been on a windows computer and I checked the line endings cause that's indeed often an issue |
As I said, the fasta validation has been tightened up in 0.9.0. Are you able to share the file? If the same file works on another machine with the same version of dorado, then this isn't a dorado issue - it would have to be something in your system, and you will need to talk to your IT support. |
Issue Report
Please describe the issue:
Please provide a clear and concise description of the issue you are seeing and the result you expect.
Dorado demux 0.9 can't access my custom barcode file, it outputs 'failed to extract sequences' though dorado returns the same error if you have a typo in the filename so I assume Dorado can't find it. If i use dorado 0.6 or 0.7 with same command it works perfectly
Dorado basecaller also works fine. Tried a toml and fasta file with same barcodes from a friend's computer where dorado demux works, but same error. So nothing wrong with command or with the custom files, nor with the paths. I tried dorado 0.9.1 but same issue. Can it be related with my system, ie during unpacking it makes links to certain files on my system? But I tried unpacking on our server (ubuntu 18) and on another Ubuntu 22 laptop and had the same issue.
All folders and files have rwx permissions
error message:
[2025-02-05 12:21:03.514] [info] Running: "demux" "--kit-name" "CUST" "-o" "/mnt/Data_SSD/test/demultiplexed" "--emit-fastq" "--barcode-both-ends" "--barcode-arrangement" "/home/pselhorst/metatropics_dev/dorado-0.9.0-linux-x64/bin/barcode_arrs_cust_dorado.toml" "--barcode-sequences" "/home/pselhorst/metatropics_dev/dorado-0.9.0-linux-x64/bin/barcodes_cust.fastq" "/mnt/Data_SSD/test/fastq"[2025-02-05 12:21:03.514] [info] num input files: 1[2025-02-05 12:21:03.528] [error] Failed to extract sequences from '/home/pselhorst/metatropics_dev/dorado-0.9.0-linux-x64/bin/barcodes_cust.fastq'.
Steps to reproduce the issue:
Please list any steps to reproduce the issue.
download the tar.gz file, unpack, dorado download --model, try and demux fastq file
Run environment:
Logs
The text was updated successfully, but these errors were encountered: