import of taxdump files #204

Open
nick-youngblut opened this issue Jan 3, 2020 · 3 comments

Comments

@nick-youngblut

As far as I can tell from reading the docs and the paper on taxa, the package cannot import directly from taxdump files. Direct import would be helpful for loading custom taxdump files created from non-NCBI taxonomies (e.g., taxdump files created from the GTDB taxonomy).

@zachary-foster
Collaborator

Hi @nick-youngblut, thanks for the idea, I will look into it!

@zachary-foster
Collaborator

After looking into this a bit, I am not sure which file type you mean. I found the following examples.

I see this on GTDB's website:

https://data.ace.uq.edu.au/public/gtdb/data/releases/latest/bac120_taxonomy.tsv

And these from NCBI:

https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/taxdump.tar.gz

https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/new_taxdump/new_taxdump.tar.gz

Is one of these the format you were talking about?

@nick-youngblut
Author

Here's the taxdump that I created from the GTDB: http://ftp.tue.mpg.de/ebio/projects/struo/GTDB_release89/taxdump/

The simple script used to create it can be found at: https://github.com/nick-youngblut/gtdb_to_taxdump
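
For context on what importing a taxdump involves, here is a minimal sketch of reading the two core files, `nodes.dmp` and `names.dmp`, in the NCBI taxdump layout (fields separated by `<tab>|<tab>`, each line ending in `<tab>|`). It assumes the GTDB-derived taxdump follows that same layout and that the root node lists itself as its own parent, as in NCBI's files. The function names (`parse_dmp_line`, `read_nodes`, `read_names`, `lineage`) are hypothetical and are not part of taxa; the sketch is in Python purely for illustration.

```python
def parse_dmp_line(line):
    """Split one taxdump line into its fields.

    Fields are separated by '\t|\t' and the line ends with '\t|'.
    """
    line = line.rstrip("\n")
    if line.endswith("\t|"):
        line = line[:-2]
    return [field.strip() for field in line.split("\t|\t")]


def read_nodes(handle):
    """Map tax_id -> (parent_tax_id, rank) from a nodes.dmp file handle."""
    nodes = {}
    for line in handle:
        if not line.strip():
            continue
        fields = parse_dmp_line(line)
        nodes[fields[0]] = (fields[1], fields[2])
    return nodes


def read_names(handle):
    """Map tax_id -> scientific name from a names.dmp file handle.

    names.dmp columns: tax_id, name, unique name, name class.
    Only rows whose name class is 'scientific name' are kept.
    """
    names = {}
    for line in handle:
        if not line.strip():
            continue
        fields = parse_dmp_line(line)
        if len(fields) >= 4 and fields[3] == "scientific name":
            names[fields[0]] = fields[1]
    return names


def lineage(tax_id, nodes, names):
    """Return the names from the root down to tax_id.

    Assumes the root is its own parent; stops early on an unknown
    ID or a cycle instead of looping forever.
    """
    path = []
    seen = set()
    current = tax_id
    while current in nodes and current not in seen:
        seen.add(current)
        path.append(names.get(current, current))
        parent = nodes[current][0]
        if parent == current:
            break
        current = parent
    return list(reversed(path))
```

With the files from an unpacked taxdump, usage would look like `nodes = read_nodes(open("nodes.dmp"))` and `names = read_names(open("names.dmp"))`, followed by `lineage(some_tax_id, nodes, names)`. Tax IDs are kept as strings here so that non-numeric identifiers, if a custom taxdump uses them, would still work.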
