Multiple taxids for a single sequence #54

Open
nawrockie opened this issue Jan 27, 2021 · 0 comments

nawrockie commented Jan 27, 2021

@nancyontiveros encountered a failure when running rfci.pl to check in RF00050, which has sequence AE017333.1 in its seed. AE017333.1 has multiple taxids in GenBank (https://www.ncbi.nlm.nih.gov/nuccore/AE017333.1/) because the genome contains prophage sequences (search for 'taxon' on that page).

Excerpts from the rfci.pl error, reported by Nancy:

A repository hook failed: Commit failed (details follow):: Commit blocked by pre-commit hook
genbank_fetch_seq_info for AE017333.1, > 1 taxids read: 279010 and 1230651

I remember coding this and adding the check that only one taxid be returned per sequence, because otherwise we'd have to know which one was correct, and it was not immediately obvious how to determine that. I'll look into it more now.
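
For readers unfamiliar with the check, here is a minimal Python sketch of the two policies in play: rejecting a sequence with more than one taxid, or keeping the first taxid parsed. It is not the code in rfci.pl or the Rfam Perl modules. The function names `parse_taxids` and `choose_taxid` are hypothetical, and the XML is a hand-written, trimmed fragment that assumes the GBSeq-style layout NCBI returns for GenBank XML, with taxids carried in `db_xref` qualifiers of the form `taxon:N`. The two taxids are the ones quoted in the error above.

```python
import xml.etree.ElementTree as ET

# Hand-written, heavily trimmed fragment in the style of NCBI GBSeq XML.
# Illustrative only: this is NOT the real AE017333.1 record.
SAMPLE_XML = """<GBSet>
  <GBSeq>
    <GBSeq_feature-table>
      <GBFeature>
        <GBFeature_key>source</GBFeature_key>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>db_xref</GBQualifier_name>
            <GBQualifier_value>taxon:279010</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
      <GBFeature>
        <GBFeature_key>source</GBFeature_key>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>db_xref</GBQualifier_name>
            <GBQualifier_value>taxon:1230651</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
    </GBSeq_feature-table>
  </GBSeq>
</GBSet>"""


def parse_taxids(xml_text):
    """Return distinct taxids from 'taxon:' db_xref qualifiers, in document order."""
    root = ET.fromstring(xml_text)
    taxids = []
    for qual in root.iter("GBQualifier"):
        name = qual.findtext("GBQualifier_name", default="")
        value = qual.findtext("GBQualifier_value", default="")
        if name == "db_xref" and value.startswith("taxon:"):
            taxid = value[len("taxon:"):]
            if taxid not in taxids:  # skip repeats, keep first-seen order
                taxids.append(taxid)
    return taxids


def choose_taxid(taxids, strict=False):
    """Pick one taxid for a sequence.

    strict=True mimics a check that rejects sequences with > 1 taxid;
    strict=False keeps the first taxid parsed.
    """
    if not taxids:
        raise ValueError("no taxid found")
    if strict and len(taxids) > 1:
        raise ValueError("> 1 taxids read: " + " and ".join(taxids))
    return taxids[0]


if __name__ == "__main__":
    found = parse_taxids(SAMPLE_XML)
    print(found)
    print(choose_taxid(found))
```

Keeping the first taxid is simple and deterministic, but it relies on the host organism's taxid appearing before any prophage taxid in the record, which is an assumption about record layout rather than something the format guarantees.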

nawrockie added a commit that referenced this issue Jan 29, 2021
…k, now we take first taxid parsed from GenBank xml; this is github issue #54