-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
creating a taxonomy database for PR2 #208
Comments
I'll let @zachary-foster respond on the taxa side of things as he's the maintainer of it. Is your pr2 data a proper (SQL) database? Or a set of tabular files? Something else? It would be interesting to think about how to let users define their own data source, but it's quite complex since data can be so varied.
Unfortunately that's not quite how it works. I wish it was that easy! The
I think the "work well together" is with respect to the taxonomic data alone, that is, that data retrieved from data sources in taxize could be handled/managed/filtered with taxa. And (see below) taxa even used within taxize to output taxa objects. In terms of taxa and taxize integration, the current version of taxize does not use taxa pkg. BUT, the next major release does integrate taxa2 (https://github.com/zachary-foster/taxa2) - hopefully to be on CRAN soonish. In that taxize version we will use taxa2 to construct various objects of taxonomic data. |
Hi!
The PR2 database has multiple formats, including a SQL format (https://pr2-database.org/documentation/pr2-sqlite/). Maybe this is what I should have linked to in the
Oh, I see.
Ah, I see. And I am sort of wanting it to go the other direction - make a database that
Thanks for brainstorming about this with me! |
@ctekellogg thanks for your responses. Thanks for the details on the PR2 database. I'll have a look. Is |
No, thats right Scott, the
If you have a local database, you should be able to parse it with devtools::install_github("vaulot/pr2database")
library(pr2database)
pr2 The table |
Good point about the pr2 package. I opened an issue in taxize ropensci/taxize#866 - tldr, it's probably not doable but worth discussing at least, as it might be |
Thank you both! (sorry for the delay in my response). Yes, I installed the pr2database package in R (before I wrote here), but then was struggling to figure out how I might search it in the manner that And yes, @sckott I am primarily interested in using Thanks again! |
can you explain this a bit more? does this correspond to a certain function(s) |
Well, since I often work with amplicon sequencing data, my typical mode of operation is to QC the data and the classify the reads against a reference database using sequence classifiers within the Seems like it may be a bit of effort to actually make that happen on your end, and I don't want to trouble you too much (since I fully recognize this is a rather specific request), especially if it isn't a feature that would benefit your package overall. So, it is totally okay for you both to say no can do. Colleen |
Thanks for the explanation! So For the checking names part with |
I am just about to leave for camping until Monday. I will get back to this then. Sorry for the delay! |
Hi,
I recently learned for
taxa
andtaxize
and have been exploring them a bit today, as I am trying to merge microscopy data with an metabarcoding data so that I can compare what is observed by microscopy in the ocean with what we observe via sequencing. I have a list of species and genus names for the microscopy data but for the sequence data, which were annotated with the PR2 database (https://github.com/pr2database/pr2database/releases/tag/v4.12.0), I have a much more detailed taxonomy. I played around withtaxize
today and was able to get it to work for my list of taxa found by our microscopist for the built in databases. But, is it possible to generate ataxa
andtaxize
-friendly version of other publicly available databases, like PR2? I was thinking I could do this using:using the example you provide for NCBI. It seems to work without errors but then this
taxa
version of the PR2 database doesn't work withtaxize
(hoping to useclassification
to list of the taxonomies for our the critters found in our microscopy data)? I get the errorError: the provided db value was not recognised
. Maybe it isn't supposed to, but I guess i was secretly hoping it would be just that easy to create a new db endpoint intaxa
and then use it intaxize
. Or, probably more likely, I am missing some steps in between, since yourtaxa
manuscript suggest these two packages should work well together.Thank you so much for any insight you can provide. I'd really love to streamline this data merging process and I think your package(s) do exactly what I need...but just don't have the databases I need.
Thank you!
Colleen
The text was updated successfully, but these errors were encountered: