discrepancy in number of samples #458

Open
Xiaofei-git opened this issue Apr 19, 2021 · 0 comments
Dear community,

I tried to download data using "TCGAquery_recount2", but I found that the number of samples differs depending on which TCGAbiolinks function is used. Why does this happen? Does "TCGAquery_recount2" download a different version of the data? (I originally posted this issue at https://support.bioconductor.org/p/9136385/#9136498.) Thanks a lot!

With "TCGAquery_recount2", the number of samples for TCGA-LUAD is 601 (542 tumor and 58 normal), whereas it is 594 (535 tumor and 59 normal) with "GDCquery", "GDCdownload", and "GDCprepare". 594 samples are common to both, and "TCGAquery_recount2" returns 7 more tumor samples.

With "TCGAquery_recount2", the GTEx data for lung tissue contain 374 samples, whereas a query on the GTEx website returns 419. Only 313 samples are common to the two sources.
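For reference, the overlap counts above can be computed with a comparison along these lines. This is only a minimal base R sketch: the helper names `compare_samples` and `sample_type` are hypothetical, the barcodes are toy values rather than real data, and it assumes that both sources identify samples by TCGA barcodes that can be matched on their first 16 characters (participant, sample type, and vial).

```r
# Hypothetical helper: compare two vectors of sample IDs after truncating
# each ID to `width` characters, so that long and short barcodes can match.
compare_samples <- function(ids_a, ids_b, width = 16) {
  a <- unique(substr(ids_a, 1, width))
  b <- unique(substr(ids_b, 1, width))
  list(
    common = intersect(a, b),  # present in both sources
    only_a = setdiff(a, b),    # present only in the first source
    only_b = setdiff(b, a)     # present only in the second source
  )
}

# Hypothetical helper: classify a TCGA barcode by its sample-type code
# (characters 14-15): 01-09 tumor, 10-19 normal, 20-29 control.
sample_type <- function(ids) {
  code <- as.integer(substr(ids, 14, 15))
  ifelse(code < 10, "Tumor", ifelse(code < 20, "Normal", "Control"))
}

# Toy barcodes (NOT real data) standing in for the IDs from each route.
recount2_ids <- c("TCGA-AA-0001-01A-11R-0001-07",
                  "TCGA-AA-0002-01A-11R-0001-07",
                  "TCGA-AA-0003-11A-11R-0001-07",
                  "TCGA-AA-0004-01A-11R-0001-07")
gdc_ids <- c("TCGA-AA-0001-01A", "TCGA-AA-0002-01A", "TCGA-AA-0003-11A")

res <- compare_samples(recount2_ids, gdc_ids)
lengths(res)                    # number of common / source-specific samples
table(sample_type(res$only_a))  # sample types found only in the first source
```

Listing the IDs in `only_a` and `only_b` shows exactly which samples account for the difference, which may make it easier to tell whether the two routes draw on different data versions.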
