Add function to PA to re-order phenotype data if necessary #4

Open
lckarssen opened this issue Nov 4, 2015 · 0 comments

This is a 'liftover' of feature request #5677 from the R-forge tracker.

This request is based on the forum post by user Siru at http://forum.genabel.org/viewtopic.php?f=10&t=876.

Currently ProbABEL (v0.4.3) checks, while reading in the data, whether the IDs in the phenotype and genotype files are in the same order. We could improve on this by first reading in the phenotype data completely and then reordering it to match the order of the genotype data.

My guess is that this is not too expensive in terms of computation time compared to running several million regressions. Of course, since genotype data may be split into chunks, with many chunks run in parallel (e.g. using a queue system), the total cost increases: each separate job has to do the order check and, if the data is out of order, the reordering.
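
To make the idea concrete, here is a rough sketch in Python of the check-then-reorder step. It is not ProbABEL code: the function name `reorder_phenotypes`, the `(sample_id, values)` row layout, and the error handling are all assumptions made for illustration. In particular, the sketch assumes that phenotype rows whose ID does not occur in the genotype data can simply be dropped, and that a genotype ID without a phenotype row is an error; the actual policy would still need to be decided.

```python
def reorder_phenotypes(pheno_rows, geno_ids):
    """Return phenotype rows arranged in genotype ID order (hypothetical helper).

    pheno_rows: list of (sample_id, values) tuples, fully read into memory.
    geno_ids:   sample IDs in the order they appear in the genotype data.
    """
    geno_ids = list(geno_ids)
    pheno_ids = [sample_id for sample_id, _ in pheno_rows]

    # Cheap check first: if the order already matches, there is nothing to do.
    if pheno_ids == geno_ids:
        return list(pheno_rows)

    # Build an ID -> row lookup; a duplicate ID would make the mapping ambiguous.
    by_id = {}
    for sample_id, values in pheno_rows:
        if sample_id in by_id:
            raise ValueError(f"duplicate phenotype ID: {sample_id}")
        by_id[sample_id] = (sample_id, values)

    # Assumption: every genotype ID must have a phenotype row.
    missing = [gid for gid in geno_ids if gid not in by_id]
    if missing:
        raise ValueError(f"IDs missing from phenotype data: {missing}")

    # Assumption: phenotype rows with IDs not in geno_ids are dropped here.
    return [by_id[gid] for gid in geno_ids]


pheno = [("id3", [1.2]), ("id1", [0.7]), ("id2", [3.4])]
geno_order = ["id1", "id2", "id3"]
reordered = reorder_phenotypes(pheno, geno_order)
print([sample_id for sample_id, _ in reordered])
```

With a hash-based lookup like this, the reordering is linear in the number of samples, and the already-ordered case costs only a single comparison pass per job.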
