-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ngsLD sometimes hangs indefinitely #20
Comments
Hi James, and sorry for the late reply. This is indeed strange, specially since you say that these failed runs eventually finish, if you launch them again, right? That makes me think that it might not be |
This is happening to me as well, in a slightly different way. I have over 100 datasets with the same number of SNPs in each (from different species and different chromosomes -- I use the same size of subsample of SNPs from each), and all of them run nicely except for one, which hangs indefinitely. In my case, it is one particular dataset that hangs every time. |
It's possible that my issue is caused by an input beagle file that is composed of mostly missing data (the vast majority of genotype likelihoods were 0.33333). Could it be getting stuck because of that? |
Is it stuck (as in, no output being generated), or is it just slow? |
It seems to get stuck -- I haven't let it run for more than 20 hours before killing it, but this is on a dataset of a size that typically takes about 10 minutes to run, so I don't think it's doing anything. |
I was able to get it to complete when I add the options --min_maf 0 and --ignore_miss_data. Obviously this particular dataset is pretty worthless, it's mostly missing, but hopefully it's helpful to reveal a particular circumstance that can cause ngsLD to hang up and never finish! |
Do you think you can sent an example dataset where |
Hi Filipe, I just emailed you some data to try. Thanks! |
Hello!
I've come across an interesting issue where
ngsLD
will sometimes hang on "Reading data from file..." and never proceed any further, even after approximately 48 hours.I noticed this problem while estimate pairwise LD in parallel across 16 separate chromosomes. Some chromosomes would finish within a couple hours, while others would just hang and eventually get terminated by the cluster due to timing out.
I'm estimating pairwise LD from genotype likelihoods that I calculated in
ANGSD
using the following command:This command succeeds without issue for all chromosomes and generates all output files, including the BEAGLE format genotype likelihoods used as input to
ngsLD
. Here are the first 10 lines of one of these files:Here is the
ngsLD
command that I've been running:where the first few lines of the position file looks like:
For reference, here is what the log file looks like for a successful run:
And here is what the log file looks like for a failed run that hangs indefinitely:
Finally, I should mention that the commands are being run inside of a singularity container (see HERE) into which I've installed
ngsLD
.Also, it doesn't appear to be a problem with particular chromosomes. If I restart the failed runs, they will eventually succeed so I'm not sure what is causing the occasional hanging.
Thanks!
The text was updated successfully, but these errors were encountered: