-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
java.lang.ArrayIndexOutOfBoundsException: Index 32770 out of bounds for length 32770 #8192
Comments
@shuaiwang2 Hi, we don't currently support indexes that long. We use a bai index for bams and tabix for vcf which only support up to 512 M. You need to use a CSI index for references that large but we don't support writing those. (Reading them is weird, I think we can read BAM csi indexes but not VCF ones). It might be possible to work around this issue by setting Otherwise I recommend splitting your chromosomes into two separate parts and calling on the split chromosomes. Splitting along a long region of N's should be a safe way to avoid missing any useful calls. (The telemere might be a good spot unless you have a T2T reference.). We should probably improve that error message to make it clear what the problem is. |
thank you, very useful advice for me |
I've opened a ticket (samtools/htsjdk#1651) to improve this error message and make it less confusing |
Hi, I think that means that the error only occurs when producing tbi index for vcf files of large chromosomes, while the variant calling progress can still be perform normally and I needn't to re-run the HaplotypeCaller. Am I right? |
I agree with your comment, but I don't test it. |
@Dentalium |
Hello, When I implement "HaplotypeCaller" commands, the reference genome is about 15G , every chromosome is more then 600M, I get some errors, could you give me some advice?
the commands
the bug:
The text was updated successfully, but these errors were encountered: