[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

java.lang.IllegalArgumentException: the number of genotypes is too large for ploidy 8 and 55 alleles: approx. 3381098545 #8842

Open
ChenDepp opened this issue May 22, 2024 · 3 comments

Comments

@ChenDepp
Copy link

bug reports

hi guys :

Hi all, when I run gatk (version: 4.5.0.0) CombineGVCFs to combine 240 8 ploidy samples gvcf, it reports the error as below
image

how call i solve it? ,replace CombineGVCFs with GenomicsDBimport ?
I think even though I got the merged gvcf file , this error is also will be reported when I run GenotypeGVCF?
I look forward to your suggestions
have a good day!

@gokalpcelik
Copy link
Contributor

GenomicsDBImport is definitely the way to go for this kind of operation. On the other hand STRs are quite prone to errors especially when higher ploidies are involved. You may wish to reduce them or even completely drop them if they are not of your interest.

@ChenDepp
Copy link
Author

hi @gokalpcelik
I used GenomicsDBImport to replace CombinedGVCFs, but it has new problems, GenotypeGVCFs for GenomicsDB is so slow, can only get 900K interval vcf in 9 hours. how can i speed it up.
waiting for your reply. hava a good day!

@gokalpcelik
Copy link
Contributor

Hi again.
You should be able to split your variants into multiple intervals and import all intervals in parallel under different genomicsDB import instances. Those instances can then be genotyped in parallel and finally combined into a single callset. By this way you can get your variants faster. This method is called scatter-gather which is what we do and suggest.

I hope this helps.
Regards.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants