freebayes for snp call in pooled samples: how do I set the parameters?
1
0
Entering edit mode
4.1 years ago

Hi everyone,

This is such a basic question coming from lack of experience. I was trying to run freebayes on 2 pools of samples (one with N=37 individuals and other with N=21). I'm using the option --pooled-discrete but I'm not sure on how to set --ploidy and --use-best-n-alleles. Can anyone help me figure out if, for diploid species, ploidy is double the number of samples you pool?

I know it's the number of genome copies per-pool but when I set --ploidy 74 (without --use-best-n-alleles) it keeps eating up all my memory and it never finishes (it's been 3 days). I'm calling only in a small region of a chromosome so it shouldn't take so long, right?

Thanks in advance :)

SNP freebayes pooled ploidy • 2.1k views
ADD COMMENT
0
Entering edit mode

Also see What do the freebayes parameters --pooled-continuous/--pooled-discrete mean?. Ploidy is alleles per sample I guess not the sum of it, so 2 for diploid organisms.

ADD REPLY
0
Entering edit mode
3.0 years ago
ross_whetten ▴ 10

According to the Freebayes documentation, with the --pooled-continuous option " When running with high --ploidy settings, it may be required to set --use-best-n-alleles to a low number to limit memory usage." Even a small region of a chromosome may take too long or use too much RAM given ploidy 74 and no limit on the number of alternative alleles.

ADD COMMENT

Login before adding your answer.

Traffic: 2633 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6