SNP Calling Parameter Settings
0
1
Entering edit mode
5.2 years ago
mcclintock ▴ 10

Hi everyone,

I am using family trios data to run GATK's SNP calling pipeline. The HaplotypeCaller gave me the father's SNPs file, which I have recalibrated using VariantRecalibrator. But the final VCF file still has 3 million “PASS” records. Actually, one human has no chance to carry so many SNPs.

Any advice for adjusting the parameters?

The parameters I used refer to the literature below.

Roazen, D., Thibault, J., Banks, E., Garimella, K., Altshuler, D., Gabriel, S. and DePristo, M. (2013). From FastQ Data to High-Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. Current Protocols in Bioinformatics, pp.11.10.1-11.10.33.

Thanks.

SNP gatk vcf • 928 views
ADD COMMENT
0
Entering edit mode

Actually, one human has no chance to carry so many SNPs.

Why do you think so? What is the expected number of SNPs for one human subject on average?

ADD REPLY
0
Entering edit mode

A little more than 2 million is appropriate, as far as I know.

ADD REPLY

Login before adding your answer.

Traffic: 2524 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6