Question

Forum:SNP calling in 2018

2

Entering edit mode

6.1 years ago

Picasa ▴ 640

Hi!

I would like to perform a simple variant analysis (SNP) with multi samples. I use to used the GATK workflow before but I would like to know if there is anything "better" than GATK nowaday, or is it always the gold standard ?

Thanks

variant gatk SNP • 2.7k views

ADD COMMENT • link updated 11 months ago by Ram 43k • written 6.1 years ago by Picasa ▴ 640

1

Entering edit mode

Mods - Probably worth turning this into a forum post?

ADD REPLY • link 6.1 years ago by andrew.j.skelton73 6.5k

1

Entering edit mode

Good suggestion, done.

ADD REPLY • link 6.1 years ago by WouterDeCoster 47k

score 4 · Answer 1 · 2018-03-03

GATK hasn't changed a great deal in terms of math and calculations between 3.8 and 4, however there's been some significant enhancements under the hood in terms of speed and data structures. Variant calling is still a hotly contested topic, specifically filtering around what is truly variation and what isn't - see the blog post here which gives a nice comparison between Google's DeepVariant, GATK's VQSR methodology and GATK's still in development CNN. All in all, it's still an exciting area of development to keep an eye on.

So to get to the root of your question, I still feel that GATK's variant calling methodology is the gold standard to go off, but it certainly doesn't hurt to compare and contrast methodologies. I'd suggest you look at samtools / bcftools for an alternative approach which can sometimes be a great help when working with non-model organisms.