Biostar Beta. Not for public use.
Forum: SNP calling in 2018
2
Entering edit mode

Hi !

I would like to perform a simple variant analysis (SNP) with multi samples. I use to used the GATK workflow before but I would like to know if there is anything "better" than GATK nowaday, or is it always the gold standard ?

Thanks

ADD COMMENTlink 24 months ago Picasa • 390 • updated 24 months ago WouterDeCoster 39k
Entering edit mode
1

Mods - Probably worth turning this into a forum post?

ADD REPLYlink 24 months ago
andrew.j.skelton73
5.7k
Entering edit mode
1

Good suggestion, done.

ADD REPLYlink 24 months ago
WouterDeCoster
39k
4
Entering edit mode

GATK hasn't changed a great deal in terms of math and calculations between 3.8 and 4, however there's been some significant enhancements under the hood in terms of speed and data structures. Variant calling is still a hotly contested topic, specifically filtering around what is truly variation and what isn't - see the blog post here which gives a nice comparison between Google's DeepVariant, GATK's VQSR methodology and GATK's still in development CNN. All in all, it's still an exciting area of development to keep an eye on.

So to get to the root of your question, I still feel that GATK's variant calling methodology is the gold standard to go off, but it certainly doesn't hurt to compare and contrast methodologies. I'd suggest you look at samtools / bcftools for an alternative approach which can sometimes be a great help when working with non-model organisms.

ADD COMMENTlink 24 months ago andrew.j.skelton73 5.7k
Entering edit mode
0

To extend Andrew's answer, I was heavily favouring GATK for many years but their pipeline became overly restrictive (inflexible). So, like you, I sought alternatives.

samtools / bcftools is a very simple pipeline but this fact, ironically, is its advantage. samtools / bcftools is excellent for identifying SNVs; however, not good for indels, in which case I would call these with pindel.

The final point: no variant caller can completely mitigate the error that comes with using a sub-standard (but rapid) sequencing technology like NGS.

ADD REPLYlink 24 months ago
Kevin Blighe
43k

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.0