Biostar Beta. Not for public use.
Variant calling for MNP
0
Entering edit mode
16 months ago
nilus1432 • 30
Singapore

Hi All,

I would like to know is there any program for variant calling for multiple-nucleotide polymorphisms. For eg. detecting cases like, Reference = 'ATA', Sample = 'GTC'.

thanks a lot

RNA-Seq SNP • 2.2k views
ADD COMMENTlink
4
Entering edit mode
17 months ago
Len Trigg ♦ 1.3k
New Zealand

RTG Core calls haplotypes such as this. In reply to rbagnall, the benefit of calling as haplotypes rather than as individual SNPs is that the variant caller is able combine the evidence across both sites rather than calling them independently (e.g. read evidence containing all of ATA, ATC, GTA, GTC looks like het A->G and het A->C when looking at the sites separately, but is completely bogus from a haplotype perspective).

Whether the output is represented as multi-base calls or is decomposed into smaller variants is orthogonal and largely personal opinion. Either way, with complex calls you are likely to encounter cases where variants may have multiple alternative, equivalent representations. Even decomposition tools don't get you to a canonical form for all cases, so haplotype-based variant comparison tools such as rtg vcfeval are critical for accurate comparison of complex variants.

ADD COMMENTlink
0
Entering edit mode

Thanks Len, that makes sense.

ADD REPLYlink
2
Entering edit mode
2.5 years ago
Aberdeen, UK

Freebayes calls MNPs and complex polymorphisms as well as SNPs and indels

ADD COMMENTlink
0
Entering edit mode

I'm curious,

What is the benefit of calling a multiple-nucleotide polymorphism as shown in @nilus1432 question, rather than calling an A>G and an A>C single nucleotide variant? The latter are easier to annotate (missense, frequency in population controls etc...).

ADD REPLYlink
1
Entering edit mode

Because you can annotate them correctly. Calling two adjacent SNPs and passing them to something like VEP will give you two different SNP effects, but if it's a genuine dinucleotide change within say a codon, you need to be able to assess the effect of their presence as a unit.

ADD REPLYlink
1
Entering edit mode

Thanks Daniel,

Yes that's true and something I hadn't considered. I will have to see if VEP annotates such MNPs; it looks like SNPEff does. Also, the train of thought led me to this paper which describes the issue a bit more for anyone interested, though I haven't tried the described software:

http://www.biomedcentral.com/1471-2164/16/569

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.3.1