Question: Is There A Samtools/Bcftools Setting To Call Variants No Matter How Infrequent?
0
Entering edit mode

I can't get this position to be called by the samtools mpileup/bcftools stack.

I get the variant if I ask for all positions (not just variant):

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -

(showing one position for brevity)

myRef    21    .    C    T,G,X    0    .    DP=3796;I16=3730,0,55,0,135424,4994460,1972,72624,42241,1163355,634,7544,26664,192714,488,4340    PL    0,255,199,255,217,199,255,217,199,199

but if I ask for only variant positions I don't get anything.

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -v -

It is filtering but I can't figure out which filters to turn off.

The pileup itself is here:

myRef    21    C    3796    ...................................................................................................................TT........T.........T.....TT..T...T.T.T.....TT.TT...T.....T.TT..........T..TT......TT.T....T...TT.TT......T..T..T.T..TT...TT....TT.T..T..TT.TTT..........TT......TT............................................................................T...............................................................................................................................................................................T...........................................................................T................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................G.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................T..............................................................................................................................................................................................................................................................................................................................................................................................................................    HG7GHHDFGFHHHGHHBH8FHGHHHHFDHHHCGEHEH6>HHHH4HFHHDHAHHHHFGCHH<G@GHCHH:FD?EHHEGHH7EHHHGEE?;1HGGHB14EDHHFEE:HH?HH<GHBFEHHHGH>HEHHGHHGH@EHGGB>HFH<GEH>HHGHFHFEHGHBGHHHHHHGHHEHHGHGBGHF@HFHHHHGEH4HDGHHEFDEHHEF@CBG/HHHHHHGHDHHH?HGDFHHHHF@3HGHHHHHHHE=:HEHHHHHHE@DHHHHHHHGHDHEEHGHFEBFFHGDGGFD<GEHHBHF=HGHHGGHHH/FHHEHEFH;HDFHEHHG?BEHEHHHDHHGHHEH;EHGEHDFD?DBHFHGGEHHC:HFHHGGHHGHBHGEHFHFHEEHBHHHHHDHHH6HHEHH8EH>HHHFBGHHGEHGGE?H0HF<HGGHFHFBHHH>HHBBHBHHHEHHGGB3:HHE7@=@EGEHHHHHHFHHE>HHHCHHEGEHHGHHBHEDHHDEEHDGHEG@GCH;HFHH>5EC>FHHGHGHCEHCHBHBCH=B<GHHDHEHHDEHFGHHFEHHFHHHEEH?GGHHH3HEEC?HGF:GH7EHCHHHEGHE4HH??>EEBHCDGBHHDHDG4?EG9HGGG@EEEHB1EHHHHGGDHHHCE6GGHHHFGHHFHGGHHCHHFHFH/4GGGEHFGHHGHH9EHHHHEGFGHGEE?G>EGEHHH@BEE?HGEGGHHGHDFDEHHHHFEHHEHHHHGHHHFGHHHFCHHHHGHHFHFH2HHFBHGHHHHGHBHHFG?HHH;H4EHCHHHHHBGHEHE=GGHHGCHHF2HHHFHBHHHHHGHHD2H<HH>HFHEHG9HHEHEHHGGHHCHEGG3HHHEHGHGEHGFBHGHG@GH>DHHH1HHHHHHE@HHHH0EBHBDE=FHH?GHHEHHHHHHBHHDFH>FG@EHFEHEAFHHHGHHEGHH<GGHHHH@H;HHHGH6HHCEHBEHGH>E;H4GHHBHDHH<F9FHEHHHHHHDE7GHHCHEHEEGHEHHHCHHHD;8E@G?HH9HBHBH>C@H?HHE@HHEHBGHEFEHEBFDHHEHH9HHH>GFHHHH;EHHBHHHHF6EGHHH@HEEGHBHFB>EHHH@HBHBHHHHHHEEHHFE?HHDEHGHEHHGH=HGG@GHGEEH7F:HFH:EHH@HEHGHECGBBGEBEF>FEGHHHEDHHEH/BHHHHEHGDC,<BBHCHFBHDGG2?DHHH?GF9GBGHGH<HHHHHHGGHHHEB@EGHCG=EGEHGG;CBBECH7EEGGAHFGEHHH=?EGH9HHHHFGGHH8HGHE83H>H>EGHFGHGGHHEGHGDHHHHGHGHH=EFGEHHH7?@HHHF2HEGGHDHHFFHHCHHHHFHHHH.DH;?CHHHHHHFHDHHH2H6HFFGB9EGEHEHHHEBHHH@H6H=HEDBHDHHHHE:HHHGCHEGHHEEEHHGHGHHHHDHBEHHGHEHHDGEHHFHHHDBHHHBHHHHFHHHH?HHCH?HH25/FHD4HHHDDEEHEEHHDHHEEDEHHH?DH2HHHHEBBHHBH@HHHDH@HHEFHHGGGHHHBF>F?HGHEGHHGHGHGDHHHEFHHHHH@HHDDGHG8C;GGEEH8H@HDHH<HHEHHH4EGHHBHBHGGFH@DHFH=EEHHHHHH?HGHHHHHHHGEFF,GG?H6GDBH@HHFHHHHHHHEHHGDGHBE<GGHHHEHHEF4DFGHHEHGH0HFH=GGGEH/HGDGHHBHEHHHHDEHCGHHEH?HFD9@FH3BHHFEEFGH>HHHA6GEH@2B/<EGEHHHH8HHFC/GGDFHEHHEBE=8HFEHBHG=HGHHBHEH?EHHHGHEHHFGHEHHE%H<HHHBH8HEEHE<DGG?@EGEHGHBFHHGBEEH@E<GH?HHGFDFH9B6GEBHHFHGFHHHHHHGGHFGHH@=GHFHHECEHFFAHHEFHE*G/?HFHBGF?G7G:GDHDHHGGDHHE@GHHG@HH?H>HF/FEEH3EFHHC;HHH?;GHHFEEF4EHH<;E=H9EHFCHGGHE>GEEDHHHHEFHHHGHHD?GHHEBB8EEBHBH>HH;FDCGHGGCC?HHGHHHEFHHHEHDHE?@HFE2HHDHHHEHFG:CDF:6>E>GHFFH9EDHH4EHHF6HD7HHHGFGDH4GDHEHHHEG6HGHH*HHHHEFGH.HHHFHHHBCCHB6HG=HHHHHGHHGEDHH6FGH>FDFHHFHHHHFEHHGHGHDHG?7HDHH;4@HEEHHHGGH:CEHFHHHHGHH:H2<HFGEDEEHHEFHEH?HEGHHH>BHCHBHHHHFHHFEH0HHHH=BC6HHHGHGGBGHEHHHH:EAHFHEHHHHHEHEHHHHEHHGHFGHHHEHHHHH?HHEH7HH4?HHEHEHHEFFED=GHGD;HHBHHHGHHFHHEHHGCHBFGHHHHEHGHEHGHE7HEHGFHHHF8HAGH=HEFGHEDHHHCHHHHHHEHHHDHEHHEDHECHCED:HEH:HHHHGGHH;;FEEHEEEHEHG@DHHDH5HFFFHHEDHHEEFHH=FHEHBHHHHHDHHHEGFDHGFHHHHHHFCHEHHH?G:EFGEHCGHGHHGHHHGE?GH*H>HHGHHHEFGHHEGGHHHHHH>GGH@HH=HHEHHBEGEHHDHHGHHHEHH9EDHHEGHHHHGHHGHHFECHH:HFGHHE9DE6HFHHH;EHE>HEEFEHBHHHHEHEFEEHHHHHG0@HC=HHBHBEGHHGH:BFEHHHFHH:HBGHDGHGEEHGHEHGGE6HBHHCEH?EB7GHEHFHH?GBG=@EEEBCE?HBHH?HEFHHHGHHHH6EHGHGGGGHHBEEHHCDHHHE=HHH:.HFHGGHEHHHGFGFHBHGHD@HHGHFBHHBHFHEGHHGHEHH9H3HHHHHBHFCHFHGE2GHHHHH<HHEHHCDH>EHHHBHHH>FECH0?EDHDE@HFGH<HHHFEFEHEFAHH@HEHGHHHH@BGEHHHHEHHEGHHHHHH?HH4FHHHEHHHHDHHHHE?FHHH?E:GHHHHHFHGHHHHEHHHHHHEG,EHBEHEHHHGDHBH8GDHH2HHGHHHHGHHBBEH;6HDHHH3H:HHH2HBHDHHHFHEHGHGFHHHGHGHH9HDHHE9HHE@HHHHHHHGHHHHAH=HH9HGECHBGBHEEHHHH@BHHEHHHGHHHHHHDHFHG@E=HD77DEHEH0HEDGEDEDHGHHHBHHHB;EHE)HGEEEFE:HHHEH7HEHHGHHHBHHGHGFHHGEHEHHEH@HEEHHH@?H@DHHHHFEEGC>GH9DH5H=H:HHHGG?H?EHHG;GEEHHHDGHHHHHHE@FCDHHEHHF9HHCCHGHEFHH=HH=G;HE=>HGHH<HHHHGDEF7G-GEH:GEHEFHHHHH>G4HGHHEGHHHFEF;HH@HHHHECHHH<HHHEHFE<GDFHHE'HHE=GHHH@9H@CHHHFHHHH>GH=HHHH?HH4HHE?GHHHH>HGDACHG2GEHHEEDGEHDHH5HH8HEHHHH>BHHBEH?DHBHH7EH1EHFHG:HHEGH<CHHGHH:GHH<=HGHGHH>EHHEGHGE=GFFHHHGHBDHHGGHH:BHHHEBHHHHEHHHG?HGFHHBHHHG>HFEEGGEHHHEGEFHHEHHGG6HGHGHHHHHDHHEGHAGFE1DEH:HHHDGH2G8CEFHHHHHGHGHHHBHHGAFHHGHH?HFHGGHHGHH?HH=HGHHEH;EFFGHDGHEEHHHHGHGHHHFGHE<HH?GHBHGHH>HHGHGFHFGHGHG<HHHHHH@HH,C@HGHABHHHHHHHGHHGEHGGHDHGBCDBGGGHHHHHHHGFHGHHGDGHBHGHHHHGHHEHDHFHHHHGHFHHHDBHGBHH?GHHHHHHGHHGG7HGHHHHGG=GC5HHHH?H:5?GHEHGHFGHFG
ADD COMMENTlink 8.3 years ago Jeremy Leipzig 18k • updated 8.3 years ago lh3 31k
1
Entering edit mode

Hi Jeremy, this could be a possible duplicate of this question, where the answer is that the use of BAQ by samtools reduces some true positives together with the false positive reduction and it is suggested to user either -B or -E in order to fix it.

also, your problem could be due to high coverage filtration, in which case you should use the -D / -d options to set the maximal accepted depth.

ADD COMMENTlink 8.3 years ago Doctoroots • 780
1
Entering edit mode

Bcftools assumes your sample is diploid, which greatly improves the power for variant calling from diploid sequences. Your SNP has <2% frequency, even approaching the sequencing error rate. There is no way bcftools can call it. If you really want to call it as a SNP "no matter how infrequent", you should just parse the I16 field (first 4 numbers are identical to DP4).

ADD COMMENTlink 8.3 years ago lh3 31k

Login before adding your answer.

Powered by the version 1.8