demystify pileup format -- how to properly filter pileup format
4.3 years ago
Lhl ▴ 760

Hi Folks,

I just found that when I count variants in a pileup line with Perl regexes, the counts sum to more than the sequencing depth.

cut -f5 test.pileup | perl -ne 'chomp; @del = $_ =~ /-\d+[AGCTN]/ig; @ins = $_ =~ /\+\d+[AGCTN]/ig; @m = $_ =~ /\./g; print scalar(@del) . "\t" . scalar(@ins) . "\t" . scalar(@m) . "\n";'

Output (deletions, insertions, matches):

128     23      3781

Clearly, the number of deletions + the number of insertions + the number of matches (128 + 23 + 3781 = 3932) is greater than the coverage in column 4 (3789).

Can anyone explain why? Did I make a mistake?

Thanks a lot.

The test.pileup file content is shown below:

contig1      200      T       3789    ...-1G....-1G.........+1G...........................-1G........-1G...........................-1G..............-1G.-1G.....-1G....*....-1G................-1G............C..C.............-1G*...........-1G..................................-1G..................................-1G............-1G..-1G...................-1G.......*......-1G...................................*.........................................*.......-1G.-1G.....-1G..............................-1G........................+1G.-1G.-1G....................................-1G......-1G..-1G.......+1G...........-1G.+1G......-1G....................+1G.........................-1G...........-1G..........-1G....................-1G........................-1G...........................................-1G..............-1G...-1G............................-1G.......-1G........-1G......................................................-1G.............-1G...-1G.-1G..................-1G......-1G.-1G...................+1G.................................-1G...-1G...................................................-1G......................+1G.................-1G...................-1G.................-1G.............-1G.........-1G............-1G........-1G....................................-1G..............-1G..-1G..-1G......+1G.............-1G......-1G..........................-1G..-1G......-1G.....-1G.......-1G................-1G....-1G..-1G................-1G.........................+1G....................................+1G...................+1G................-1G........-1G....................................-1G.-1G.....-1G......-1G.....-1G........-1G............................C..............-1G.......-1G......-1G..........-1G...-1G..............+1G....+1G...........................-1G.......-1G......-1G..........+1G.-1G.............-1G.-1G........-1G....-1G......................-1G.....-1G.....-1G.-1G..-1G.-1G...........-1G................-1G.....+1G...............-1G.........
......+1G....-1G.......-1G....................+1G........-1G...........-1G..+1G..............................+1G...........................-1G.................-1G..-1G.........+1G...............-1G...-1G.........-1G........................................-1G....-1G..............................................-1G...+1G...........................-1G...-1G..........-1G..+1G.-1G.-1G...-1G...................................................-1G............-1G.....-1G........................-1G........................................+1G............................-1G.-1G......-1G...................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................^].^].^".^].^].^].^3.^].^'.^2.^].^].^].^].^].^].^].^].^].^D.^].^].^7.^".^].^].^;.^].^5.^N.^].^].^].^].^].^
].^].^C.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^V.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^6.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^A.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^=.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^R.^].^].^].^].^C.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^>.^N.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^Y.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^H.^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].^].   
,.(/1'+),+',.'!(.1/*,).$'(+')+1+'')-,+)&)&4-,,(-)(+,2*(&()&)*..','+/-!0./(),$0(/')((,*)*+)++(,+,(1))*+*!))+*-(*.**,',).*/(+-,,/,(&)!.),!,(!&&,)!)+%).)0-,('1*+)2-*.),,+!*,.),***,(*-*+!(),-'#)!*(..')%+,)(()-****).,$(()+!)+0)+),)+'*'()0),,,.&(*)())$/&,*'',,!+(.&*,().',()'/*)+))!+*)**%(',*((*'*-((),)+&-,),).*+()(*(,)(-'+**)*(0+)*)-+)+)++,)&+(+,*'(-,(,+++-)))',*(!+*)),,.)!*++&)(+))!($*0&)*')').(,**))&!+*,')+-(((!(,,*(*&(*)*,!*))+,,',*('.*,'**(!,*'+*)))')+!),*,-*+'*+.&'*)*&,-*-)))!)*+,),))*0*))/,/*!)+)+*)(!(-,&++()!'/+'(),,*$-.)**+/)!)+'')/(()*')+,))))*+('*('&%)*!)(*-!'&!('(*$&,,0&*-!+'!))*&**.(')!-&%&)*)%**))+%+)&*&(*.)(&-+0,&&2+(+'')*)'&-)/((,,%()(%,&.&*('&)(/!*!+('')*%*++*-&+**+-!,,.-&*,'$,(/,+((-++****,()*.'*++(+(+*(*+%,',-!!**!.*)&**(,**'*((+)(&/)*'+*+!+))'*+!.,*&*)-)+'*,)+(('*'**(')')*()',.''+++'(+..*+.*!,,-*+(())*,)*-(*'(*+)!!*((-*0')!&(-%*)((%&'')+*('((',*(/+%'*+*)+'+*(()-&+-/*+0,*'(,+(-+)*),(0'*++'%),+(+''/'4!(*(.($+&--'()((()()+(*$))&*-(-))).'-'..&*!('(())$'(+(-+.(*!+)'+)','+**,-0*+())*()'+(*'%&)+(-1)(-,!!)+()1+*(+))-&(('*')&*)*-)+&)()+*,(++,).(''*+*!+-(,()*!*'+,,((%*,(.+-&!(*+(-)!1+.!)().+)!')*(++!+'$(()*++)*!)2+)('*&)+)!+(+')**&)!-*)((**'*).))*+-)+,+,)-+'*.-,!*&+,,'-+-).',,&+)!*.(+''+,$*(')))*+*&*'*(')()-+!*)')*01)-(((++)).!!)())'*(*)&)-3',!*.).)*!-!)&&(#)(,(*+(*(,+&,*()*(((*)*'(&*(,)**+!+-,+*%('.''-&2/%&%*'/')(),!)*0!,.,/),*$+1)()'&',+'.&*./--)),)&&,!%++"*!)&(*)')!!-+2(+&(-+,''3(*',''&(*'*&--)!!*%(*).*((-,+''-+)*'(,')!-).'!'-*)(*#-((&(*+')'()'')*((-*!+',',(!**+,*1+()((*&))'++))(*(*)*((+,(*+)!*,**,-0(--/('('*'--*-++'(,+*(!'$&!--+),)($++$()$+)*+*),***+','),).()-)-'*%)*'*,!-).)'.)(+)(*',))!(-+)()%+,+(/()),(+')('/.+*'-*(),,,+()))!('.!+&))*')*('')*(!(+*,*'((***$)*-!(+*(/''()')(.())'*.())(*,*,*()!+')+.++/,,+('*('+(+((,'*.+*,+/*$+(+(!/,*-))*+(.(,*(('-++*++.)))(,(,..+))+*))*+))*-+**'%)'()*,+-(+*)'-).*,-.+*-*(+)*(+(*-(+,+*-'),).)'*(,('-!,+3.+'(,,*)*-))+(+&&&.')((!)(,((()-.(&--+('-(+(+-+'$((+!)(.))+'+*'*)-,(/*)!+&'*+))-)&&*(+&*+(**.*%)++*)''+*/,(**!*)+%*)%)!.+%)'))+'(.
(.))')**)**+-,''),)')!,+&+'-)('),,!)(**)!*(.((-()))+(1(*(*-**)((-,**(+-*+)+(*+-**!)')-()!(!&)!,(&'''&)'+('&(*()*)'''!!)%!!)(+&**&$(#!(''&&!!!*(((-$'(''!&,*,'%&'%(())))))(),'&&'*)(')$!','*(''(&,+!('&**+).&)%*'!(!())')'+(')&&%)-#+)(,((%($)')%'*!((&')'*,(.!*#))!)++'-'.()!)%())&((*'+*&'**)'*'*/)!%(((/!'-'(-+'(&((()&%(&!*'(-&(%)&&(&-)'(('*(*$(&%))+&*+''!'('())(%*%','&(,$!*+)')!)*'+',!+&*!('&*'''('&*(()!*')&.''()''&(*'*(')),(*)+%%''*+'(()'(),'+(''%''()!)(&,,!('(')),',-'&)+*,''+')()')(*)(&)1'**!(*!)!(!)()$'-)$-**)*!)(),,!+'(&(')&('%+,+*'*(+*,$*'*')+'&&.(()'(*+((*(+(&+!())(*#,'&'+')+'')(+)('*!(-')&!%,)-%)((!$!-('('&,(*,$*,*)++((!&('%!(%'++''!'!'))(+&&(*!!*+!)())(&.+&&*('))')(,*'%()+&&(,)*'%('')&*!$'')!+&&(,(('&'!*('),(&('()'!%)+*('&'(''),!*)')(')..%(!&*)!'%*!&(.&)(,.'+!+&!'+++('***(%$,(+,!&)&')),'*'!&+)''!(&(+)($,'+())),&%*)))*(.)&*(&((*&-*)(+()''('&&'+'''3-*('*(((((+,'/&.+*,)),*''&)('(*'-*,!('+(&'))(*,&&'('*!''&''()'!((#'%*!#%(%!%!(%'('&%&$&'('()!)&&&&(&)&'$'('%/$%''(#'#(''!%''%%'#&&(*'&*%!'%&&''&%&)((),*(!(&!&$&!((&&#'&)&'%$&#&'('*')'&&#&*&+(&%)*)&('!'%')&%!*)&$(%'%($&$&(')&!%,&%%''+)'&*!%$(%%&(''))&$('''&)%'*''&$&(&)&'+&((%(!'(+&%&)$%$'&'')&'&('*)(('%&''(('&+&!+&&$%%)'((!&(')&)('&*%(%)#!'')'$)&*%&'!)('*'%)(%&(&&%%(+%&!%&%)%!&(%&!!$&&&*&%**''')!')&'&)#'''''',(&%+((*!'''&+%&$%(&'.&%$)''%%&+&'%'!(+'(''+((,'%&'%%('($!,+%)''$$))%)&)'&!',$(,%((('&+'&('&%%%&&&&$'&('&%%%'(''&%'%'('$'%')&&#+)%($!!!)&','(!&(&$&#$$#%$%$$%$*&*!%(,###'*))*#+-)0!&,)((&$$&!-)!/'&&,%2'))&,$**%'!$'*'!++&,%%$!!!#(!++&(+('2'*&$!*$&+&&)',(/))*&.'&)-+((($$$%&--%--).,)))&(%*&%-/$-$(+'-*&-&''*(*&$*!&+%/(&#*%&&(&)#''+"*)./%(,!($%&1,0&'6)#+'(('((+&$('%&(%)(&('+&$"+'$'$%$.''!*$*.'),'$(,!)()''+)$&(#'/"#%$,$%))-$$+(&&1!),&*!$#&/+'*+&'(($-!'$%'..(*+&%-$''$%'+!)+*%#'(!&$+/*(+$0%3!)*)%&&&'&6&&%+**'%))(.''%),,!&
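
A possible explanation, following the samtools pileup conventions: an indel such as -1G or +1G is written right after the read base it belongs to, so it annotates a read that has already been counted (here as a "."), and does not add a read. Counting indels separately and then adding them to the "." count therefore counts those reads twice. The character after "^" is a mapping quality, not a base, and "$" marks a read end, so neither should be counted either. The Python sketch below tokenizes the bases column (column 5) with this in mind; the function name count_pileup_bases and the category labels are made up for illustration, and counting every remaining symbol as one covering read is an assumption.

```python
import re
from collections import Counter

_NUM = re.compile(r"\d+")

def count_pileup_bases(bases: str) -> Counter:
    """Tally events in the read-bases column (column 5) of one pileup line.

    Hypothetical helper: the category names are illustrative only.
    """
    counts = Counter()
    i, n = 0, len(bases)
    while i < n:
        ch = bases[i]
        if ch == "^":
            # Read start: the next character is a mapping quality, not a base.
            i += 2
        elif ch == "$":
            # Read end marker: carries no base.
            i += 1
        elif ch in "+-":
            # Indel attached to the previous base: +<len><seq> or -<len><seq>.
            m = _NUM.match(bases, i + 1)
            if m is None:
                raise ValueError(f"malformed indel at offset {i}")
            counts["ins" if ch == "+" else "del"] += 1
            # Skip the sign, the length digits, and the indel sequence itself.
            i = m.end() + int(m.group())
        else:
            if ch in ".,":
                counts["ref"] += 1       # match to the reference
            elif ch == "*":
                counts["deleted"] += 1   # placeholder for a deleted base
            else:
                counts["other"] += 1     # mismatches (ACGTN) and anything else
            # Assumption: each of these symbols stands for one covering read.
            counts["depth"] += 1
            i += 1
    return counts

if __name__ == "__main__":
    print(count_pileup_bases("..-1G.+1G.*C^].$"))
```

With this tally, counts["depth"] (and not ref + ins + del) is the number to compare with column 4; the ins and del counts say how many of those reads carry an indel after this position.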
snp next-gen alignment sequencing • 498 views