samstat and samtools differ in the number of MAPQ=0 reads/alignments.
1
1
Entering edit mode
7.1 years ago
user230613 ▴ 360

I'm trying to retrieve the number of reads and the number of alignments with MAPQ=0. To address the question I'm using samtools in the following way:

samtools view -@ 4 BAM | awk '{if($5==0) {print $0}}' > mapq0.sam

cut -f1 mapq0.sam | sort -u | wc -l

1606759 # Number of reads

wc -l mapq0.sam

2952315 #Number of alignments

However, I'm getting different numbers when running Samstat:

MAPQ < 3 30814.0

I think I'm missing something very basic. Moreover, I'd expect to have a greater number in Samstat MAPQ<3 since obviously it also considers, MAPQ=1,2...

Any hint?

Thank you

bwa samtools samstat • 2.4k views
ADD COMMENT
0
Entering edit mode
7.1 years ago
James Ashmore ★ 3.4k

Maybe try using bioawk:

   samtools view input.bam | bioawk -c sam '$mapq == 0 {sum += 1} END {print sum}'
ADD COMMENT
0
Entering edit mode

Thank you for your answer, but actually I'm looking for an explanation in the difference between samstat and samtools counts :).

ADD REPLY
0
Entering edit mode

MAPQ=0 indicates multi-mappers. Have you considered/accounted for those when comparing those two numbers?

ADD REPLY
0
Entering edit mode

Yes, I have consider that, that's why I'm counting read names using cut -f1... command

ADD REPLY
0
Entering edit mode

Please could you move your answer to a comment? I want to keep the question opened and unanswered :)

ADD REPLY
0
Entering edit mode

Question never really closes. Even for questions where an answer has been "accepted" one can add new answers.

If you are referring this samstat program then I would be scpetical. It seems to not have been updated in 1.5+ years. If you need an alternate give Qualimap or Bamstat a try.

ADD REPLY
0
Entering edit mode

Yes, I'm referring to that samstat, and although it has not been updated in 1.5 years, we are not talking about a very up to date (changing ) issue, is only counting the number of reads with MAPQ=0. Is not a big deal :/

I've used qualimap, but as I said, I'm not looking for other tools to do the same, I'm looking for an explanation of the difference between samstat and samtools.

ADD REPLY
1
Entering edit mode

There seems to be some sort of a bug in samstat. I tried your method above on a small bam file and got different counts with samstats. You could report this to the developer.

ADD REPLY

Login before adding your answer.

Traffic: 2573 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6