Precision and recall of binned contigs
0
0
Entering edit mode
5.5 years ago
kimsumin94 ▴ 20

Suppose I have reads from 2 species and I assembled the reads into contigs. Then suppose I binned the contigs into 2 clusters, cluster1 and cluster2. And suppose the image below is the result of binned contigs.(supposing I know which contig derived from which species)

        species1    species2
cluster1    50          1000
cluster2    150            10

What is the precision and recall respectively in this case?
Is precision = (150+1000)/(1000+150+50+10)?
I've searched the definition of precision and recall here https://en.wikipedia.org/wiki/Precision_and_recall, and I'm not sure if there is true negative and false negative in this case.
At https://www.nature.com/articles/nmeth.3103, which is a paper on binning contigs, it says recall is the proportion of each species binned to the same cluser. What does the same cluster mean in this case?
Do I have to solve for each species its recall(50/(150+50), 1000/(1000+10) respectively) and get an average?

While searching through the internet, I've found precision and recall in clutering at https://stats.stackexchange.com/questions/15158/precision-and-recall-for-clustering Is this the definition I should apply for each pair of contigs?

Thank you very much.

binning • 1.1k views
ADD COMMENT

Login before adding your answer.

Traffic: 2467 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6