What does the 'E%' represent in BUSCO results?
1
0
Entering edit mode
19 days ago
林明德 • 0

I performed a BUSCO genome mode analysis using the insecta_odb10 dataset.

The results were as follows: C: 99.8% [S: 91.1%, D: 8.7%], F: 0.1%, M: 0.1%, n: 1367, E: 3.7%.

Can someone help me interpret what "E: 3.7%" stands for?

I would really appreciate any help.

BUSCO Image tag version: ezlabgva/busco:v5.7.0_cv1 and ezlabgva/busco:v5.7.1_cv1

Code: nohup busco -i genome.fasta --auto-lineage-euk -o /path/to/outdir -m genome -c 80 &

Result report:

BUSCO • 324 views
ADD COMMENT
0
Entering edit mode

According to ChatGPT:

In BUSCO, the "E" category represents "End" or "Endof" gene fragments. These are orthologous groups for which the gene model is fragmented at the end of the sequence, indicating that the sequence is incomplete at the 3' or 5' end.

ADD REPLY
0
Entering edit mode

funny enough i tried the same thing (i've used BUSCO heaps times but never saw the E:! even the manual lists 'C:89.0%[S:85.8%,D:3.2%],F:6.9%,M:4.1%,n:3023', without the E:) and ChatGPT waffled something about E: being the percentage of eukaryotic genes in the proteome. One of us is wrong, or both :)

ADD REPLY
1
Entering edit mode
19 days ago

Ah your edit solved this. 50 BUSCOs contain internal stop codons, indicating they might be erroneous gene models, or the genes themselves are inactive/lost.

50 / 1364 * 100 = 3.66 ~ 3.7%. That's a new feature! So ChatGPT was wrong in both comments :)

This seems to be a new feature, BUSCO 5.6.0, the commit that introduces this detail is 4 months old. https://gitlab.com/ezlab/busco/-/blame/master/src/busco/busco_tools/hmmer.py?page=2#L1158

ADD COMMENT
0
Entering edit mode

Thank you for your answer! It really helped me resolve a doubt!

ADD REPLY
1
Entering edit mode

No worries! I found the commit with the maths too: https://gitlab.com/ezlab/busco/-/blame/master/src/busco/busco_tools/hmmer.py?page=2#L1350

                100 * self.complete_stop_codon_count / complete_count
ADD REPLY

Login before adding your answer.

Traffic: 1116 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6