How many genes get covered by my assembly
1
1
Entering edit mode
6.4 years ago
Ric ▴ 430

Hello, I have a new assembly and I would like to know many genes get covered by the new assembly (completely or partially).

The gene annotation is in GFF3 format.

Is there any software available to do it or what would be the best way to do it?

Thank you in advance

Assembly gene gff3 • 1.3k views
ADD COMMENT
2
Entering edit mode

I've never used it, but as far as I know this is a job for BUSCO

ADD REPLY
0
Entering edit mode

A good start would be to describe your problem in more detail

ADD REPLY
0
Entering edit mode

The gene annotation is from another species? Another strain, same species? It is an annotation from your own assembly (then your question wouldn't make sense, but I have to ask anyway)?

ADD REPLY
0
Entering edit mode

Annotation is from another species.

ADD REPLY
0
Entering edit mode

Dear Ric, Hi

You can also "Count your Full Length Transcripts".

ADD REPLY
0
Entering edit mode
6.4 years ago
h.mon 35k

I will assume you have an unannotated assembly, and want to discover how many genes from an annotated close assembly are present in your assembly:

You can try RATT - it will transfer the annotation to the new assembly, and will output several stats about the outcome. You can also try QUAST, it has an option to compare a new assembly to a reference genome.

ADD COMMENT
0
Entering edit mode

RATT requires embl file which I do not have and quast does not work with big genomes.

ADD REPLY
0
Entering edit mode

RATT requires embl file

And apparently you did not search for convert gff to embl. Try EMBLmyGFF3, a software developed by someone with lots of experience on this issue.

From QUAST FAQ:

Current QUAST version is not optimised for large genomes yet.

It is not the same as "does not work". If you have the computational resources, it will work, if not, check FAQ #14.

ADD REPLY

Login before adding your answer.

Traffic: 2875 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6