Finding fragmented transcriptome
6.9 years ago
EVR ▴ 610

Hi,

I have a de novo transcriptome assembly generated with the Trinity pipeline. I would like to know whether this assembly is fragmented or not.

Thanks in advance

transcriptome denovo_assembly • 1.5k views
6.9 years ago
Michael 54k

Even though you gave no other details, such as assembly statistics, the answer to the question as asked is YES. Any transcriptome you can get with current technology will be fragmented, because the sequencing technology generates fragments and there is no guarantee that all of them can be assembled correctly and completely. However, I assume that is not what you had in mind. More likely you want to know how fragmented your transcriptome is and how much that affects your analysis, or, better, how complete it is. To address this, you need to calculate transcriptome statistics. Trinity has a script that gives you some of the stats, but you may also want to try other evaluation methods such as BUSCO.

$TRINITY_HOME/util/TrinityStats.pl  Trinity.fasta

Trinity also has downstream analysis tools for assessing the completeness of the transcriptome.
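For readers who want to see what contig-length statistics of this kind look like, here is a minimal Python sketch that computes contig count, total bases, mean and median length, and N50 from FASTA lines. It is not Trinity's implementation, and the function names (`read_fasta_lengths`, `n50`, `summarize`) are made up for illustration. Keep in mind that length statistics alone say little about completeness, which is why a BUSCO-style check is suggested elsewhere in this thread.

```python
import statistics
from typing import Dict, Iterable, List


def read_fasta_lengths(lines: Iterable[str]) -> List[int]:
    """Return the length of every sequence found in FASTA-formatted lines."""
    lengths: List[int] = []
    current = None  # length of the record being read; None before the first header
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if current is not None:
                lengths.append(current)
            current = 0
        elif current is not None:
            current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def n50(lengths: List[int]) -> int:
    """Length L such that contigs of length >= L hold at least half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty input


def summarize(lengths: List[int]) -> Dict[str, float]:
    """Basic contig-length summary (hypothetical helper, not Trinity output)."""
    count = len(lengths)
    return {
        "contigs": count,
        "total_bases": sum(lengths),
        "mean_length": sum(lengths) / count if count else 0.0,
        "median_length": statistics.median(lengths) if count else 0.0,
        "n50": n50(lengths),
    }
```

To try it on an assembly, something like `summarize(read_fasta_lengths(open("Trinity.fasta")))` should work, assuming the file fits the plain FASTA layout handled above.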

6.9 years ago
Rohit ★ 1.5k

Usually you can't expect full-length transcripts from de novo Illumina assemblies unless you have some long reads to scaffold them. One way to go about it is to look for orthologs of your transcripts in a closely related model species and compare their lengths to yours. For this you would need to extract the CDS, translate it into a protein sequence and then BLAST it against UniProt or the NCBI nr database. Comparing your protein length with that of the most similar sequence tells you how much you might be off by. Your transcripts may also be shorter simply because the protein is smaller in your organism; in those cases you just have to check that your CDS is complete, i.e. has a start and a stop codon. Even a complete CDS can still be missing an exon or more, due to misassembly or isoforms.
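The two checks described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the standard genetic code (ATG start; TAA, TAG, TGA stops), an arbitrary 0.8 length-ratio cutoff, and input tuples of `(transcript_id, query_protein_length, best_hit_length)` that you would have to build yourself, for example from BLAST tabular output that includes the `qlen` and `slen` columns. All function names are hypothetical.

```python
from typing import Iterable, List, Tuple

# Assumption: standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_is_complete(cds: str) -> bool:
    """True if the CDS starts with ATG, ends with a stop codon,
    and its length is a whole number of codons."""
    cds = cds.upper()
    return (
        len(cds) >= 6
        and len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
    )


def length_ratio(query_protein_len: int, best_hit_len: int) -> float:
    """Assembled protein length as a fraction of its best database hit."""
    if best_hit_len <= 0:
        raise ValueError("best_hit_len must be positive")
    return query_protein_len / best_hit_len


def flag_possible_fragments(
    hits: Iterable[Tuple[str, int, int]], min_ratio: float = 0.8
) -> List[str]:
    """Return IDs whose protein is shorter than min_ratio of the best hit.

    The 0.8 default is an arbitrary illustrative cutoff, not a standard.
    """
    return [
        transcript_id
        for transcript_id, query_len, hit_len in hits
        if length_ratio(query_len, hit_len) < min_ratio
    ]
```

A low ratio is only a hint that a transcript is fragmented; as noted above, the protein may genuinely be shorter in your organism, and a complete CDS with a good ratio can still lack an exon.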

BUSCO as mentioned by Michael is always a good standard check at the end.
