RNA-Seq data analysis mapping to closest genome as reference genome
1
0
Entering edit mode
5.3 years ago
ramzz.1987 • 0

Can I use the closest genome as the reference genome for RNA-Seq analysis instead of Denove genome assembly. For example, I have a genome with strain A and can I use the same genome with strain B as my reference genome. Please give your feedback if it makes sense biologically.

RNA-Seq alignment • 1.2k views
ADD COMMENT
1
Entering edit mode

Is it possible that you can assemble the transcript?

ADD REPLY
0
Entering edit mode
5.3 years ago
unawaz ▴ 60

It depends on how closely related the two organisms are. Otherwise you can perform a de novo transcriptome assembly using tools like Trinity (https://github.com/trinityrnaseq/trinityrnaseq/wiki), and then BLAST it against the genome of the closely related one.

ADD COMMENT
0
Entering edit mode

Actually, I want to perform Reference based analysis and when I tried mapping to closest genome I found around 80% alignment rate. But I want to make sure does this analysis make biological sense.

ADD REPLY
0
Entering edit mode

You might see an 85-90% alignment rate for some protocols aligned against a well-characterized genome (such as human or mouse).

So, in that context, an 80% alignment rate wouldn't seem too bad, and having the more thoroughly reviewed annotations might be helpful (versus trying to work with the Trinity assembly). However, for your particular library and analysis strategy, I don't know what the alignment rate against a complete, published genome would be.

ADD REPLY
0
Entering edit mode

Hi Dr. Charles,

Thank you very much for your reply. It is a bacterial published complete genome. We have reviewed annotations and seems to be good.

ADD REPLY
0
Entering edit mode

If you use TopHat, perhaps you check the unaligned reads for contamination? For example, you could try doing an assembly of the unaligned reads and then BLAST some large and/or high coverage contigs.

If you had a pure sample for a bacterial genome with relatively low population variation (and/or minimal regions that are hard to assembly for the genome sequence), an 80% RNA-Seq alignment may be low. However, it is hard for me to say for certain.

ADD REPLY

Login before adding your answer.

Traffic: 2038 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6