Finding UTR regions in the RNA sequencing data from a non-model organism
2
1
Entering edit mode
7.5 years ago
seta ★ 1.9k

Hi all friends,

I'm working on a RNA-seq project of a non-model plant, the library is generated from mRNA fraction (enriched with polyA) and sequenced as PE, stranded-specific. I have done de novo transcriptome assembly and annotation. Now, I would like to know if there is any way to determine the 3 and 5 UTR region of genes? Regarding 3 UTR, Since the library enrichment was done by oligo-dT primers, I'm not concerned about it, but I don't know what is the right procedure to determine these regions? Could you please advise me on this issue?

Thanks in advance

UTR RNA-seq non-model organism • 4.1k views
ADD COMMENT
0
Entering edit mode

Hi!, I have to run a similar task...Did you find any solution?

Thanks!

ADD REPLY
7
Entering edit mode
7.5 years ago
EVR ▴ 610

Hi Seta,

Use Transdecoder. It will output gff3 file which includes all the transcripts and their respective UTRs location and CDS location. Its very effective and reliable.

ADD COMMENT
3
Entering edit mode
7.5 years ago

You want to use an ORF finder to find the longest open reading frame in each transcript. EMBOSS has tools for this. One thing to be careful of is that if you have missed the 5' end of the transcript, you might not find a start codon for all your transcripts. The UTR is then most probably the sequence after the longest ORF.

ADD COMMENT

Login before adding your answer.

Traffic: 2957 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6