Question

Abundance Counting for Non-Model Organism for edgeR

0

Entering edit mode

4.9 years ago

sdbaney ▴ 10

I am working with a non-model organism (no annotated genome or the option to use an annotation file in my abundance counting). I performed de novo transcriptome assembly of the Illumina HiSeq4000 reads.

I originally used StringTie as my abundance counter because I didn't need a gtf annotation file and it created one I could use for the differential expression analysis but I am now seeing that because of its feature to predict isoforms, I am having trouble matching up their IDs with my assembly IDs (there being more stringtie IDs than assembly because of its novel prediction).

Is there any way to disable this feature in StringTie? Is there another program I can use to create a gtf file so that I can use featureCounts or just count abundances that will not require a gtf?

RNA-Seq abundance edgeR • 1.0k views

ADD COMMENT • link updated 4.9 years ago by ATpoint 81k • written 4.9 years ago by sdbaney ▴ 10

score 0 · Answer 1 · 2019-05-22

0

Entering edit mode

4.9 years ago

ATpoint 81k

You can use salmon to quantify reads against a transcriptome in fasta format. From there on you can summarize transcript count to the gene level with tximport.

ADD COMMENT • link 4.9 years ago by ATpoint 81k

0

Entering edit mode

Thank you, I think that’s what I’m going to end up doing. As stringtie seems to be causing more headache than anticipated.

ADD REPLY • link 4.9 years ago by sdbaney ▴ 10