Question

Novel TSS in Cufflinks - false positives adjacent to reference

0

Entering edit mode

8.4 years ago

Chris Gene ▴ 80

For an RNA-Seq data set, I performed transcription start site usage analysis using cufflinks. The workflow was as follows: multiple replicates of each genotype (2) sequenced on an Illumina platform; the reads were aligned using STAR; performed assembly using cufflinks; assembled gtf files were merged using cuffmerge (with reference annotation included). Isoform quantification was performed using cuffquant.

I wanted to analyse differential TSS usage. However when I actually look at the differentially expressed TSS, I see that this includes several "novel" TSS, but many of these have a start which differs only 1 nucleotide from the reference. Is this just a mmapping issue, and what can I do to systematcally indentify and remove these that seem to be false positive novel TSS?

Thanks

RNA-Seq Cufflinks TSS alignment • 2.5k views

ADD COMMENT • link updated 8.4 years ago by Devon Ryan 104k • written 8.4 years ago by Chris Gene ▴ 80

score 0 · Answer 1 · 2015-11-30

0

Entering edit mode

8.4 years ago

Devon Ryan 104k

You'll just have to remove any TSS within some small distance of an annotated TSS of the same gene. This isn't a mapping issue, it's an issue with how cufflinks works. Perhaps you'll get better results with stringTie, but I wouldn't hold my breath that the situation is any better there.

ADD COMMENT • link 8.4 years ago by Devon Ryan 104k

0

Entering edit mode

Thanks Devon. Can you think of any automated way to do that? For the novel TSS groups that are adjacent to the gene start I could use a cut-off; but for other TSS more downstream that are also next to annotated TSS, I'm not sure how that would be feasible, even with the annotated files.

I'm also wondering with what confidence one can trust the differential expression analysis then - if reads are redistributed to these TSS that are not actually novel, then the value of the fpkm assigned to this TSS and the adjacent canonical TSS may be miscalculated. Would you agree with that rationalle?

ADD REPLY • link 8.4 years ago by Chris Gene ▴ 80

0

Entering edit mode

bedtools closest could be used to filter the novel TSSs. Regarding how reliable the results are for differential TSS usage I can't really say. To be honest, I've never personally used the differential TSS testing.

ADD REPLY • link 8.4 years ago by Devon Ryan 104k