RNAseq more sequences than reads after mapping
0
0
Entering edit mode
3.6 years ago
avelarbio46 ▴ 30

Hello everyone! I'm doing QC analysis of my RNA-seq data so we can do the library prep for more samples

We got one sample that failed (5M reads) and one good to go (60M reads). We got this number of reads via FASTQC on the fastq file after trimming adapters and bad quality (<30) bases.

But when I map with galaxy, in STAR RNA (2pass filter hg38) and then align with htseq, we got 140 million sequences aligned, half duplicated and half unique, which is very good for transcript.

My question is if that this is normal, because I've never seen the number of sequences so much bigger than number of reads (i'm a newbie). 140Million is more than we expected!

Also, only less than 5% aligned to rRNA using CollectRNAmetrics, some aligned to intergenic regions (probably lncRNA) and some were small (microRNA)

RNA-Seq mapping reads • 574 views
ADD COMMENT
1
Entering edit mode

You have secondary (a read mapped to more than one location) and supplementary alignments in your file.

ADD REPLY

Login before adding your answer.

Traffic: 2038 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6