simulate rnaseq reads from ref. genome
1
1
Entering edit mode
6.4 years ago

Hi Biostars,

My question might be a duplicate, but what software would you recommend to simulate RNAseq reads (with different parameters like read length, PE/SE, error-rate, strandedness, etc), by using reference genome and annotations as input.

Google search and omicstools give many results, but I didn't find any soft so far fitting to my needs. ART looks good (but not specific to RNAseq), and I bet that there would be something in BBtools :)

Would appreciate any comments and suggestions. Thanks

RNA-Seq simulation bbtools • 2.0k views
ADD COMMENT
1
Entering edit mode
6.4 years ago
GenoMax 141k

randomreads.sh in BBMap suite will do what you want.

I assume you want to generate reads only from the transcriptome (i.e. provide a milti-fasta of transcripts. Don't think of any programs that can take ref genome+GTF file)? Otherwise there is nothing special about RNAseq as far as read generation would be concerned.

ADD COMMENT
1
Entering edit mode

Gffread can extract the transcripts for you.

ADD REPLY
0
Entering edit mode

Hi genomax, Thanks, and yes, basically I need reads from coding region to simulate RNAseq data. As far as I know, randomreads.sh generate reads from reference sequence, so in my case I'd have to extract transcriptome from genome and then feed it to randomreads.sh, is that correct?

ADD REPLY
0
Entering edit mode

Yes. If you have the GTF/GFF available then use @Botond's solution to get the reference transcriptome.

ADD REPLY
0
Entering edit mode

Great, thank you both!

ADD REPLY

Login before adding your answer.

Traffic: 1877 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6