Transcriptome reconstruction from both short reads and long sequences
3
1
Entering edit mode
9.0 years ago
Prakki Rama ★ 2.7k

Hi all, Could I please know if there is any tool to reconstruct transcriptome from reference genome using both short read and long sequences from pacific biosciences at one go simulataneously.

I need something like Scripture, but it is limited to short reads. I would prefer a tool which can deal with both short and long at the same time. Thanks in advance for your suggestions.

Mapping BAM SAM • 2.4k views
ADD COMMENT
2
Entering edit mode
9.0 years ago
mark.ziemann ★ 1.9k

How long are your reads?

You can use pre-aligned data (bam files) for scripture.

http://www.broadinstitute.org/software/scripture/Walkthrough_example

Use a dedicated long-read aligner to generate the alignment and then use Scripture to do the reconstruction. STAR might be a good option for alignment depending on error rates.

ADD COMMENT
0
Entering edit mode

Thank you. I want to use both Illumina as well as pacbio data. The example shows only Illumina reads mapped to genome. So, you say, as long as the data is in pre-aligned format (bam files) it should be ok? My long read data mean length is 6.3 kb.

ADD REPLY
0
Entering edit mode

Recent versions of STAR (i.e., 2.4.1c) are distributed with STARlong that is optimised for reads >200bp in length. There isn't anything in the manual about it, but here is a comment from the author about it. You may want to use standard STAR for Illumina reads and STARlong for the PacBio reads.

ADD REPLY
1
Entering edit mode
9.0 years ago
5heikki 11k

A de novo approach using idba_tran:

idba_tran -r $1 -l $2 -o $3 --num_threads 16 --mink 20 --maxk 100 --step 5
-r pe reads in interleaved fasta
-l long reads in fasta
-o output dir
ADD COMMENT
0
Entering edit mode

Have not tried this. Will have to check. Thank you.

ADD REPLY
1
Entering edit mode
9.0 years ago
CraigM ▴ 90

How about MIRA?

An EST assembler which can perform hybrid assemblies using platforms of different read lengths.

I do not have first hand experience of trying this tool yet but believe it can do what you are looking for.

http://sourceforge.net/p/mira-assembler/wiki/Home/

A list of software for PacBio assembly, including hybrid assemblies, can be found here https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/Large-Genome-Assembly-with-PacBio-Long-Reads

ADD COMMENT
0
Entering edit mode

MIRA appears denovo assembler. I want to use genome as reference and run it. Will have check the tool. Thanks for the suggestion.

ADD REPLY

Login before adding your answer.

Traffic: 2168 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6