Get bowtie2 to ignore isoform variants from Trinity assembly?
4.8 years ago
CephBirk ▴ 20

I've run bowtie2 against my Trinity assembly, and a high proportion of paired-end reads align concordantly more than once. I suspect they're aligning to multiple isoform variants that all belong to the same "gene" group.

Fortunately, I've got a Trinity assembly that also includes a gene_trans_map that identifies which isoforms belong with which "gene" grouping.

How can I get bowtie2 to ignore isoform variants within a given "gene" group (as defined by the gene_trans_map) when it calculates the percentage of reads that align concordantly exactly once versus more than once? I haven't found anything like this in the help page, but I suspect it's a common thing to want. I would prefer not to use the -a parameter, given how much it slows things down.

RNA-Seq • 1.0k views
4.8 years ago
h.mon 35k

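Try collapsing the transcriptome assembly into SuperTranscripts, so that each Trinity "gene" is represented by a single sequence. Reads that previously hit several isoforms of the same gene then have only one target, which should alleviate the multi-mapping problem. See the Trinity documentation on SuperTranscripts.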
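
If rebuilding the reference isn't an option, a rough post-hoc alternative is to collapse the alignments to gene level yourself. The sketch below is not a bowtie2 feature. It assumes you already have (read ID, transcript ID) pairs taken from the QNAME and RNAME columns of a SAM file in which more than one alignment per read was reported (for example with bowtie2's -k option), and that each gene_trans_map line is the gene ID followed by the transcript ID, separated by a tab. The function names and the example IDs are made up for illustration.

```python
from collections import defaultdict


def load_gene_trans_map(lines):
    """Return {transcript_id: gene_id} from 'gene<TAB>transcript' lines."""
    transcript_to_gene = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        gene_id, transcript_id = line.split("\t")[:2]
        transcript_to_gene[transcript_id] = gene_id
    return transcript_to_gene


def classify_reads(alignments, transcript_to_gene):
    """Count reads whose alignments fall in one gene group vs. several.

    alignments: iterable of (read_id, transcript_id) pairs, one per
    reported alignment. Unaligned records (transcript_id == "*") are skipped.
    """
    genes_per_read = defaultdict(set)
    for read_id, transcript_id in alignments:
        if transcript_id == "*":
            continue
        # Fall back to the transcript ID itself if it is missing from the map.
        gene_id = transcript_to_gene.get(transcript_id, transcript_id)
        genes_per_read[read_id].add(gene_id)

    counts = {"one_gene": 0, "multi_gene": 0}
    for genes in genes_per_read.values():
        key = "one_gene" if len(genes) == 1 else "multi_gene"
        counts[key] += 1
    return counts


# Hypothetical example data (IDs are illustrative only).
example_map = [
    "TRINITY_DN1_c0_g1\tTRINITY_DN1_c0_g1_i1",
    "TRINITY_DN1_c0_g1\tTRINITY_DN1_c0_g1_i2",
    "TRINITY_DN2_c0_g1\tTRINITY_DN2_c0_g1_i1",
]
example_alignments = [
    ("readA", "TRINITY_DN1_c0_g1_i1"),  # two isoforms of the same gene
    ("readA", "TRINITY_DN1_c0_g1_i2"),
    ("readB", "TRINITY_DN1_c0_g1_i1"),  # two different genes
    ("readB", "TRINITY_DN2_c0_g1_i1"),
    ("readC", "TRINITY_DN2_c0_g1_i1"),  # single alignment
]

example_counts = classify_reads(example_alignments, load_gene_trans_map(example_map))
print(example_counts)  # {'one_gene': 2, 'multi_gene': 1}
```

Note that this only reclassifies alignments bowtie2 actually reported, so it depends on reporting multiple alignments per read, which costs run time. It also treats a read that hits the same gene twice (for instance in a repeat within one isoform) as uniquely placed. SuperTranscripts avoid both issues by changing the reference instead.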


Thanks! This is just what I was looking for and does shift about 20% of my reads from the "aligned concordantly >1 times" category to the "exactly one time" category.
