This is a beta test.
Question: Get bowtie2 to ignore isoform variants from Trinity assembly?
Entering edit mode

So I've run bowtie2 against my Trinity assembly and get a high proportion of paired end reads that align concordantly at more than one place. I am suspicious that they're aligning to multiple isoform variants all from the same "gene" group.

Fortunately, I've got a Trinity assembly that also includes a gene_trans_map that identifies which isoforms belong with which "gene" grouping.

How can I get bowtie2 to ignore isoform variants with a given "gene" group (defined by gene_trans_map) when it calculates the percent of reads that concordantly align exactly once or more than one time? I haven't been able to find anything like this in the help page but I would suspect this may be a common thing people want to do... I would prefer not to have to use the -a parameter given how much this slows things down.

ADD COMMENTlink 5 months ago CephBirk • 0 • updated 5 months ago h.mon 25k
Entering edit mode

Try collapsing the trascriptome assembly into supertranscripts, this should alleviate the multi-mapping problem. See the Trinity documention on SuperTranscripts.

ADD COMMENTlink 5 months ago h.mon 25k

Login before adding your answer.

Powered by the version 1.6