Detecting a Mutation in a Gene from RNA-Seq Data
12.1 years ago
Dataminer ★ 2.8k

Hi!

This might appear to be a very naive question to many of you.

I have RNA-seq data from a myeloid cell line (Kasumi, for example). I want to find out whether gene X carries a mutation in this data. How can I do this? I have already uploaded the data as a track in the UCSC browser, where I can see the dbSNP130 SNPs for gene X.

Is there any other approach?

Thank you for your time.

rna seq data analysis • 5.8k views
12.1 years ago

Yes, you can align against known mRNAs and, if you have a novel exon from alternative splicing, against genomic DNA. This works if your RNA-Seq data contain an allele of a SNP that differs from the reference genome: you'll see the difference, and it may coincide with a known SNP.

On the other hand, the source of your RNA-Seq data may contain only the reference allele of the SNP, matching the genomic or RefSeq mRNA perfectly. In this case, you would compare your alignment against known SNP positions to see which allele is present.

The third possibility is that the source is heterozygous, with two alleles at one position. Exactly how you detect this depends on how your RNA-Seq data were assembled into mRNAs. If the two alleles, say A and G, are each expressed at roughly the same level, does the mRNA carry an A, G, R, or N at that position? What about allele-specific differences in expression, where there are 9 transcripts with the A for every one with the G? Perhaps your assembler called an A when in reality the G is present and was detected, but is less stable or less preferred. If you are aligning the individual reads and see this allele difference, the assessment is much easier.
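To make the read-level view concrete, here is a minimal Python sketch that summarises the bases seen at one position across the reads covering it. The function name `allele_summary` and the `min_fraction` cutoff are my own assumptions for illustration, not part of any tool mentioned in this thread, and a real variant caller would also weigh base and mapping qualities.

```python
from collections import Counter

def allele_summary(bases, ref_base, min_fraction=0.05):
    """Summarise the bases observed at one position across aligned reads.

    bases: iterable of single-letter base calls from reads covering the site.
    ref_base: reference base at that site.
    min_fraction: hypothetical cutoff below which an allele is treated as noise.
    """
    counts = Counter(b.upper() for b in bases if b.upper() in ("A", "C", "G", "T"))
    depth = sum(counts.values())
    if depth == 0:
        return {"depth": 0, "fractions": {}, "call": "no coverage"}
    fractions = {b: n / depth for b, n in counts.items()}
    supported = sorted(b for b, f in fractions.items() if f >= min_fraction)
    if supported == [ref_base.upper()]:
        call = "reference only"
    elif len(supported) == 1:
        call = "non-reference only"
    else:
        call = "multiple alleles"
    return {"depth": depth, "fractions": fractions, "call": call}

# The 9:1 case described above: nine reads with A for every one with G.
summary = allele_summary("A" * 9 + "G", ref_base="A")
print(summary["call"], summary["fractions"])
```

With individual reads, the minor G allele stays visible as a fraction of the depth instead of being collapsed into a single consensus base.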

12.1 years ago

Once you have the transcript, align it against the known sequences (you can use BLAST).

12.1 years ago
Dataminer ★ 2.8k

Hi!

The answer to my own question is quite complicated, so I will show the easiest path:

First, convert the ELAND export file to SAM format (say, x.sam).

Then load this x.sam file in the IGV browser, using the IGV utility tools to sort and index the file first, and zoom in and out of the genome browser to see the mismatches against the reference genome.
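For anyone who wants the same mismatches as a list rather than by eye, here is a rough Python sketch that walks SAM records and counts read bases that differ from the reference. It is only an illustration: `mismatches_from_sam`, the toy reference string, and the two reads are all made up for the example, and it ignores base qualities and only compares aligned (M/=/X) bases.

```python
import re
from collections import defaultdict

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def mismatches_from_sam(sam_lines, reference, ref_start=1):
    """Count read bases that differ from the reference, per 1-based position.

    sam_lines: iterable of SAM text lines (header lines start with '@').
    reference: reference sequence for the region of interest (hypothetical input).
    ref_start: 1-based coordinate of reference[0].
    """
    found = defaultdict(lambda: defaultdict(int))  # position -> base -> count
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        flag, pos, cigar, seq = int(fields[1]), int(fields[3]), fields[5], fields[9]
        if flag & 4 or cigar == "*" or seq == "*":
            continue  # unmapped read or no sequence stored
        ref_pos, read_pos = pos, 0
        for length, op in CIGAR_OP.findall(cigar):
            length = int(length)
            if op in "M=X":  # aligned bases: compare each one to the reference
                for i in range(length):
                    idx = ref_pos + i - ref_start
                    if 0 <= idx < len(reference):
                        base = seq[read_pos + i].upper()
                        if base != reference[idx].upper():
                            found[ref_pos + i][base] += 1
                ref_pos += length
                read_pos += length
            elif op in "IS":  # insertion / soft clip: consumes read only
                read_pos += length
            elif op in "DN":  # deletion / skipped region: consumes reference only
                ref_pos += length
            # H and P consume neither read nor reference
    return {p: dict(b) for p, b in sorted(found.items())}

# Toy data: a 10-base reference starting at position 100 and two reads.
reference = "ACGTACGTAC"
sam = [
    "@HD\tVN:1.0\tSO:coordinate",
    "r1\t0\tchrX\t100\t60\t10M\t*\t0\t0\tACGTATGTAC\t*",
    "r2\t0\tchrX\t102\t60\t6M\t*\t0\t0\tGTATGT\t*",
]
result = mismatches_from_sam(sam, reference, ref_start=100)
print(result)  # {105: {'T': 2}}
```

Both toy reads carry a T where the reference has a C at position 105, which is the kind of column of mismatches you would look for in IGV.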

Life is simple now.

phewwwwww

Whole weekend gone


Sounds like you have figured it out. However, since you are looking at RNA-seq, I recommend you use a splice-aware tool like TopHat/Bowtie for alignment. This will allow your RNA-seq reads that span exon junctions (gapped alignments) to be viewed as well, and should result in better total coverage. I don't think ELAND does a terribly good job aligning RNA-seq data to the reference genome.
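To illustrate what a gapped alignment looks like in the output: in SAM, a read spanning an exon junction carries an `N` operation in its CIGAR string for the skipped intron. The sketch below (the function name `aligned_blocks` and the example coordinates are hypothetical) turns a read's position and CIGAR into the reference blocks it covers.

```python
import re

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def aligned_blocks(pos, cigar):
    """Return (start, end) reference intervals covered by a read, 1-based inclusive.

    pos: 1-based leftmost mapping position (SAM POS field).
    cigar: SAM CIGAR string; 'N' marks a skipped region such as an intron.
    """
    blocks = []
    ref_pos = pos
    block_start = None
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op in "M=XD":  # consumes reference within the current block
            if block_start is None:
                block_start = ref_pos
            ref_pos += length
        elif op == "N":  # intron: close the current block and jump ahead
            if block_start is not None:
                blocks.append((block_start, ref_pos - 1))
                block_start = None
            ref_pos += length
        # I, S, H and P do not consume the reference
    if block_start is not None:
        blocks.append((block_start, ref_pos - 1))
    return blocks

# A 50-base read split across a 500-base intron.
print(aligned_blocks(1000, "30M500N20M"))  # [(1000, 1029), (1530, 1549)]
```

An aligner that is not splice-aware cannot report the second block, so junction-spanning reads tend to go unaligned or get clipped, which lowers coverage near exon boundaries.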


I was also looking for mutations in RNA-seq data. I used TopHat/Bowtie and samtools to process the data. The .bam file includes the variants (both known SNPs and mutations). How can we differentiate them?

TopHat gives great output files (.bed files of insertions, deletions and junctions), and they can be viewed in the UCSC genome browser. Is it possible to detect only the mutations from these .bed files?
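One common way to approach the SNP-versus-mutation question is to compare each called variant against a table of known SNPs such as dbSNP. The Python sketch below assumes you already have both lists as (chrom, pos, ref, alt) tuples; the function name and all coordinates are invented for illustration. Note that a variant absent from dbSNP is only a candidate: it could still be a rare germline variant, an RNA editing site, or an alignment artifact.

```python
def split_known_and_novel(variants, known_snps):
    """Partition called variants by whether they match a known SNP record.

    variants: iterable of (chrom, pos, ref, alt) tuples from a variant caller.
    known_snps: set of (chrom, pos, ref, alt) tuples, e.g. parsed from a dbSNP table.
    """
    known, novel = [], []
    for variant in variants:
        (known if variant in known_snps else novel).append(variant)
    return known, novel

# Made-up coordinates, for illustration only.
called = [
    ("chr21", 1000, "A", "G"),
    ("chr21", 2000, "C", "T"),
]
dbsnp = {("chr21", 1000, "A", "G")}

known, novel = split_known_and_novel(called, dbsnp)
print("known:", known)
print("candidate novel:", novel)
```

Matching on the full (chrom, pos, ref, alt) tuple rather than position alone avoids treating a different substitution at a known SNP site as already known.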
