Standard options for mapping with STAR in context of RNA-seq data analysis regarding splicing
1
0
Entering edit mode
6.2 years ago
caggtaagtat ★ 1.9k

Hi,

During my analysis of RNA-seq data regarding alternative splicing and splicing pattern, I came across some standard options of the STAR algorithm, which I could not quiet follow.

--alignSJoverhangMin 8
(minimum overhang for unannotated junctions)
--alignSJDBoverhangMin 1
(minimum overhang for annotated junctions)

I have concerns using this standard options, which regulate the minimal overlap of a read over the exon junction. I don't understand why one nucleotide overlap is enough to map a read to an annotated exon junction. And why would you not generally use the same minimum overlaps for a junction to begin with?

So my question is, what would you use as an minimal overlap? Would you use the standard settings or for example 6nt for both junctions?

RNA-Seq STAR splicing options • 2.8k views
ADD COMMENT
2
Entering edit mode
6.2 years ago

1 base is enough for an annotated junction because the it's known a priori to be possible. The threshold is higher for novel junctions, simply because they're novel and smaller values will tend to lead to more spurious findings.

As a general rule, always start with the default settings (except for the number of threads).

ADD COMMENT
0
Entering edit mode

Ok I will continue using the default settings than.

However, I still can't wrap my head around why lowering the treshold for annotated junctions does not lead to missmapped reads at this position? How is this possible? Or does STAR prevent missmapping at positions of annotated junctions at another level?

ADD REPLY
0
Entering edit mode

I'm not sure why you'd think such mappings would be wrong, they're incredibly likely to be correct for the simple reason that the junction is annotated.

ADD REPLY
0
Entering edit mode

Would'nt the possibilty for wrong mapping be 3/4, when a read is mapped with just one nucleotide overlap?

EDIT: Thanks for editing btw, this forum is great! :)

ADD REPLY
0
Entering edit mode

You a priori expect reads spanning that junction, so no the probability would be reasonably low.

ADD REPLY

Login before adding your answer.

Traffic: 2052 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6