Histogram of p values for all tests
2
1
Entering edit mode
5.5 years ago

Hi, I am trying to do differential-gene-expression analysis with DESeq2, for that i have done pseudo-alignment using Salmon , and with tximport, gene-level abundance is calculated. I am following "Analyzing RNA-seq data with DESeq2", tutorial. I got Histogram of p values for all tests.

enter image description here

Please, guide me to interpret this plot. Can I proceed with the data, (for heatmap generation), showing above plot.

Thanks in advance!

RNA-Seq DESeq2 • 4.2k views
ADD COMMENT
0
Entering edit mode

Correct what? Please specify.

ADD REPLY
1
Entering edit mode
5.5 years ago

Start reading this excellent article here : http://varianceexplained.org/statistics/interpreting-pvalue-histogram/

These two peaks at the right tail are strange. Did you use the nominal pvalue or the corrected one (you should use the nominal for the plot) ?

Otherwise the distribution seems to be anti conservative so it's good 😉

ADD COMMENT
0
Entering edit mode
5.5 years ago
tw617 ▴ 40

The two main patterns you should look for when interpreting p-value histograms are: a sharp spike near 0 ( like you have here) that drops off into a somewhat uniform distribution. This indicates that you have some significant differences. The other pattern would be a totally uniform (flat) distribution, this indicates no significant difference. Here it looks like you have the first and ideal pattern although those spikes near one are a bit weird. I think they are probably errors/genes with no or little reads that were not filtered out. Usually you should exclude genes with less than 10 total reads.

ADD COMMENT

Login before adding your answer.

Traffic: 3918 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6