Calculating significant variance of gene expression across multiple samples
1
0
Entering edit mode
5.4 years ago
caggtaagtat ★ 1.9k

Hello,

For my experiments, I want to calculate if a gene is significantly variable expressed across 40 biological similar samples in R. Until know, I assigned two random groups of the size of 20 and followed the DSEQ2 protocol for differential expression. However, 99.9% of the genes were asigned with an adjusted p-value of 0.9987168, which does not allow further differenciation. Maybe just looking at the variance of the normalized gene expression could be an alternative solution. So my question is, how would you check for the most variable expressed genes across 40 samples and would you use the DESEQ2 package for that purpose?

RNA-Seq DGE DSEQ2 R • 3.1k views
ADD COMMENT
1
Entering edit mode
5.4 years ago
ATpoint 81k

You can start by transforming your raw counts with either vst or rlog, followed by using the rowVars function to select the most variable genes:

## be dds your DESeq2 object:
vsd <- vst(dds)

rv <- rowVars(assay(vsd))

## say you want the top 500 (ntop=500)
ntop <- 500
select <- order(rv, decreasing = TRUE)[seq_len(min(ntop, length(rv)))]

## get the DESeqTransform object with the top 500 most variable genes
vsd500 <- vsd[select,]

This is pretty much copied from getMethod("plotPCA", "DESeqTransform"), the PCA function of DESeq2 that by default performs the PCA with the 500 most variable genes.

ADD COMMENT
0
Entering edit mode

Thank you, I will try that!

ADD REPLY

Login before adding your answer.

Traffic: 2418 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6