Biostars beta testing.
Question: RPM data analysis with DESeq2
0
Entering edit mode

I have a table of RPM data for gene expressions of all human genes in two groups of patients. (each group includes several samples)

I am going to use DESeq2 to explore if there is any gene whose expression significantly changes between two groups.

DESeq2, however, seems like accepts only nu-normalized counts for the RNA seq. Is there a way to feed the DESeq2 with RPM data?

ADD COMMENTlink 10 months ago Pamira • 10 • updated 10 months ago manuel.belmadani • 830
Entering edit mode
0

Cross-posted on Bioconductor: https://support.bioconductor.org/p/120164/

ADD REPLYlink 10 months ago
Kevin Blighe
43k
1
Entering edit mode

I think DESeq2 expects counts, so passing it some normalized values (like TPM, RPKM, FPKM) is probably not optimal.

Do you have RPKM, or RPM (Reads-per-million)? If you have the latter, you can convert them to RPKM by dividing each RPM by the length of the gene in kilobases. If you want to get counts, you could multiply your RPM by (Number of reads in Sample / 1 000 000) and that should give you counts, which you should be able to use with DESeq. Make sure you know whether you have RPMs or RPKMs.

See also: https://support.bioconductor.org/p/56275/#56299, which suggests converting back to count preferably, or if you're stuck with RPKM then use limma instead of DESeq2.

Also, handy reference on RPKM, FPKM, TPM measures: https://www.rna-seqblog.com/rpkm-fpkm-and-tpm-clearly-explained/

ADD COMMENTlink 10 months ago manuel.belmadani • 830
Entering edit mode
1

Thank you so much for clarification! I got the count data. I will work on it. This is my first experience w/ DESeq though

ADD REPLYlink 10 months ago
Pamira
• 10

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.0