Is it correct to pool transcriptomes coming from different experiment for DE analysis?
1
0
Entering edit mode
5.7 years ago
keepclam ▴ 10

Hello everybody,

I downloaded from SRA all available transcriptome experiments from a nonmodel organism in order to perform comprehensive analyses. I want to analyze both sequences and differential expression. While the former seems feasible with different experiments, the latter concerns me a bit.

Is it methodologically safe to compare differential expression levels across different experiments? All of them are performed on Illumina machines, but with different technology levels (2000 / 2500 / 4000) and with different coverage.What steps do I have to perform in order to compare their transcription level? Can I pool them and do some sort of normalization?

If you'd have some reading to suggest me (papers that do an analogous analysis) it would be excellent as well.

RNA-Seq differential-expression • 839 views
ADD COMMENT
0
Entering edit mode

Q: Is it methodologically safe to compare differential expression levels across different experiments?

A: I did this exercise myself. In summary, I think it is possible, but it is usually non-trivial. The answer depends a lot on the kind of differential expression analysis that you are want to do. The level of differential expression of a gene usually depends on both the technical and experimental covariates. If the experimental covariates are actively driving the differential expression of a gene and the DE has huge effect size, it generally shows in both studies seamlessly, and sometimes the expression profiles from different studies cluster very naturally after a z-score. But often the case is that the expression profiles cluster by studies instead. And when that happens, there are Combat or some other batch correction tools that you can try using. If yeah, there are caveats associated with batch correction too......

ADD REPLY
2
Entering edit mode
5.7 years ago

Is it methodologically safe to compare differential expression levels across different experiments?

No. For transcriptomics, everything matters. The kit used for library prep, the kit used for RNA extraction, the technician performing the library prep, the temperature in the lab, probably what the technician had for lunch and the music in the lab.

You could do proceed with caution if you have something like the following setup:

  • Experiment A has 5 samples for condition 1 and 4 samples for condition 2
  • Experiment B has 3 samples for condition 1 and 5 samples for condition 2

Then you can have the "unknown experimental artefacts" as a batch effect between those experiments in your design formula.

If you have:

  • Experiment A has 5 samples for condition 1
  • Experiment B has 5 samples for condition 2

Then you can never compare condition 1 with condition 2 because it is confounded by the experimental artefacts.

ADD COMMENT

Login before adding your answer.

Traffic: 2575 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6