13 months ago
Republic of Ireland
Your question could not have been asked at a better time, as the area to which you are alluding is more or less called integrated omics, i.e., piecing together the clues given by each technology and technique used to probe disease in order to make better sense of disease. If we just look at DNA, we won't solve disease; if we look at just gene expression, we'll neither solve it; if we just look at metabolomics, same story... However, by piecing everything together, we stand a really great chance of understanding disease mechanisms better.
That said, it's not an easy task and the way of integrating all of these various datatypes is by no means standardised. From my perspective, it will have to be a very carefully planned and methodical process. My logical way of approaching it would be to analyse each type of data separately, build separate predictive (or other) models, and to then try to make sense of the results in combination. I would love to comment further but I've just been completing my own study on a type of cancer where I have specifically been integrating multiple data-types, and the manuscript has already been submitted.
There are tentative examples of analysing data-types together, most notably eQTL analysis, which looks at gene expression in combination with genotyping. It would also be possible to, for example, see how mutations in certain genes (DNA-seq) affect the level of copy number alterations (NGS or copy number array) in a tumour (indicative of genomic instability), or to check how, for example, an intergenic mutation (DNA-seq) produces a novel transcription factor binding site (ChIP-seq) and drives expression of a nearby gene (RNA-seq).
Other than what I've mentioned here, the best way to learn about how we process cancer data is by reading published manuscripts. There have been many great TCGA publications. Also, if you are really new at this area, then I would love for you to read the following manuscript, which should open your eyes as to the power of merging all of these data-types together in order to better understand disease: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4720521/
I would also love for you to read this manuscript in order to open your eyes about the importance of 'thinking in 3D' in relation to DNA: https://www.ncbi.nlm.nih.gov/pubmed/23817049