Hi everyone,
I have a question about the GDAC firehose data from this website: (https://gdac.broadinstitute.org). I downloaded the RNASeq data: illuminahiseq_rnaseqv2-RSEM_genes and clinical data: Clinical_Pick_Tier1, and I found multiple columns in RNASeq corresponds to the same column clinical data. e.g. In BRCA:
In RNASeq data, there are columns:
TCGA-BH-A208-11A-51R-A157-07
TCGA-BH-A208-01A-11R-A157-07
However, in the clinical data, there is only one column:
TCGA-BH-A208
I have two questions:
1) Is the sample TCGA-BH-A208 a tumor sample or normal sample?
2) If I wish to match the clinical data to RNASeq data, since there is only one clinical column but two RNASeq columns, which RNASeq column should I use?
I'd appreciate it if anyone could help. Thanks.
Got it, thanks Kevin!