Where can I find the analysis ID for Firehose TCGA RNA-Seq samples?
7.1 years ago
-_- ★ 1.1k

I have been able to download RNA-Seq expression levels from Firehose, and the output looks like this:

python>  import fbget
python>  print fbget.mrnaseq("egfr", cohort="ucs")
tcga_participant_barcode  gene  expression_log2  z-score          cohort  sample_type  protocol  geneID
TCGA-QN-A5NN              EGFR  7.06162500905    -0.59899352506   UCS     TP           RSEM      1956
TCGA-QM-A5NM              EGFR  8.16734387649    -0.298443593752  UCS     TP           RSEM      1956
TCGA-NG-A4VW              EGFR  8.93092623547    0.0932667888031  UCS     TP           RSEM      1956

However, I don't think this sample identification is specific enough, because (tcga_participant_barcode, cohort, sample_type) is effectively the only ID for each sample. E.g. what if a participant has multiple EGFR TP samples? I learned from metadata I previously downloaded from CGHub that the same patient can have multiple TP samples in the same cohort, and that they are distinguished by a unique analysis ID that looks like 8b846dc9-2a7b-4145-aa7e-c9b5af8beeda. Where are the additional samples in Firebrowse?
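To make the concern concrete, here is a minimal sketch that checks whether any (participant, gene, cohort, sample_type) key occurs more than once in tab-separated output like the above. The function name `find_ambiguous_keys` is hypothetical and not part of fbget; the rows are the three shown in the question, and the code assumes the column names from that header.

```python
import csv
import io
from collections import Counter

# The three rows from the fbget output above, as tab-separated text.
FBGET_OUTPUT = """tcga_participant_barcode\tgene\texpression_log2\tz-score\tcohort\tsample_type\tprotocol\tgeneID
TCGA-QN-A5NN\tEGFR\t7.06162500905\t-0.59899352506\tUCS\tTP\tRSEM\t1956
TCGA-QM-A5NM\tEGFR\t8.16734387649\t-0.298443593752\tUCS\tTP\tRSEM\t1956
TCGA-NG-A4VW\tEGFR\t8.93092623547\t0.0932667888031\tUCS\tTP\tRSEM\t1956
"""


def find_ambiguous_keys(tsv_text):
    """Return {(participant, gene, cohort, sample_type): count} for keys seen more than once.

    Hypothetical helper: assumes the column names shown in the header above.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    counts = Counter(
        (row["tcga_participant_barcode"], row["gene"], row["cohort"], row["sample_type"])
        for row in reader
    )
    return {key: n for key, n in counts.items() if n > 1}


# No key repeats in the three rows shown, so this prints an empty dict.
print(find_ambiguous_keys(FBGET_OUTPUT))
```

An empty result only means that no key repeats in the returned rows. It cannot tell whether additional samples exist upstream, which is exactly the question here.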

RNA-Seq TCGA Firehose Firebrowse • 2.5k views

I don't see the point of adding "urgent" to your post; it won't get you a higher priority. Everyone here is a volunteer and will reply whenever it suits them best.


Thanks. I removed it.

7.1 years ago
-_- ★ 1.1k

I ended up bulk-downloading all the RSEM normalized gene data from Firehose with firehose_get. It seems to work, and the downloaded data contains longer barcodes, which are specific enough for my usage.

./firehose_get -o RSEM_genes_normalized__data.Level_3 stddata latest
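For working with the longer barcodes, here is a rough sketch that splits a full aliquot-level TCGA barcode into its documented fields and recovers the short participant barcode that fbget reports. The names `TcgaBarcode` and `parse_aliquot_barcode` are hypothetical, the barcode in the example is illustrative rather than taken from the downloaded data, and the regular expression assumes the standard seven-part layout.

```python
import re
from typing import NamedTuple


class TcgaBarcode(NamedTuple):
    """Fields of an aliquot-level TCGA barcode (hypothetical helper type)."""
    project: str           # always "TCGA"
    tss: str               # tissue source site
    participant: str       # participant code
    sample_type_code: str  # e.g. "01" = primary solid tumor (TP)
    vial: str
    portion: str
    analyte: str           # e.g. "R" = RNA, "D" = DNA
    plate: str
    center: str

    @property
    def participant_barcode(self):
        """The short 12-character barcode, e.g. TCGA-02-0001."""
        return f"{self.project}-{self.tss}-{self.participant}"


# Assumed layout: TCGA-TSS-Participant-SampleVial-PortionAnalyte-Plate-Center
_BARCODE_RE = re.compile(
    r"^(TCGA)-(\w{2})-(\w{4})-(\d{2})([A-Z])-(\d{2})([A-Z])-(\w{4})-(\d{2})$"
)


def parse_aliquot_barcode(barcode):
    """Split a full aliquot barcode into fields; raise ValueError if it does not match."""
    match = _BARCODE_RE.match(barcode)
    if match is None:
        raise ValueError(f"not a full aliquot-level TCGA barcode: {barcode!r}")
    return TcgaBarcode(*match.groups())


# Illustrative barcode in the standard layout (not from the Firehose download).
parsed = parse_aliquot_barcode("TCGA-02-0001-01C-01D-0182-01")
print(parsed.participant_barcode, parsed.sample_type_code, parsed.vial, parsed.analyte)
```

Grouping the downloaded columns by `participant_barcode` and `sample_type_code` would then show which participants contribute more than one sample of the same type.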

Still, the TCGA barcode is not specific enough: a single barcode can correspond to multiple analysis IDs. :(
