How to find out sample type information of '.rna_seq.augmented_star_gene_counts.tsv' files in TCGA?
1
0
Entering edit mode
3 months ago

I downloaded around 900 '.rna_seq.augmented_star_gene_counts.tsv' files from TCGA using GDC manifest tool. Now I want to know the sample type information (eg., primary tumor or normal ) of each of these files. I have looked into 'Biospecimen.json' and all possible .json files for getting the sample type information. 'Biospecimen.json' has all the information about portions and aliquots but associated RNA seq file names are not there. Also, any ID that can map these RNA seq files are not present.

I found a similar question in Biostars. It goes here : how to find sample type information from TCGA?

The solutions provided here did not work for me. Please help. TIA.

star_gene_counts TCGA • 568 views
ADD COMMENT
0
Entering edit mode

TCGA is not specific enough as a source. You can use TCGABiolinks to download clinical and other metadata based on cohort and platform.

ADD REPLY
0
Entering edit mode
3 months ago
Zhenyu Zhang ★ 1.2k

When you added your files, there is a row of buttons in the GDC cart that you can use to download additional metadata. "sample type" is directly downloadable in the sample_sheet tsv file buttons

Btw, this sample_type is very overloaded, and has been refactored. I think you'd better to look into tissue type, tumor descriptor, etc in the future.

ADD COMMENT
0
Entering edit mode

Thanks a lot for the help!

ADD REPLY

Login before adding your answer.

Traffic: 1844 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6