PanCanAtlas EBPlusPlus-corrected RNA-seq TCGA dataset
16 months ago
Ld_60 • 30

Hi, I am wondering which normalisation format (RPKM, FPKM, TPM, etc.) the PanCanAtlas EBPlusPlus-corrected RNA-seq TCGA dataset (the EBPlusPlusAdjustPANCAN_IlluminaHiSeq_RNASeqV2.geneExp.tsv file available here) is in. I know it is batch-corrected, but I don't know which normalisation format the original data was in.

Thanks a lot for your help.

16 months ago

Hi, I am working on a deep-learning-based multi-category tumor classification project and have downloaded the same file as my dataset. Where can I get the tumor type (one of the 33) associated with each TCGA case? Any help would be useful.

14 months ago
David_emir • 340
India

TCGA uses the Fragments Per Kilobase of transcript per Million mapped reads (FPKM) and FPKM Upper Quartile (FPKM-UQ) methods for normalisation. Both normalise for sequencing depth and gene length. FPKM takes into account that two reads can map to one fragment, so such a fragment is not counted twice. TCGA describes its normalisation methods here, and I quote: "normalized using the Fragments Per Kilobase of transcript per Million mapped reads (FPKM) and FPKM Upper Quartile (FPKM-UQ) methods with custom scripts."
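For anyone who wants to see what FPKM and FPKM-UQ look like in practice, here is a minimal sketch. It is not TCGA's actual script. The function names and the toy counts are made up for illustration, and the FPKM-UQ formula (dividing by the 75th-percentile count instead of the total count) and the quantile interpolation are my assumptions about how the method is usually described.

```python
def quantile(values, q):
    """Linear-interpolation quantile of a non-empty list, with q in [0, 1]."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def fpkm(counts, lengths):
    """FPKM = fragments * 1e9 / (gene length in bp * total mapped fragments)."""
    total = sum(counts.values())
    return {g: c * 1e9 / (lengths[g] * total) for g, c in counts.items()}


def fpkm_uq(counts, lengths):
    """Same as FPKM, but scaled by the upper-quartile count (assumed formula)."""
    uq = quantile(list(counts.values()), 0.75)
    return {g: c * 1e9 / (lengths[g] * uq) for g, c in counts.items()}


# Toy example: fragment counts and gene lengths (bp) for one sample.
counts = {"geneA": 100, "geneB": 300, "geneC": 600}
lengths = {"geneA": 1000, "geneB": 2000, "geneC": 3000}

print(fpkm(counts, lengths))
print(fpkm_uq(counts, lengths))
```

The only difference between the two is the denominator: the total count is sensitive to a few very highly expressed genes, while the upper quartile is less so.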

14 months ago
user31888 • 60
United States

The matrix 'EBPlusPlusAdjustPANCAN_IlluminaHiSeq_RNASeqV2.geneExp.tsv' was generated following the Firehose pipeline: MapSplice + RSEM, then normalised by setting the upper quartile to 1,000.

Pipeline details here and here.

This was discussed in another thread here.
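To make "setting the upper quartile to 1,000" concrete, here is a rough sketch of that kind of scaling for a single sample. It is not the Firehose code. The function names, the `skip_zeros` option (whether zeros are excluded before taking the quartile is an assumption on my part) and the toy values are all hypothetical.

```python
def quantile(values, q):
    """Linear-interpolation quantile of a non-empty list, with q in [0, 1]."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def upper_quartile_normalise(sample, target=1000.0, skip_zeros=True):
    """Rescale one sample so its 75th-percentile expression equals `target`.

    `skip_zeros` is an assumption: if True, genes with zero expression are
    ignored when the quartile is computed (they are still returned as 0).
    """
    values = [v for v in sample.values() if v > 0 or not skip_zeros]
    if not values:
        raise ValueError("no values available to compute the upper quartile")
    uq = quantile(values, 0.75)
    if uq == 0:
        raise ValueError("upper quartile is zero; cannot rescale")
    return {gene: v * target / uq for gene, v in sample.items()}


# Toy RSEM-like estimates for one sample.
sample = {"G1": 0.0, "G2": 10.0, "G3": 20.0, "G4": 30.0, "G5": 40.0}
normalised = upper_quartile_normalise(sample)
print(normalised)
```

After this scaling, values are comparable across samples in the sense that every sample has the same upper quartile, but they are not FPKM or TPM.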
