Biostar Beta. Not for public use.
Find out the mutation type from TCGA data
0
Entering edit mode
18 months ago
Pin.Bioinf • 240
Malaga

Hello,

I downloaded mutation data from TCGA from here: http://firebrowse.org/ select SKCM >SNP6_copynum

and the data looks as follows:

    Sample  Chromosome  Start   End Num_Probes  Segment_Mean
TCGA-3N-A9WB-10A-01D-A38I-01    1   61735   534551  31  -0.3442
TCGA-3N-A9WB-10A-01D-A38I-01    1   564621  16345255    8476    0.0159
TCGA-3N-A9WB-10A-01D-A38I-01    1   16363940    16389566    8   0.8209

Is there a way I can find out which type of mutations these are? I tried looking at the SNP6 manifest and could not get that information either.

Thank you!!

ADD COMMENTlink
1
Entering edit mode

Isn't this copy number data?

ADD REPLYlink
0
Entering edit mode

is it? I don't know, i thoutght it was both SNP and CNV data, as it said: SNP6_copynum in the folder containing the files. File name: SKCM.snp__genome_wide_snp_6__broad_mit_edu__Level_3__segmented_scna_hg19__seg.seg.txt

ADD REPLYlink
2
Entering edit mode
11 months ago
Republic of Ireland

It is processed (by GISTIC 2.0, I believe) copy number data derived from Affymetrix SNP 6.0 microarray, which has probes that target both SNPs and copy number variants. It is a commonly used platform for determining genome-wide CNV profiles (I used it during my PhD in breast cancer).

Note that you may want to instead download the file with 'minus_germline_cnv' in its name, i.e., download it from Firebrowse.

If you want to process it, take a look here:

  1. Part I
  2. Part II
  3. Part III

Kevin

ADD COMMENTlink
0
Entering edit mode

Kevin, I have a question about this: I want to get SNP data for the melanoma project from TCGA, is there any way I can find it? Maybe the protected access files in TCGA have it? I can't seem to find that information.

ADD REPLYlink
1
Entering edit mode

You mean somatic mutations? They will be stored in the open access MAF files. If you want VCF files for all tumour and normals, then they are indeed protected and you will require authorisation.

ADD REPLYlink
0
Entering edit mode

Okay thank you, and if i want to request acces for the VCF files, I know I have to specify exactly the files I need: which ones should I ask access for? there are so many VCF files here

(What I want to do is relate these mutations to the clinical data from TCGA in melanoma, which I already have: I want to look into the mutations in those samples )

ADD REPLYlink
0
Entering edit mode

I see - sounds interesting. I know of at least 2 groups who are actively working in that area, so, you better hurry up! As I understand, a key part of the immune response revolves around knockout mutations in PD1 and PDL1. Can you not just use the MAF (Mutation Annotation Format) data and use that?

ADD REPLYlink
0
Entering edit mode

But I have read that processing MAF files is very complicated... I have never done it! Or can I open them in IGV? How can I match the mutation to the gene and the specific patient (sample) from a MAF file?

EDIT: I think I found a R package that allows to read the files (maftools). Thanks!

ADD REPLYlink
1
Entering edit mode

MAF files are not easy, I admit, but neither impossible. I believe that Cyriac Kandoth developed a MAF2VCF script, but I am not sure of its utility.

Are you prepared to wait possibly months to get dbGaP approval for accessing the GDC controlled access data? It involves a lot of admin work. I recently passed through it.

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.1