Binary mutation data format
1
0
Entering edit mode
5.0 years ago

Hi I want to run a package named DawnRank in R. This package has some data requirements and file format. One of these requirements is a mutation file that the rows represent the genes and the columns represent the sample IDs and the mutation for each gene in each samples are binary (0,1) (you could see an example of this file here). I am not very familiar with the TCGA file formats and I couldn't found such data format. Does anyone know how can I find or create this file format?

Thank you for your help.

Mutation data TCGA DawnRank R • 1.3k views
ADD COMMENT
1
Entering edit mode

You're going to need to custom-process TCGA data to get to this matrix. Linkedomics has this sort of matrices available for download (example: BRCA dataset), although the datasets were cleaned to remove genes that were mutated in fewer than 20% of samples (I think). You may want to email them to find out what sort of filters were used.

ADD REPLY
2
Entering edit mode
5.0 years ago
igor 13k

I think the matrix you are looking for can be found at Xena gene level non-silent mutations.

ADD COMMENT

Login before adding your answer.

Traffic: 2291 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6