[TCGAbiolinks] miRNA expression analysis error: unidentified columns
0
0
Entering edit mode
4.2 years ago

Hello,

I am new to using R and have wanted to use to analyze expression data from TCGA. Right now I am using TCGAbiolinks and I am following their documentation on miRNA expression analysis.

I have made a few changes to code outlined there to fit analyzing the L-AML dataset, which is as follows:

Getting L-AML data

require(TCGAbiolinks) LAMLdata <- "TCGA-LAML" DataDirectory <- paste0("LAML", gsub("-", "_", LAMLdata)) FileNameData <- paste0(DataDirectory, "_", "miRNA_gene_quantification" , ".rda")

query.miR <- GDCquery(project = LAMLdata, data.category = "Transcriptome Profiling", data.type = "miRNA Expression Quantification")

samplesDown.miR <- getResults(query.miR, cols=c("cases"))

dataSmTRBM.miR <- TCGAquery_SampleTypes(barcode = samplesDown.miR, typesample = "TRBM")

dataSmNBM.miR <- TCGAquery_SampleTypes(barcode = samplesDown.miR, typesample = "NBM")

dataSmTRBM_short.mir <- dataSmTRBM.miR[1:10]

dataSmNBM_short.mir <- dataSmNBM.miR[1:10]

queryDown.miR <- GDCquery(project = LAMLdata, data.category = "Transcriptome Profiling", data.type = "miRNA Expression Quantification", barcode = c(dataSmTRBM_short.mir, dataSmNBM.miR))

GDCdownload(query = queryDown.miR, directory = DataDirectory)

dataAssay.miR <- GDCprepare(query = queryDown.miR, save = TRUE, save.filename = FileNameData, summarizedExperiment = TRUE, directory = DataDirectory)

rownames(dataAssay.miR) <- dataAssay.miR$miRNA_ID

using read_count's data

read_countData <- colnames(dataAssay.miR)[grep("count", colnames(dataAssay.miR))] dataAssay.miR <- dataAssay.miR[,read_countData] colnames(dataAssay.miR) <- gsub("read_count_", "", colnames(dataAssay.miR))

dataFilt <- TCGAanalyze_Filtering(tabDF = dataAssay.miR, method = "quantile", qnt.cut = 0.25)

dataDEGs <- TCGAanalyze_DEA(mat1 = dataFilt[,dataSmNBM_short.mir], mat2 = dataFilt[,dataSmTRBM_short.mir], Cond1type = "Normal", Cond2type = "Tumor", fdr.cut = 0.01, logFC.cut = 1, method = "glmLRT")

When I source the code, I am ultimately met with this error:

Error in [.data.frame(dataFilt, , dataSmNBM_short.mir) : undefined columns selected

What I have noticed is that dataSmNBM.miR and dataSmTRBM.miR have a character (empty) value assigned to them, and thus I suspect this to be the first cause of my woes.

Can someone explain to me where things might have gone awry? I have gone over the code multiple times and have consulted documentation and everything still seems to be logical to me.

Thank you for your time and many thanks!

tcgabiolinks • 757 views
ADD COMMENT

Login before adding your answer.

Traffic: 2409 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6