Biostar Beta. Not for public use.
Question: How to overcome "duplicate row.names" error while importing tpm/count/fpkm matrix/table in R ?
0
Entering edit mode

HI,

I used RSEM to produce TPM matrix for 100 RNA seq reads (row names= gene, columns are cell number) I get the following error, can anyone help me please!

> all <-read.table(file="tpm_matrix.xls",header=T)
Error in read.table(file = "tpm_matrix.xls", header = T) : 
  duplicate 'row.names' are not allowed

I changed the format of row names from "gene10000_Ermap" to "Ermap".

ADD COMMENTlink 23 months ago Jon • 90 • updated 23 months ago andrew.j.skelton73 5.7k
Entering edit mode
1

If your file is rather a text file but no MS xls, you can use cut -f 1 tpm_matrix.xls | sort | uniq -c | sort -k1,1nr to find the duplicated row names. If there are only a few, you'll curate that manually; if you have a lot, you can use e.g. awk to add a consecutive number to the name.

ADD REPLYlink 23 months ago
michael.ante
♦ 3.3k
Entering edit mode
0

This helped, Thank you so much

ADD REPLYlink 23 months ago
Jon
• 90
1
Entering edit mode
library(gdata)
data.in <- read.xls("tpm_matrix.xls")

read.table is trying to create a matrix, and as you've observed duplicate row names aren't allowed. You can get around this by importing it as a data frame.

Disclaimer: Untested, and probably a Tidyverse way to do it now instead of openxlsx...

ADD COMMENTlink 23 months ago andrew.j.skelton73 5.7k
Entering edit mode
0

Hi, Thanks but I get the following error!

data.in <- read.xlsx("tpm_matrix.xlsx")
Error in file(con, "r") : invalid 'description' argument
In addition: Warning message:
In unzip(xlsxFile, exdir = xmlDir) : error 1 in extracting from zip file
ADD REPLYlink 23 months ago
Jon
• 90
Entering edit mode
1

I've edited my OP - assumed it was an xlsx file, not xls... my bad.

ADD REPLYlink 23 months ago
andrew.j.skelton73
5.7k
Entering edit mode
0

tidyverse way is to use readxl. See here

ADD REPLYlink 23 months ago
russhh
♦ 4.4k

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.0