Hello.
I'm traying to run a GO analysis with the R package "goseq" but I have problems finding the GO Terms mappings for Prunus Persica genes. I understand that goseq needs the name of the gene and the corresponding GO term (ej: GO:0000014 AT3G52115). In the GDR FTP repository there's a "Prunus_persica_v2.0.a1_genes2GO.txt" file, but when running in goseq it throws me this error:
pwf=nullp(diff_express, bias.data=lengths, plot.fit=TRUE) Error in "row.names<-.data.frame"("tmp", value = value) : duplicate 'row.names' are not allowed In addition: Warning message: non-unique values when setting 'row.names': ‘-’, ‘Prupe.1G015800’, ‘Prupe.1G023400’, ‘Prupe.1G036900’, ‘Prupe.1G051600’, ‘Prupe.1G065300’, ‘Prupe.1G073300’, ‘Prupe.1G081800’, ‘Prupe.1G083200’, ‘Prupe.1G088300’, ‘Prupe.1G095400’, ‘Prupe.1G103000’, ‘Prupe.1G108400’, ‘Prupe.1G120600’, ‘Prupe.1G120800’, ‘Prupe.1G124400’, ‘Prupe.1G131400’, ‘Prupe.1G163700’, ‘Prupe.1G166400’, ‘Prupe.1G166700’, ‘Prupe.1G168000’, ‘Prupe.1G174800’, ‘Prupe.1G185400’, ‘Prupe.1G188100’, ‘Prupe.1G189000’, ‘Prupe.1G193700’, ‘Prupe.1G206400’, ‘Prupe.1G217600’, ‘Prupe.1G218900’, ‘Prupe.1G229000’, ‘Prupe.1G234600’, ‘Prupe.1G236400’, ‘Prupe.1G240700’, ‘Prupe.1G243400’, ‘Prupe.1G244900’, ‘Prupe.1G276000’, ‘Prupe.1G280000’, ‘Prupe.1G285700’, ‘Prupe.1G289900’, ‘Prupe.1G314100’, ‘Prupe.1G332600’, ‘Prupe.1G334500’, ‘Prupe.1G342900’, ‘Prupe.1G350900’, � [... truncated]
I load the data by running this command: cat_map=read.table("./Prunus_persica/GO/Prunus_persica_v2.0.a1_genes2GO.txt", head=F, sep="\t")