Biostar Beta. Not for public use.
Question: gene IDs not recognized by DAVID
0
Entering edit mode

I have a gene list generated by RefSeq data downloaded from UCSC genome browser, and they have IDs, begining with NM or NP. (for example, NM_001032214) They are transcript_IDs.

And I'm gonna run GO term analysis by DAVID, by selecting identifier as RefSeq-mRNA, only 560/980 of my gene IDs are recognized. I don't understand why is this happened? What should I do to include all of my genes?

Thank you

ADD COMMENTlink 5.3 years ago catherine12243 • 120 • updated 5.3 years ago Denise - Open Targets ♦ 5.0k
Entering edit mode
0

In addition to Denises answer, a lot of the data might be outdated. You probably have modern gene names, while Davids are 5 years old. Hence, no mapping exists.

ADD REPLYlink 4.4 years ago
Endre Bakken Stovner
• 880
0
Entering edit mode

There might not be anything wrong going here. Without seeing some of your examples, I suspect that many of your NMs (or NPs) actually correspond to the same gene entity. The DMD gene in human for example has 30 spliced isoforms, and lots of NMs and NPs cross referenced to at least 8 of those isoforms. See the DMD example in Ensembl. You may also want to confirm in BioMart what DAVID is telling you. BioMart allows you to convert the NMs and NPs IDs into gene IDs (Ensembl IDs, HGNC, Entrez Gene IDs, etc) and get the GO terms for each of them.

ADD COMMENTlink 5.3 years ago Denise - Open Targets ♦ 5.0k

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.0