Why is there 55,092 unique ensembl ENSG IDs?
1
0
Entering edit mode
2.1 years ago
4galaxy77 2.8k

I've annotated an imputed dataset (~40m variants) with CADD scores from the CADD database and the associated ENSG (e.g. ENSG00000283761) and ENST IDs.

There are 55,092 unique ENSG IDs in my dataset. Given that I thought there was one per gene and that humans contain ~20,000 genes (give or take a few thousand), this is quite a lot more than I expected.

Why is there this number of IDs and do they correspond to unique genes?

gene ensembl • 400 views
ADD COMMENT
5
Entering edit mode
2.1 years ago
GenoMax 142k

There are ~22K protein coding genes. This entire list contains all sorts of other things like "pseudogenes" etc.Here is a summary from BioMart.

list

Here are other types

Other types

ADD COMMENT

Login before adding your answer.

Traffic: 1884 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6