databases of variants
2
0
Entering edit mode
9.0 years ago
Richard ▴ 590

Hi all,

I'm working on a project where we'll need to check if any SNVs have been previously annotated in other databases.

Additionally, I'll need to know which variants are in samples that are cancer or normal (or some other disease).

So far, I know that at least some of the variants in dbSNP have population information. I'll be looking into COSMIC, http://www.cbioportal.org/, and http://www.intogen.org/

Are there any other databases I should be looking at to be able to determine how many times (if any) a variant has been discovered in other tumours, normals, etc ?

Thanks,
Richard

API dbSNP SNV • 2.3k views
ADD COMMENT
5
Entering edit mode
9.0 years ago
dandan ▴ 370

In terms of population allele frequencies for germline variants, 1000 Genomes, ESP 6500, and ExAC will be probably more than you need. For somatic variants, COSMIC is probably the easiest, but TCGA and ICGC are also possibilities. If you want databases of clinically annotated variants, ClinVar is your best bet. If you have an exorbitant amount of money sitting around, HGMD Pro is pretty good. If you really want to get detailed but have a high tolerance for headaches, there are many small datasets with clinically annotated variants (but not with genomic coordinates - those will have to be parsed out somehow). These include the LSDBs, Emory's EmVClass, Clinvitae, and many many more.

If you'd like to access any of these programmatically, you should check out SolveBio. We've imported most of these databases in into our Data Library.

Hope that helps

ADD COMMENT
1
Entering edit mode
9.0 years ago
Vivek ★ 2.7k

1000 Genomes, ESP & ExAC would be pretty comprehensive for population allele frequencies. I haven't come across anything other than COSMIC for cancer related variants.

ADD COMMENT

Login before adding your answer.

Traffic: 1826 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6