GRCh37 database, population filter
8.3 years ago
gwas_maniac ▴ 20

Good morning, all! I am here with another question that has been bugging me. I am using the GRCh37 database for SNPs, and those SNPs are drawn from five large populations across the globe. When I browse the SNPs for a transcript, for example http://grch37.ensembl.org/Homo_sapiens/Transcript/Variation_Transcript/Table?db=core;g=ENSG00000131495;r=5:140018325-140027370;t=ENST00000252102, I can see ALL the SNPs from the five populations. Is there a filter that lets me take the SNPs from one particular population rather than all of them? Do you happen to know anything?

population-genetics database Ensembl grch37 snps • 2.1k views
8.3 years ago
trausch ★ 1.9k

If you feel comfortable using VCF files, then my approach would be to download the 1000 Genomes GRCh37 variants:

ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/

For every variant, INFO:AF is the global allele frequency, and INFO:EAS_AF, INFO:EUR_AF, INFO:AFR_AF, INFO:AMR_AF and INFO:SAS_AF are the allele frequencies in the five super-populations (East Asian, European, African, American and South Asian).
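
As a rough illustration of how those INFO tags could be used, here is a minimal Python sketch that keeps only the alleles observed in one super-population. The function names, the `min_af` threshold and the toy records in `demo` are made up for this example and are not taken from the 1000 Genomes files. Multi-allelic sites are assumed to carry one comma-separated frequency per ALT allele.

```python
def parse_info(info_field):
    """Turn a VCF INFO string such as 'AF=0.1;EUR_AF=0.25' into a dict."""
    info = {}
    for entry in info_field.split(";"):
        key, sep, value = entry.partition("=")
        info[key] = value if sep else True  # flag entries have no '='
    return info


def variants_in_population(vcf_lines, af_key="EUR_AF", min_af=0.0):
    """Yield (chrom, pos, id, ref, alt, af) for alleles whose frequency
    under af_key is strictly greater than min_af."""
    for line in vcf_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header and empty lines
        fields = line.rstrip("\n").split("\t")
        chrom, pos, var_id, ref, alt_field = fields[:5]
        value = parse_info(fields[7]).get(af_key)
        if not isinstance(value, str):
            continue  # tag missing for this record
        # Pair each ALT allele with its own frequency
        for alt, af in zip(alt_field.split(","), value.split(",")):
            if float(af) > min_af:
                yield chrom, int(pos), var_id, ref, alt, float(af)


# Toy records for illustration only (not real 1000 Genomes data)
demo = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "5\t100\trsA\tG\tA\t100\tPASS\tAF=0.10;EUR_AF=0.25;AFR_AF=0",
    "5\t200\trsB\tC\tT\t100\tPASS\tAF=0.01;EUR_AF=0;AFR_AF=0.05",
]

eur_hits = list(variants_in_population(demo, af_key="EUR_AF"))
afr_hits = list(variants_in_population(demo, af_key="AFR_AF"))
```

On a real download the same generator could be fed from `gzip.open(path, "rt")`, since the release files are gzip-compressed. Raising `min_af` to 0.01 would restrict the output to alleles above 1% in the chosen super-population.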


Wow, greatly appreciated!

8.3 years ago
Emily 23k

You can use BioMart. There's a help video here. Use the short variation database, then filter by gene and by variation set to get only the variants from specific populations.
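
For anyone who prefers to script this rather than click through the interface, the same two filters can be written as a BioMart XML query. The sketch below only builds the XML and does not contact any server. The dataset name `hsapiens_snp`, the filter names `ensembl_gene` and `variation_set_name`, and the attribute names are assumptions that should be checked against what the BioMart interface lists. The `"EUR"` value is a placeholder for whatever label the variation set filter shows.

```python
import xml.etree.ElementTree as ET


def build_biomart_query(gene_id, variation_set, dataset="hsapiens_snp"):
    """Build a BioMart XML query string for one gene and one variation set.

    Dataset, filter and attribute names are assumptions; verify them in the
    BioMart interface before using the query.
    """
    query = ET.Element("Query", {
        "virtualSchemaName": "default",
        "formatter": "TSV",
        "header": "1",
        "uniqueRows": "1",
        "datasetConfigVersion": "0.6",
    })
    ds = ET.SubElement(query, "Dataset", {"name": dataset, "interface": "default"})
    # The two filters described in the answer: gene and variation set
    ET.SubElement(ds, "Filter", {"name": "ensembl_gene", "value": gene_id})
    ET.SubElement(ds, "Filter", {"name": "variation_set_name", "value": variation_set})
    # Columns to return (assumed attribute names)
    for attr in ("refsnp_id", "chr_name", "chrom_start", "allele"):
        ET.SubElement(ds, "Attribute", {"name": attr})
    return ET.tostring(query, encoding="unicode")


# Gene ID taken from the URL in the question; "EUR" is a placeholder label
query_xml = build_biomart_query("ENSG00000131495", "EUR")
print(query_xml)
```

The resulting string would normally be sent as the `query` parameter to the BioMart web service of the GRCh37 site. The "XML" button in the BioMart interface shows the exact query for a selection made by hand, which is a good way to confirm the names used here.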


Thank you so much!!


I have also noticed that BioMart has two options for every population, e.g. EUR and EUR-COMMON. What is the difference between the two?


Common means the variant has a frequency > 1% in that population.
