Exome sequencing and linkage analysis : obtain a list of SNPs
2
0
Entering edit mode
8.1 years ago
wester1086 ▴ 10

Hello everyone, this is my first post on biostars.

I'm currently working on linkage analysis with exome data.

I'm trying to get a list of reference SNPs which are within target capture intervals.

So I have a bed file with all intervals targeted by WES (Agilent Sureselect V5 with UTRs included) , and I would like to get a list of 1000 Genomes SNPs (rs#) which are located in these intervals.

My first idea was to use biomaRt with ENSEMBL SNP database, but it doesn't seem to work as the number of SNPs located in capture regions is too large (I get an error: "Error during query execution: Got a packet bigger than 'max_allowed_packet' bytes")

Do you know any alternative method or solution to the biomaRt problem? Any help would be appreciated.

Thank you for your time.

SNP Exome Linkage biomart • 1.9k views
ADD COMMENT
1
Entering edit mode
8.1 years ago
Emily 23k

You could try getting the VCF files directly from 1000 Genomes, then running VCF tools vcf-query on them for your regions. You could probably script this in some way.

ADD COMMENT
0
Entering edit mode
8.1 years ago
wester1086 ▴ 10

Thank you for your answer!

I ended up using bedtools --intersect with VCF files from 1000 Genomes and my bed file.

ADD COMMENT

Login before adding your answer.

Traffic: 2453 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6