VCFTools not filtering out biallelic sites
1
0
Entering edit mode
9.0 years ago

I have some VCF files I'm trying to run through Haploview to calculate pairwise LD scores (I also need LOD scores, which is why I'm not simply using VCFTools for this), however I get the error "More than two alleles at marker XX" for a number of the files, as Haploview only works for biallelic markers.

I've tried to filter them out using:

./vcftools --gzvcf tmp/vcf/EURchr3.vcf.gz --min-alleles 2 --max-alleles 2

However although it appears to run successfully, I still get the error with Haploview.

Also, I'd like to keep the markers rather than simply filtering them out if possible...

Genotype VCFTools SNP • 7.9k views
ADD COMMENT
2
Entering edit mode
9.0 years ago
./vcftools --gzvcf tmp/vcf/EURchr3.vcf.gz --min-alleles 2 --max-alleles 2 --recode --stdout | ./bgzip -c > tmp.vcf/filtered_EURchr3.vcf.gz

seems to do the job!

ADD COMMENT

Login before adding your answer.

Traffic: 2721 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6