Exclude sites on the basis of the proportion of missing data using BCFTOOLS
1
0
Entering edit mode
4.9 years ago

I want to do something very simple. I want to keep sites that do not have any missing genotypes (i.e. 100% present across all samples).

In vcftools I usually do --max-missing 1.0 as indicated in the manual, but I want to use BCFTOOLS as it is much quicker. However, I cannot find the equivalent flag to do this. I have only found F_MISSING, which I think removes individuals and not sites.

genome vcf vcftools bcftools • 3.7k views
ADD COMMENT
4
Entering edit mode
4.9 years ago
$ bcftools view -e 'GT[*] = "mis"' multi-sample.vcf

This would exclude sites where any genotype is missing.

ADD COMMENT

Login before adding your answer.

Traffic: 1801 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6