Total Samples In 1000 Genomes Vcf File
2
0
Entering edit mode
10.3 years ago
win ▴ 970

Hi all, I wanted to know if someone was aware of how many exact number of samples are in the 1000 genomes VCF file?

Thanks in advance.

vcf 1000genomes • 2.9k views
ADD COMMENT
2
Entering edit mode
10.3 years ago

There are 1092 samples in the chromosome 1-23 VCF files from phase 1. The Y and MT chromosome VCFs had one or two sample IDs not among the 1092, iirc.

ADD COMMENT
1
Entering edit mode
10.3 years ago
donfreed ★ 1.6k
grep '#CHROM' 1.1-50000.ALL.chr1.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf | awk '{print NF; exit}'

prints 1101. 1101-9 (for the other fields in the VCF) = 1092

ADD COMMENT
1
Entering edit mode
bcftools view -h ALL.chr22.phase3_shapeit2_mvncall_integrated_v5.20130502.genotypes.vcf.gz | grep "^#CHROM" | cut -f10- | awk '{print NF}'

just as an update, latest 1000genomes September 2014 release contains 2504 samples.

ADD REPLY

Login before adding your answer.

Traffic: 2929 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6