I have downloaded a vcf file from 1000 genomes:
I want to convert this vcf file to ms-style output. I came across this python package scikit-allel that I can use to read a vcf file and convert that to haplotypeArray. I have read the vcf file using scikit-allel:
import allel
callset = allel.read_vcf('example.vcf')
Callset is a dictionary. I can use key 'calldata/GT' to get my genotype 3D array which is of shape (2548903, 2548, 2) where my first dimension are the variants, second dimension are the samples. I want to take this vcf file and convert this into Haplotype array. How can I do that using scikit-allel or is there any tool that could convert my vcf file to ms-style output? Insights will be appreciated.
Link to scikit-allel scikit-allel