Biostar Beta. Not for public use.
ReadVCF not reading samples
0
Entering edit mode
4.1 years ago
United Kingdom

Hi, I'm trying to read a VCF file I have using readVcf, however the following occurs:

> fl <- file.path('tmp', 'vcf', 'rs11780156.genotypes.vcf.gz')
> vcf <- readVcf(fl, 'hg19')
> scanVcfHeader(fl)
class: VCFHeader
samples(190): HG00096 HG00097 ... NA12889 NA12890
meta(2): META contig
fixed(2): FILTER ALT
info(27): CIEND CIPOS ... EX_TARGET MULTI_ALLELIC
geno(0):
> dim(vcf)
[1] 13161     0
> geno(vcf)
List of length 0
names(0): 

I've looked at the file itself, and it definitely has the sample genotypes in it, so I'm not sure why geno(vcf) is empty.

The file in question is at https://www.dropbox.com/s/qwa0xoaksnnyev4s/rs11780156.genotypes.vcf.gz?dl=0 however it seems to apply to any that I try and load via readVcf. These are just VCF I files I downloaded from 1000 Genomes using tabix.

ADD COMMENTlink
0
0
Entering edit mode

Please paste output of sessionInfo(), gonna guess that you are missing a parameter in the read

ADD REPLYlink
0
Entering edit mode

Hi Ying, it turned out that the issue was 1000 Genomes (the original source for the data) were missing the FORMAT lines in the header. I got in touch with them and they fixed it, so this is no longer an issue.

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.3.1