Clarify interpretation of slash (/) and vertical pipe (|) in GATK's vcf output?
1
0
Entering edit mode
23 months ago
evoclive • 0

This post has some explanation about the use of slash (/) and vertical pipe (|) in GATK's vcf output.

Apparently, a slash (/) indicates that "we don’t know which chromosome they (alleles) are on". If this is the case, why does my VCF have many (virtually all) slashed (/) allele calls for loci with a denoted chromosome and position?

For example,

CM014970.1 4450225 . C T 31.37 . <VARIOUS INFO> 0/0:2,0:2:6:0,6,49 0/0:2,0:2:6:0,6,49 ./.:0,0:0:.:0,0,0

are the called alleles on chromosome CM014970.1 or is it unknown?

VCF GATK formatting • 564 views
ADD COMMENT
0
Entering edit mode
23 months ago

If this is the case, why does my VCF have many (virtually all) slashed (/) allele calls for loci with a denoted chromosome and position?

if you're using short reads, one can only phase variants if two variants are found one the same reads. So finding phased variants is more difficult with short reads than with long reads.

ADD COMMENT
0
Entering edit mode

I don't understand how that answers the point about not knowing "which chromosome they are on" when loci with slashes have stipulated chromosomal information, e.g.:

CM014970.1 4450225 . C T 31.37 . <VARIOUS INFO> 0/0:2,0:2:6:0,6,49 0/0:2,0:2:6:0,6,49 ./.:0,0:0:.:0,0,0

To clarify: are the called alleles on chromosome CM014970.1 or is it unknown?

ADD REPLY

Login before adding your answer.

Traffic: 1731 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6