Remove symbolic alleles for REF in a VCF file.
0
0
Entering edit mode
5.1 years ago
pmorellmi • 0

Hi there!

I'm trying to do BQSR on some sheep genomes. I downloaded the known variants from Ensembl and converted the GVF file to VCF using their script. However, once I try to use it to ran GATK 3.5.0 BaseRecalibrator, it propt this message "Cannot tag a symbolic allele as the reference allele". I've looked for "<>" symbols, as they are the markers for symbolic alleles in VCF as far as I've read, but grep didn't found any outside of the header.

Any idea of what could be happening and how can I solve it?

Thank you for your help.

vcf GATK BaseRecalibrator NGS gvf • 1.9k views
ADD COMMENT
0
Entering edit mode

what is the exact command line ? what is the output of

grep -v "#" < your.vcf | cut -f 4 | grep -F '<' -m 10
ADD REPLY
0
Entering edit mode

The command for BaseRecalibrator is this:

 java -Xmx16g -jar $GATK_HOME/GenomeAnalysisTK.jar -T BaseRecalibrator \
  -R ref.fa -I in.bam -o cal.tbl  -knownSites var.vcf

And there was no output.

ADD REPLY
0
Entering edit mode

isn't there a stack trace in the error message ?

ADD REPLY
0
Entering edit mode

The whole error message is this:

##### ERROR ------------------------------------------------------------------------------------------
##### ERROR A GATK RUNTIME ERROR has occurred (version 3.5-0-g36282e4):
##### ERROR
##### ERROR This might be a bug. Please check the documentation guide to see if this is a known problem.
##### ERROR If not, please post the error message, with stack trace, to the GATK forum.
##### ERROR Visit our website and forum for extensive documentation and answers to 
##### ERROR commonly asked questions http://www.broadinstitute.org/gatk
##### ERROR
##### ERROR MESSAGE: Cannot tag a symbolic allele as the reference allele
ADD REPLY
0
Entering edit mode

try with gatk 3.8 ?

ADD REPLY
0
Entering edit mode

Same result, I'm affraid.

ADD REPLY
0
Entering edit mode

Let's check if there are lines with other than CATG in the REF column:

$ awk -v FS="\t" -v OFS="\t" '$4 !~ /[CATG]/' input.vcf
ADD REPLY
0
Entering edit mode

Nothing other than the header as output:

##fileformat=VCFv4.1
##fileDate=20190306
##reference=./Oar/Ovis_aries.Oar_v3.1.dna.toplevel.fa
##INFO=<ID=TSA,Number=1,Type=String,Description="A SO term describing the type of sequence_alteration (child term of SO sequence_alteration).">
##INFO=<ID=dbSNP_150,Number=0,Type=Flag,Description="dbSNP_150 membership">
#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO
ADD REPLY

Login before adding your answer.

Traffic: 1933 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6