Are there any "complete" VCF files for tomato available?

0

Entering edit mode

9.7 years ago

gabriel.jabud ▴ 40

I'm following the GATK pipeline for variant calling of tomato whole genome data I am analyzing and am stuck on the base recalibration step (http://www.broadinstitute.org/gatk/guide/best-practices?bpm=DNAseq). It requires me to use a VCF file containing known variant sites. The only VCF file I've found so far is the one on NCBI's dbSNP database (ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/tomato_4081/VCF/). However the combined VCF file for all 12 chromosomes is only about 237 lines long including the header, so I doubt it will help to recalibrate with such a small amount of known variants.

Should I just go ahead and skip the recalibration step or is there a more complete VCF file containing variants of tomato?

GATK tomato vcf • 2.5k views

ADD COMMENT • link updated 2.3 years ago by Ram 43k • written 9.7 years ago by gabriel.jabud ▴ 40