Annotate hs37d5 genome calls using VEP
1
0
Entering edit mode
3.9 years ago
igor • 0

I have genome sequencing data in vcf format that I would like to annotate using VEP. The genomic variants were called against the hs37d5 reference genome, and I can't seem to find VEP cache files for this assembly. How can I annotate these variants using VEP?

vep annotation 1kgenomes hs37d5 • 1.8k views
ADD COMMENT
0
Entering edit mode

Check for hg19, I am 100% sure you will find it via that name.

ADD REPLY
0
Entering edit mode
ADD REPLY
1
Entering edit mode

It is the same coordinate system with additional decoy sequences, you are good to go with using the hg19 VEP annotations. These decoys are intended to catch false alignments in case that somatic cells in fact have viral integrates which is not uncommon. It is the same reference genome though.

ADD REPLY
0
Entering edit mode

That doesn't appear to be completely true either. For instance, when I open the BAM file in IGV using hg19 as the reference, the mitochondrial genome is completely misaligned. It is properly aligned when I use b37+decoy.

Edit: Chromosome Y and 3 are also different (https://gatk.broadinstitute.org/hc/en-us/articles/360035890711?id=23390#comparison)

ADD REPLY
0
Entering edit mode

I always thought they were (at least for the major chromosomes) fully identical, hmm... This is based on checksum, I do not know, cannot contribute any further, sorry.

ADD REPLY
0
Entering edit mode

Hi Igor, were you able to find a solution?

ADD REPLY
1
Entering edit mode
3.8 years ago
Ben_Ensembl ★ 2.4k

Hi igor,

The VEP can annotate variants using a custom cache defined in GFF/GTF files supplied in the VEP query. More information can be found on the following documentation page: https://www.ensembl.org/info/docs/tools/vep/script/vep_cache.html#gff

Best wishes

Ben Ensembl Helpdesk

ADD COMMENT

Login before adding your answer.

Traffic: 2806 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6