Is there any robust way to convert VCFs to SQL (for easier comparisons) in R or Ruby, Java and Python?
2
0
Entering edit mode
7.7 years ago

Is there any modern robust way to convert VCFs into SQL (for easier comparisons) in R or Ruby, Java and Python?

vcf sql ngs wgs • 2.2k views
ADD COMMENT
1
Entering edit mode

What comparisons you would like to make that are not possible with existing tools ? Just curious.

ADD REPLY
0
Entering edit mode

Multiple and 4 producing readable, recordable and processable data, experience showed that GATK, NextGene and VAAST are not very good for VCF comparisons. Can you suggest any other software for VCF file comparison?

ADD REPLY
0
Entering edit mode

Many of the problems with VCF comparisons will not be solved by just putting the information into a database. The difficultly of VCF comparison is, at least in part, due to the ability to represent events in multiple ways. For example, a VCF containing a 1bp deletion and SNV 10bp upstream can be equivalent or not equivalent to a SNV in the deletion position and the deletion in the SNV position depending on whether the events are phased together and what the intervening sequence is. Even if you're just dealing with SNPs the problem doesn't go away as a SNV can have an upstream deletion with a compensating insertion downstream thus effectively rendering the nominal SNV coordinates incorrect. Ignoring indels, or even left-aligning them, isn't enough to resolve these issues in the general case.

Be aware that throwing VCF records into a database won't resolve all your issues.

ADD REPLY
0
Entering edit mode
7.7 years ago

VCFtools

vcflib

SnpSift

intersectBed Useful to quickly get overlapping stats.

ADD COMMENT
0
Entering edit mode

thx, and what tools can convert VCFs to SQLite, MySQL or NoSQL MondoDB or Casandra?

ADD REPLY
0
Entering edit mode
7.1 years ago
brian.repko ▴ 20

ADAM will let you read VCFs and save as parquet file. Spark SQL might be available on that.

ADD COMMENT

Login before adding your answer.

Traffic: 2015 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6