A command line option for region in GenotypeGVCFs?
1
0
Entering edit mode
8.2 years ago

Is there a command line option for a single region (not a region file)?

Does anyone have suggestions for rapidly running GenotypeGVCF? The last step (merging) took over three weeks for ~ 150 human genomes.

Thanks!

GATK haplotypecaller • 2.2k views
ADD COMMENT
2
Entering edit mode
8.2 years ago
donfreed ★ 1.6k

From https://www.broadinstitute.org/gatk/guide/tooldocs/org_broadinstitute_gatk_engine_CommandLineGATK.php#--intervals:

--intervals / -L

One or more genomic intervals over which to operate
Use this option to perform the analysis over only part of the genome. This argument can be specified multiple times. You can use samtools-style intervals either explicitly on the command line (e.g. -L chr1 or -L chr1:100-200) or by loading in a file containing a list of intervals (e.g. -L myFile.intervals).

Engine parameters should work with every tool.

After genotyping the gVCFs over small intervals, you can use bcftools concat to produce a single VCF for all individuals over all regions.

ADD COMMENT

Login before adding your answer.

Traffic: 2308 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6