Filtering repetitive regions with low quality/no SNPs in a VCF file
2.9 years ago
peter ▴ 20

I have a VCF file and want to filter out repetitive regions with low-quality or no SNPs. I have a RepeatMasker BED file of repeats, but I don't know how to apply it to the VCF. How can I eliminate sites masked by RepeatMasker? Any insights would be appreciated.

snp vcf • 963 views

Hi Peter,

Did you ever end up working this one out? I am trying to make a .bed file with the locations of all repetitive regions of a contig, so that I can mask these repeat regions for another analysis.

Cheers,

Tim


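Hi. I ended up using bedops. The --not-element-of (-n) operator with a 1 bp threshold reports the variants that do not overlap any repeat. Use the first command if the repeats are already in BED format, or the second to convert RepeatMasker output with rmsk2bed. For example: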

$ bedops --chrom chr14 -n 1 <(vcf2bed < variants.vcf) <(sort-bed repeats.bed) > answer.bed
$ bedops --chrom chr14 -n 1 <(vcf2bed < variants.vcf) <(rmsk2bed < repeats.rmsk) > answer.bed
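Note that the commands above write BED-formatted output (answer.bed), not a VCF. If you need the result as a VCF with its header intact, a minimal sketch along the following lines might work. It is not part of bedops: the function name filter_vcf_by_bed is made up for illustration, it assumes tab-delimited input, and it only tests each record's POS (not the full span of an indel) against every repeat interval on the same chromosome, which will be slow for very large repeat sets.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not a bedops command): print a VCF without the
# records whose POS falls inside any interval of a BED file.
# BED is 0-based and half-open, VCF POS is 1-based, so a site is
# treated as masked when start < POS <= end.
filter_vcf_by_bed() {
    local bed="$1" vcf="$2"
    awk -F'\t' '
        # First file (BED): store the intervals per chromosome.
        FILENAME == ARGV[1] {
            if ($0 ~ /^(#|track|browser)/ || NF < 3) next
            n[$1]++
            bstart[$1, n[$1]] = $2 + 0
            bend[$1, n[$1]]   = $3 + 0
            next
        }
        # Second file (VCF): header lines pass through unchanged.
        /^#/ { print; next }
        {
            pos = $2 + 0
            masked = 0
            # Linear scan over the intervals of this chromosome.
            for (i = 1; i <= n[$1]; i++) {
                if (pos > bstart[$1, i] && pos <= bend[$1, i]) {
                    masked = 1
                    break
                }
            }
            if (!masked) print
        }
    ' "$bed" "$vcf"
}

# Example call (the output file name is arbitrary):
#   filter_vcf_by_bed repeats.bed variants.vcf > variants.norepeats.vcf
```

For whole-genome data, the sorted set operations in bedops shown above will scale much better than this linear scan.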

Let me know if this helps or if you came across a better solution.
