known unmappable or poorly sequenced regions in hg38
2
3
Entering edit mode
7.2 years ago
Paul ★ 1.5k

Dear all,

is there any similar file to this, which known unmappable or poorly sequenced regions for hg38? Or any way to download it from ucsc table browser?

Thank you for any link or comments.

dna-seq bed ngs • 3.5k views
ADD COMMENT
2
Entering edit mode
7.2 years ago
igor 13k

The blacklist files are available from ENCODE directly (there is a drop-down menu to select the proper genome build): https://www.encodeproject.org/annotations/ENCSR636HFF/

I don't know if using LiftOver for these regions is a good idea, since their coordinates are uncertain in the first place.

ADD COMMENT
0
Entering edit mode

Thank you Igor. Did you validate this regions? I can see, that for build GRCh38 is there only 38 regions. For Hg19 it is 411 regions. Also when I check this version of BED, there are 1649 regions.

ADD REPLY
0
Entering edit mode

There are actually two different BED files there. There are DAC and Duke lists. More info: Difficult to Sequence Regions for WGS

ADD REPLY
1
Entering edit mode
7.2 years ago

You can use a combination of blacklisted regions from hg37 with the UCSC liftOver utility (or maybe someone else can recommend a blacklist for hg38).

ADD COMMENT
0
Entering edit mode

Thank you so much. I was thinking to use "wgEncodeDukeMapabilityRegionsExcludable.bed.gz" from here and then use UCSC liftOver, but is this good procedure? Isnt those regions "out of date" for HG38?

ADD REPLY

Login before adding your answer.

Traffic: 2496 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6