Assigning RepeatMasker classes and families to Repbase repeats
0
1
Entering edit mode
7.1 years ago
Mike ▴ 60

I'm using mouse mm10. The RepeatMasker track on the UCSC Genome Browser categorizes each repeat with a name, class, and family (there are ~1500 different repeat names, 10 classes and ~50 families). RepBase uses an ID and one or more keywords. I know there's not a one to one relationship between the two databases (http://www.repeatmasker.org/faq.html#faq4) but I'm wondering if it's possible to at least assign a RepeatMasker class and family to each Repbase entry. Is there a tool or database already available that can do this?

Trying some myself, sometimes it's easy like when there's a perfect match between a RepeatMasker name and a RepBase ID. Sometimes the keywords for a RepBase record match one or more RepeatMasker classes or families so it's ambiguous. Sometimes the keywords are similar but not identical to a RepeatMasker class or family so while I could probably determine a pairing by manually looking it, automating it (with a script) is difficult.

repeatmasker repbase repeats • 2.2k views
ADD COMMENT

Login before adding your answer.

Traffic: 2156 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6