Masking short sequences between gaps in genome assembly
0
0
Entering edit mode
5.7 years ago
pbigbig ▴ 250

Hi,

I have assembled a genome, using Illumina Pair-end reads for assembling, Mate pair reads for scaffolding. In resulted fasta files, I notice some patterns like this:

...NNNNNNNNNNNNNNNNGTGTGTAGGATCTCACNNNNNNNNNNNNNNNNNNNNNN...

I would like to hardmask those small "island" sequences between gaps with defined maximum length (e.g masking if < 200bp), could you please give some suggestion?

Thank you very much in advance!

genome assembly masking • 1.0k views
ADD COMMENT
0
Entering edit mode

When you say masking do you mean remove it ?

ADD REPLY
0
Entering edit mode

Hi, I mean to hard masking it, which would turn any A C G T to N

ADD REPLY

Login before adding your answer.

Traffic: 2949 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6