FastX Collapser and duplicate removal
0
0
Entering edit mode
6.9 years ago
seta ★ 1.9k

Hi all,

I don't work with FastX collapse, but I read that it combines identical reads to a single read and keeps count of the reads. However, I didn't understand its meaning from "keeps count of the read". Could you please a bit explain to me? if it meant the program show the count of reads in, for example, in the header of the single read or keeps in the memory or what else?

Thanks

ngs read duplicate Fastx collapser • 3.5k views
ADD COMMENT
0
Entering edit mode

I would suggest the you switch to clumpify.sh from BBMap suite. You can dedupe the data and it will keep a count of duplicate reads in the header of the deduplicated file (count=).

ADD REPLY
0
Entering edit mode

Thanks, genomax2 for your suggestion. I don't want to use FastX collapser now, just to understand what it does. Please kindly tell me if it also shows a count of reads in the header of the single (deduplicated) reads?

ADD REPLY
0
Entering edit mode

Should be easy enough to test. Sorry I have not used fastx-toolkit for a few years now.

ADD REPLY

Login before adding your answer.

Traffic: 1874 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6