Sequences with molecular indexes processing
1
0
Entering edit mode
7.7 years ago

Hi, I have some sequencing files which include 8nt molecular indexes on both ends. So I remove them before mapping to the genome, and now I would like to deduplicate the reads with same molecular indexes. I found https://github.com/mbusby/AddUMIsToBam but not sure it is maintained and properly tested (and could not compile as, I think, some header files are missing) which theoretically would be good option when followed by a picard deduplicate step.

Would you have any experience or suggestion for this?

Thanks, Manu

NGS • 1.7k views
ADD COMMENT
0
Entering edit mode
7.7 years ago

Check out our UMI-tools package design to do UMI deduplication while accounting for sequencing errors in the UMI sequences. https://github.com/CGATOxford/UMI-tools

Pre-print available on Biorxiv: http://biorxiv.org/content/early/2016/05/10/051755

Hope this helps. Get in contact if you need anything more.

ADD COMMENT

Login before adding your answer.

Traffic: 2143 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6