How to remove adapters from DNA-seq illumina reads?
1
0
Entering edit mode
7.4 years ago
hed.robin ▴ 30

Hi all,

I have some metagenomics reads, I trim them with trimmomatic and remove adapters using some illumina adapters library I found on the internet.

Then I look at them in Fastqc and it says that "adapter content" is very bad and "kmer content" is very bad as well (which must mean that there are indeed lots of adapters), the question is -- how do I remove these adapters?

How does Fastqc know they are there? Is it using some adapter database? If so, I would like to use that database to remove them! Where is the Fullest Possible Illumina Adapters Library (because the one I'm using right now isn't full enough)? Or Fastqc is mistaking?

sequencing trimming • 4.1k views
ADD COMMENT
1
Entering edit mode

there is a nice tool called BBmap from which you can use bbduck.sh that contains what you need

and it would be nice if you can support us with the tool, command and parameters that would make the answer more clear

ADD REPLY
1
Entering edit mode

Program is actually called bbduk.sh. BBMap suite also includes a adapters.fa file (in "resources" directory in source) that includes common commercial adapter sequences so you don't need to search for dodgy sources on the net or type the sequences in by hand.

ADD REPLY
0
Entering edit mode

according to the author documentation he wrote

I'd like to introduce another member of the BBTools package, BBDuk.

which will take you to the BBmap web (maybe I am mistaken about it)

ADD REPLY
0
Entering edit mode

duk stands for decontamination using k-mers. You wrote bbduck.sh in the post above (which is a mistake I sometimes do). I was just correcting the program name. All BBTools are included in a single download.

ADD REPLY
0
Entering edit mode

you are totally right I did not see this C (my bad :) )

ADD REPLY
0
Entering edit mode

@hed.robin Yes. FastQC uses a database kept inside Configuration directory. Look for files: adapter_list.txt and contaminant_list.txt

ADD REPLY
0
Entering edit mode
7.4 years ago

using some illumina adapters library I found on the internet

And did you verify that those are the same ones that are used in the sequencing reaction that was performed to realize your data? I've been using trimmomatic a lot and never had problems, I guess that you might have specified the wrong adapters!

Check your sequencing information: which Illumina technology was used?

Then, search for it here: http://support.illumina.com/content/dam/illumina-support/documents/documentation/chemistry_documentation/experiment-design/illumina-adapter-sequences_1000000002694-01.pdf

You'll find the right sequences. ;)

ADD COMMENT

Login before adding your answer.

Traffic: 2695 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6