Question

How to identify every read in a fastq file?

1

Entering edit mode

7.6 years ago

gerberd1990 ▴ 30

Hi, Everybody is looking for their target reads in a fastq file, and I am just sitting here and can not find a good program to identify the remaining (junk) reads. I am working on ancient DNA (currently horse) illumina reads, and I want to identify the exact organisms (possibly pathogens, human or other contamination, etc) of the remaining reads besides the horse sequences (approx 20-30% of the data contains horse DNA actually). So, can anyone recommend a good program for this task? Thanks in advance :)

genome next-gen blast • 2.8k views

ADD COMMENT • link 7.6 years ago by gerberd1990 ▴ 30

0

Entering edit mode

WOW, thanks for everybody, I see many valuable information here :)

ADD REPLY • link 7.6 years ago by gerberd1990 ▴ 30

score 3 · Answer 1 · 2016-09-14

3

Entering edit mode

7.6 years ago

Medhat 9.7k

First you need to align your reads to the expected reference that it may be contaminated with using FastQ Screen Or DeconSeq, then below post to remove it

http://seqanswers.com/forums/showpost.php?p=109308&postcount=6

ADD COMMENT • link 7.6 years ago by Medhat 9.7k

score 1 · Answer 2 · 2016-09-14

1

Entering edit mode

7.6 years ago

Simon Cockell 7.4k

This would seem like a good use case for something like Kraken: https://ccb.jhu.edu/software/kraken/

But your ability to assign every read in your experiment to an organism of origin will depend entirely on the completeness of your database.

ADD COMMENT • link 7.6 years ago by Simon Cockell 7.4k

score 1 · Answer 3 · 2016-09-14

1

Entering edit mode

7.6 years ago

WouterDeCoster 47k

This might be worth trying: https://github.com/smangul1/rop, see paper: http://biorxiv.org/content/early/2016/05/13/053041

ADD COMMENT • link 7.6 years ago by WouterDeCoster 47k

1

Entering edit mode

I think it is specialized in:

discover the source of all reads, which originate from complex RNA molecules, recombinant antibodies and microbial communities.

ADD REPLY • link 7.6 years ago by Medhat 9.7k

0

Entering edit mode

Yes, it is not meant for this application but might give some ideas.

ADD REPLY • link 7.6 years ago by WouterDeCoster 47k

score 0 · Answer 4 · 2016-09-14

0

Entering edit mode

7.6 years ago

igor 13k

There are some suggestions in these previous threads:

ADD COMMENT • link 7.6 years ago by igor 13k