Trimming reads with prinseq
2
0
Entering edit mode
6.4 years ago
deepti1rao ▴ 50

What quality trimming options can be used with prinseq? I intend to use a min qual of 20 and would like to trim from the right and retian those reads that have a min length of 30. What window size is good? I want to de novo assemble a genome with these reads.

Forward reads fileReverse reads file

prinseq reads preprocessing qual trim • 3.3k views
ADD COMMENT
0
Entering edit mode

enter image description here

ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode

My reads are 150 bases long. Chan, do you suggest 6 as the sliding window? How does one decide?

ADD REPLY
0
Entering edit mode

I think 4~6 window size is good to try with.

ADD REPLY
0
Entering edit mode

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

This should have gone under @Chen's answer below.

ADD REPLY
2
Entering edit mode
6.4 years ago
GenoMax 141k

You can check the manual/in-line help for prinseq program to get this type of information.

I would recommend that you use bbduk.sh from BBMap suite. You can easily understand options you need. e.g. in your case trimq=20 minlen=30.

ADD COMMENT
2
Entering edit mode
6.4 years ago
chen ★ 2.5k

fastp is a good tool for your task. Run fastp with:

fastp -i in.fq -o out.fq -3 -W 6 -M 20 -l 30

-i in.fq specifies the input fastq file
-o out.fq specifies the output fastq file after filtering
-3 enables the sliding window cutting in 3'
-W 6 specifies 6 as the sliding window size for quality cutting
-M 20 specifies Q20 as the mean quality requirement within the sliding window
-l 30 indicates that reads shorter than 30bp will be discarded

ADD COMMENT

Login before adding your answer.

Traffic: 2859 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6