Perl script for discarding sequences shorter than 200 nucleotides before running CPC in RNA-seq analysis
2.3 years ago
@yaminivadapally48917

I want a Perl script that discards sequences shorter than 200 nt from a FASTA file before I run CPC.

RNA-Seq perl • 299 views

What have you tried so far?

2.3 years ago
karthic • 100
@karthic42122

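I hope the script below works. Save it, then run it with the script name followed by the minimum length (an integer) and then the FASTA file; the script reads the length as its first argument. The records that pass the filter are printed to standard output, so redirect them to a new file.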

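#!/usr/bin/perl
use strict;
use warnings;

# Arguments: <minlen> <fasta file>
# The first argument is the minimum sequence length to keep.
my $minlen = shift or die "Error: `minlen` parameter not provided\n";
die "Error: `minlen` must be a positive integer\n" unless $minlen =~ /^\d+$/;
{
    # Read one FASTA record at a time by using ">" as the record separator.
    local $/ = ">";
    while (<>) {
        chomp;                           # remove the trailing ">" separator
        next unless /\w/;                # skip the empty chunk before the first ">"
        my @chunk  = split /\n/;
        my $header = shift @chunk;       # first line is the header; the rest is sequence
        my $seqlen = length join "", @chunk;
        # Print the record unchanged if the sequence is long enough.
        print ">$_" if $seqlen >= $minlen;
    }
}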
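
To check the filtering logic without a real FASTA file, here is a minimal sketch of the same idea as a subroutine that works on an in-memory string. The name `filter_fasta` and the sample records are made up for illustration and are not part of the script above. Unlike that script, this version splits only on a ">" at the start of a line, strips stray whitespace such as "\r" before measuring, and writes each kept sequence on a single line.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: return the FASTA records whose sequence has at
# least $minlen characters, each as a ">header\nsequence\n" string.
sub filter_fasta {
    my ($fasta_text, $minlen) = @_;
    my @kept;
    # Split on ">" only at the start of a line, so a ">" inside a header is left alone.
    for my $record (split /^>/m, $fasta_text) {
        next unless $record =~ /\S/;          # skip the empty piece before the first ">"
        my ($header, @lines) = split /\n/, $record;
        my $seq = join "", @lines;
        $seq =~ s/\s+//g;                     # drop stray whitespace, e.g. "\r" from Windows files
        push @kept, ">$header\n$seq\n" if length($seq) >= $minlen;
    }
    return @kept;
}

# Made-up sample data: seq1 has 8 nt spread over two lines, seq2 has 3 nt.
my $sample = ">seq1 test\nACGT\nACGT\n>seq2\nACG\n";
my @kept   = filter_fasta($sample, 5);
print @kept;    # only seq1 is printed, with its sequence joined as ACGTACGT
```

For the original question, the threshold would be 200 instead of 5.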