Make transeq add identifier to each line
1
0
Entering edit mode
8.3 years ago

I recently installed transeq and used ti to create six protein sequences from a fasta file created via de novo assembly. The issue is that all the sequences in the file have the same identifier in from of them. Is there a command to have transeq copy the identifiers from the fasta file for each nucleotide sequences and add them only to the appropriate protein sequences?

protein emboss transeq • 1.7k views
ADD COMMENT
0
Entering edit mode

Provided a file in that same manner and am running EMBOSS 6.5.7 as I couldn't find 6.6.

ADD REPLY
0
Entering edit mode

Depending on your linux distribution, EMBOSS 6.6.0 may be the version on official repositories. Here is a link to EMBOSS 6.6.0 sources, from Debian.

ADD REPLY
0
Entering edit mode
8.3 years ago
Michael 54k

My transeq does that by default, which version do you use and how do you call it? I have EMBOSS:6.6.0.0

If you provide a file like this:

>id1
...
>id2
...
>id3
...

Output of transeq -frame=1:

>id1_1

>id2_1

>id3_1
ADD COMMENT

Login before adding your answer.

Traffic: 2628 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6