Cutting Off The First Domain From All The Proteins In An Alignment
10.4 years ago
khorms ▴ 230

I have a multiple alignment of proteins from the same family. Each protein has two functional domains, both of which are present in the Pfam database, and the two domains do not overlap in my alignment. I want to cut the first domain off from all the proteins in the alignment, but I don't know how conserved the boundary of the first domain is. I don't want to inspect each sequence individually and cut off the first domain by hand. Is there a way to do this automatically?

alignment • 2.3k views

A few lines of Python code will do the job.
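For illustration, here is a minimal sketch of what such a script might look like. It assumes you already have, for each protein, the end position of the first domain in ungapped residue coordinates (for example, taken from that protein's Pfam annotation). The names `mask_first_domain`, `mask_alignment`, `alignment` and `domain_ends` are hypothetical, and the sequences are toy data, not real proteins. Because the boundary may fall in a different column for each sequence, the sketch replaces the first domain with gap characters per sequence, so the rest of the alignment keeps its columns.

```python
def mask_first_domain(aligned_seq, domain_end, gap="-"):
    """Replace the first `domain_end` residues of a gapped sequence with gaps.

    `domain_end` is the 1-based position of the last residue of the first
    domain, counted on the ungapped sequence (an assumed input).
    """
    out = []
    residues_seen = 0
    for ch in aligned_seq:
        if residues_seen < domain_end:
            # Still inside the first domain: count real residues only,
            # but blank out every column up to the boundary.
            if ch != gap:
                residues_seen += 1
            out.append(gap)
        else:
            # Past the boundary: keep the column unchanged.
            out.append(ch)
    return "".join(out)


def mask_alignment(alignment, domain_ends, gap="-"):
    """Apply mask_first_domain to every sequence in a {name: gapped_seq} dict."""
    return {
        name: mask_first_domain(seq, domain_ends[name], gap)
        for name, seq in alignment.items()
    }


# Hypothetical toy alignment and per-sequence domain end positions.
alignment = {
    "protA": "MKT-AYIAKQR",
    "protB": "MK-LSYI-KQR",
}
domain_ends = {"protA": 4, "protB": 3}

masked = mask_alignment(alignment, domain_ends)
for name, seq in masked.items():
    print(name, seq)
```

The masked sequences keep the original alignment length. If you want trimmed sequences instead, you could strip the gap characters afterwards and realign the remaining second domain.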

10.4 years ago
Niek De Klein ★ 2.6k

Not an automatic way, but faster than doing them all individually:

If you use Clustal X (and probably also the newer Clustal Omega, but that wasn't out yet when I worked with sequence alignments, so I don't know whether the GUI is the same), you can select the columns that contain the first domain after aligning the sequences and then delete them (either via right-click delete/remove or something similar, or via an option in the upper menu). If you know that the alignment of the first domain is correct, this removes the first domain from all sequences with one click.

This is probably also possible in other alignment programs, but I only know about Clustal X for sure.
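The column deletion described above can also be scripted. The following is only a sketch: `drop_columns` is a hypothetical helper, the alignment is assumed to be a plain dict of equal-length gapped strings, and the column range is assumed to have been chosen by eye, as it would be in the GUI. As with the GUI approach, it is only appropriate when the first domain occupies the same columns in every sequence.

```python
def drop_columns(alignment, first_col, last_col):
    """Remove columns first_col..last_col (1-based, inclusive) from every sequence."""
    if not 1 <= first_col <= last_col:
        raise ValueError("expected 1 <= first_col <= last_col")
    return {
        # Keep everything before first_col and everything after last_col.
        name: seq[:first_col - 1] + seq[last_col:]
        for name, seq in alignment.items()
    }


# Hypothetical toy alignment: the first domain is assumed to span columns 1-4.
toy_alignment = {
    "protA": "MKTAYIAK",
    "protB": "MK-AYI-K",
}
trimmed = drop_columns(toy_alignment, 1, 4)
for name, seq in trimmed.items():
    print(name, seq)
```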
