Question

Good Multiple Sequence Alignment Tool For Ordered/Disordered Proteins

0

11.7 years ago

miquelduranfrigola ▴ 780

Hi all,

I'm dealing with a protein that has both ordered and disordered regions (most of it is disordered). I need to align this protein with 40 of its orthologs in a multiple sequence alignment, with very high accuracy (I don't care much about speed). I want both ordered and disordered regions to be aligned properly, so I do not want to depend on substitution models. What would you recommend? MAFFT? MUSCLE?

Thanks!

As an aside, are any of you aware of a substitution model for intrinsically disordered proteins? The only one I know of can be found here.

alignment • 2.8k views

ADD COMMENT • link updated 11.7 years ago by Whetting ★ 1.6k • written 11.7 years ago by miquelduranfrigola ▴ 780

score 1 · Answer 1 · 2012-08-09

1

11.7 years ago

Whetting ★ 1.6k

I am not sure what you mean by "I do not want to depend on substitution models". What would you like to depend on instead? You could use any aligner you want (MAFFT L-INS-i is good at this kind of thing), open the alignment in an editor (I like SeaView), and realign the regions that were not aligned well using another algorithm.
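For reference, here is a minimal sketch of how an L-INS-i run could be scripted from Python. It assumes the `mafft` executable is installed and on the PATH; the function names and the file names in the usage note are hypothetical. The flags `--localpair --maxiterate 1000` are the ones the MAFFT documentation gives for L-INS-i.

```python
import shutil
import subprocess
from pathlib import Path


def linsi_command(fasta_path, max_iterate=1000):
    """Build the argument list for a MAFFT L-INS-i run (hypothetical helper)."""
    # --localpair with --maxiterate 1000 is the documented L-INS-i setting.
    return ["mafft", "--localpair", "--maxiterate", str(max_iterate), str(fasta_path)]


def run_linsi(fasta_path, out_path):
    """Run MAFFT and write the aligned FASTA to out_path."""
    if shutil.which("mafft") is None:
        raise RuntimeError("mafft executable not found on PATH")
    # MAFFT prints the alignment to stdout and progress messages to stderr.
    result = subprocess.run(
        linsi_command(fasta_path), check=True, capture_output=True, text=True
    )
    Path(out_path).write_text(result.stdout)
    return out_path
```

Something like `run_linsi("orthologs.fasta", "orthologs.linsi.fasta")` would then produce an alignment that can be opened in an editor for inspection and local realignment.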

ADD COMMENT • link 11.7 years ago by Whetting ★ 1.6k

0

Hi Whetting,

By not depending on substitution models I mean not having to specify a predefined model, and using, for instance, an HMM instead. In this case (where I have both ordered and disordered regions), I would feel more confident with that than with BLOSUM matrices or similar. Is this notion right?

In such a case, do you think ProbCons is a good option?

ADD REPLY • link 11.7 years ago by miquelduranfrigola ▴ 780

1

It seems to me that you may want to try a couple of algorithms. It is usually good practice to determine empirically which alignment method works best. T-Coffee, ProbCons, ... all have good and bad tendencies.
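One rough way to compare the output of two aligners on the same sequences is to count how many aligned residue pairs they share. The sketch below is only an illustration of that idea, not a standard benchmark: the function names are hypothetical, and each alignment is assumed to be a dict mapping sequence name to a gapped string with `-` as the gap character.

```python
from itertools import combinations


def aligned_pairs(alignment):
    """Return the set of residue pairs that share a column.

    Each residue is identified as (sequence name, index in the ungapped sequence).
    """
    names = sorted(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[n]) != length for n in names):
        raise ValueError("all aligned sequences must have the same length")
    counters = {n: 0 for n in names}
    pairs = set()
    for col in range(length):
        present = []
        for n in names:
            if alignment[n][col] != "-":
                present.append((n, counters[n]))
                counters[n] += 1
        # Every pair of residues in this column counts as aligned.
        pairs.update(combinations(present, 2))
    return pairs


def pair_agreement(aln_a, aln_b):
    """Fraction of aligned pairs in aln_a that also appear in aln_b."""
    pairs_a = aligned_pairs(aln_a)
    if not pairs_a:
        return 0.0
    return len(pairs_a & aligned_pairs(aln_b)) / len(pairs_a)
```

Low agreement between methods does not say which one is right, but it does point to the regions worth inspecting by eye.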

ADD REPLY • link 11.7 years ago by Whetting ★ 1.6k

3

Not to mention that, half of the time, we don't REALLY have a great idea yet of what even constitutes good performance from an alignment algorithm, for instance when it comes to indels: are the algorithms that produce more "gappy" alignments in loop regions better or worse? People tend not to like them, but the evidence suggests they are probably modelling biological reality better in lots of ways. I would guess this would be a big concern for the disordered regions.
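To see how "gappy" a given alignment is along its length, one simple option is to compute the gap fraction per column. This is a minimal sketch with a hypothetical function name, assuming the alignment is a list of equal-length gapped strings that use `-` for gaps.

```python
def column_gap_fractions(rows):
    """Return, for each alignment column, the fraction of sequences with a gap."""
    if not rows:
        return []
    length = len(rows[0])
    if any(len(r) != length for r in rows):
        raise ValueError("all aligned sequences must have the same length")
    return [sum(r[c] == "-" for r in rows) / len(rows) for c in range(length)]
```

Plotting these values for the alignments produced by different tools makes it easy to see where, and how much, they differ in gap placement.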