Define own alphabet and perform MultipleSequenceAlignment in biopython
1
0
Entering edit mode
8.6 years ago
osterland • 0

I want to do a MultipleSequenceAlignment in biopython but with a self defined Alphabet. The Background is: My sequences are sequences of numeric states and there are up to 5000 states. Thus I need an alphabet with 5000 letters, e.g. '0001', '0042', '4999'. Those sequences are up to 50 states/letters long.

So my main questions are:

  • How can I define such an Alphabet?
  • How can I use this Alphabet with the MultipleSequenceAlignment?

Alternatively: Is it possible to perform a MultipleSequenceAlignment on Lists/Arrays instead of Sequences?

Thanks for you Time & Help!

python biopython sequence-alignment • 2.2k views
ADD COMMENT
0
Entering edit mode

Have you had any success here? I am very much interested in your solution.

ADD REPLY
0
Entering edit mode
8.4 years ago
Peter 6.0k

While the Biopython Alphabet class system could probably mostly handle this, the Seq, SeqRecord and MultipleSequenceAlignment objects all focus on single-letter character representations. You might have some success with the array based MutableSeq object, but I think you would have to write some new subclasses if you really wanted to go down this route.

ADD COMMENT

Login before adding your answer.

Traffic: 2514 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6