Markov Chain For Generating Random Protein Sequences
1
3
Entering edit mode
14.1 years ago
Perry ▴ 290

I have 1000+ protein sequences. I want to generate random sequences using a Markov model based on residue transitions found my sequences. I'm told Matlab will make a Markov chain based on multiple sequences, but I would like to use a free alternative to Matlab (python, ruby, R, etc). Can anyone provide me with a library or module?

sequence protein random • 5.3k views
ADD COMMENT
4
Entering edit mode
14.1 years ago

It seems Python programmers like writing Markov generators. I often see this topic pop up on Python blogs in the context of generating pseudo random text. A quick search shows a few hits:

I guess you would only need to change to tokenizer to split on letters rather than words.

ADD COMMENT
0
Entering edit mode

Thanks. The first link helped.

ADD REPLY
0
Entering edit mode

For 1st and 2nd order chains you can use Sean Eddy's Squid lib. It's written in C. But, you could add some randomizations as a control. In this case you could use uShuffle to preserve high-level orderings. Then you'll see why biology is very much local . . .

ADD REPLY
0
Entering edit mode

All links, but the last one, are dead.

ADD REPLY
0
Entering edit mode

pretty sad state of the internet - you can't trust anything to be around

ADD REPLY

Login before adding your answer.

Traffic: 2269 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6