How find the best model for calculate dN/dS with big data (727 sequences)?
0
0
Entering edit mode
3.1 years ago

Hi!

I´m working on my thesis project to calculate dN/dS with CODEML, my data have 727 sequences. I have 3 different proteins (727 sequences each one) of different viruses. I performed different analyses, for example, a parallel analysis with Pairwise Comparison (Yiang and Nielsen 2000), model 0 (M0) and site model.

Also, I had evaluated the likelihood of the sequences for a variety of ω values and the sensitivity of ω to the transition/transversion ratio (k). My principal goal now is to find the best model for my data, to calculate ω ratio. I know that with de Lnl I can make an LTR for then evaluate the best model for my data, but my problem is finding a unique Lnl for my 727 sequences which a specific model, for example, M7 (LnL = 0,84848....), other for M8 (LnL = ....).

My problem is that with all analyses that I made previously obtain an LnL for pairwise comparison for each sequence's data, this means 727 * 727 LnL differents.

I´m looking for something like this:

Model 0: one-ratio ... lnL(ntime: 23 np: 25): -1133.892429 +0.000000 ... kappa (ts/tv) = 2.27532 omega (dN/dS) = 0.91089 branch = 14..1 t = 0.024 N = 234.4 S = 38.6 dN/dS = 0.9109 dN = 0.0078 dS = 0.0086 NdN = 1.8 0.3 SdS = 0.3

Model 1: NearlyNeutral (2 categories) lnL(ntime: 23 np: 26): -1110.361397 +0.000000 ... kappa (ts/tv) = 2.28035 dN/dS (w) for site classes (K=2) p: 0.49440 0.50560 w: 0.08871 1.00000 ... branch = 14..1 t = 0.025 N = 234.4 S = 38.6 dN/dS = 0.9109 dN = 0.0078 dS = 0.0086 NdN = 1.8 SdS = 0.3

Model 2.... Model 7 ... Model 8...

In this moment I can´t continue with my thesis, because I can´t find the way for made the best model to represent ω.

Any idea or comments is welcome!

Best wishes!

gene PAML CODEML evolution sequence • 698 views
ADD COMMENT

Login before adding your answer.

Traffic: 1852 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6