Dear All,
I am trying to use JTT model in codeml. It seems the file should be dat/jones.dat. However, when I jump to the file and other amino acid models (attached bellow), I found something I can not understand, I wish someone can catch me up.
For jones.dat, lines 1 - 24 seems showing the symmetrical part of the rate matrix and aa frequencies and the rate seems showing in absolute count number. However, when compare to wag.dat and lg.dat, the rate seems showing in relative decimal values. Therefore, my first question is are these rate matrix all have the same meaning? For wag.dat and lg.dat, the file itself clearly saying the matrix was not scaled and it will be scaled by paml, but there is no words about jones.dat, does this matrix can be scaled by using the same method? After scaling, can I use the scaled matrix to derive the Q matrix (I think it should be referred transition rate matrix) by using this formula Q_ij = s_ij*pi_j ?
For jones.dat, line 42 - 94 seems not used by paml, but I really want to know what are they and why one is an asymmetrical matrix but the other one is a symmetrical matrix. Does they both can be used in some situation to derive JTT model?
For jones.dat, line 97 - 158, it seems this part showing a transition probability matrix (since all rows add up to 1, it should be a matrix denoted as P, not a Q showed above, which requires all rows add up to 0 and the diagonal element should be a negative value after scaling) and stationary frequencies for 20 amino acids. Is this the right thing for this part want to show?
For dat/wag.dat and dat/lg.dat, both of them only contains the symmetrical part of the rate matrix and aa frequencies, how can I get a transition probability matrix as showed in jones.dat (line 97-159).
Thanks for your consideration.
FEI