Hi Johannes,
Thank you very much for the detailed instructions on how to set up and run RAxML with a custom 3Di alphabet and model (charmap.txt + PROTGTR{…}+M{…}). I appreciate the clear steps for defining the character mapping and substitution matrix in PAML format.
I do have one follow-up question about the comparability of likelihoods: you noted that “likelihoods between different datasets cannot be compared,” even when both datasets use the same number of states (20), the same character labels, and the very same sequence alignment as a starting point (one translated into amino-acid states, the other into 3Di structural states). I understand that the likelihood is tied to the specific data type and its underlying distribution, but given that our 3Di alignment is generated directly from the AA alignment (so it has identical length, taxa, and state alphabet), could you elaborate a little on the statistical rationale for why the two likelihoods—and therefore BIC or AIC scores—remain fundamentally incomparable?
I’m interested in evaluating which models (AA vs. 3Di) yields more reliable phylogenetic estimates. Ideally one would benchmark against a known “true” tree, but simulated alignments are inherently biased by the simulation model itself. Do you have recommendations for robust model-comparison workflows under different alphabets that minimize such circularity?
Thanks again for your help!
Best regards,
David FerreiroHi David,
I might have been wrong on the comparability. We tried to get phylogenic inference working with a structural alphabet (a different one from 3Di) for the past six months, and we didn't compare likelihoods out of fear that changing the underlying data source (i.e. exchanging amino acid data with structural data) has an effect that we cannot accurately correct for with BIC/AIC. But this probably has been done before, when people compared Codon models with Amino Acid models, so maybe it works. I am not sure.
In any case, we came to the conclusion that the signal in structural data is worse. In our case we compared to reference taxonomies for various phyla, and were unable to match the performance of Amino Acid models, neither with the structural sequences, nor
with a combined approach of using both sequences at once. In our case the structural sequence had more states (36) than amino acid, so without BIC our likelihoods were incomparable. Thus, i don't know if that would have yielded different results.
However, a similar study to ours, if less in-depth, has been done with 3Di as well, so you may be able to compare to their methods: https://doi.org/10.1101/2024.08.02.606352
Best regards,
Johannes