Hi Amber,
I have just had time to check your input files now, apologies for the delay. It seems that you are not using the correct format (please see
the PAML Wiki for details about required format). You may want to read our latest protocol (
Álvarez-Carretero et al., 2023) before you run your analysis to get familiar with the input files and the tests that you can run with CODEML. In addition, you will find all the details regarding input files, examples of tests that you can run, and results interpretation.
In short, your sequence file is in FASTA format and your tree file still has branch lengths. You need to format these input files so that your alignment file is in PHYLIP format and your tree file contains a PHYLIP header (<num_taxa> <num_trees, e.g., `5 1` in your case) followed by the tree in Newick format without branch lengths or other labels that are not labels to be used by CODEML (e.g., labels to identify foreground branches under the branch and branch-site model) -- please refer to the PAML Wiki or our protocol for further details. When checking your input files, I also incorporated only the relevant options in the control file to run a sites model in batch (M0, M1a, M2a, M7, and M8; you also had tried to run this in your contrl file) as you had other options that are not relevant (
please check our GitHub repository, which is supposed to be used alongside our protocol). You seem to have a STOP codon in seq #3 (Mac.mul_NP_001035330.1) as per the warning in CODEML, so you may want to check the sequences before you run CODEML -- please note that you should run CODEML only once you have finished to filter your data and you have a well curated alignment and tree files after all filtering processes.
Once you filter your dataset and make the required formatting changes, you shall be able to run CODEML. I have attached the modified files with which I have troubleshooted your issue in case you want to use them as templates :)
Hope this helps!
S.