Issues with input tree

Avehi Singh

Dec 16, 2022, 1:16:28 PM
to PAML discussion group
Hi all,

I've been having some issues with the input files for PAML site models (I have attached my codon file, tree and codeml.ctl files below).  I am using PAML 4.9 on Ubuntu 22. Running codeml on the tree and codon file produces this (clipped) error message:

Sites with gaps or missing data are removed.
28 ambiguity characters in seq. 46
10 sites are removed.  233 234 235 236 237 238 239 240 241 242
Sequences read..
Counting site patterns..  0:00

Counting codons..
    12768 bytes for distance
   403088 bytes for conP
    36344 bytes for fhK
  5000000 bytes for space

Model 0: one-ratio
Seq #51 (Obicornis_bicornis) is missing in the tree


The sequence above is, in fact, in the tree, so I can't figure out why PAML can't parse it. I've tried several troubleshooting measures over the last few days but so far, nothing has worked. This is what I've tried so far:

- ensuring that the sequence names in the tree and codon file were the same
- checking for line-ending characters (CR and CRLF) -- all files are in ASCII
- removing branch lengths from the phylogeny
- checking sequence lengths of the codon file match (they are 1278 characters long)
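
A minimal Python sketch of the name and length checks listed above, in case it helps others reproduce them. It assumes the tree is a plain Newick string without quoted labels and that the sequences have already been read into (name, sequence) pairs. The helper names `leaf_names` and `compare_inputs` and the toy data are hypothetical and are not part of PAML.

```python
import re

def leaf_names(newick):
    """Extract leaf labels: tokens that directly follow '(' or ','."""
    return re.findall(r"[(,]\s*([^\s(),:;]+)", newick)

def compare_inputs(newick, records):
    """records: list of (name, sequence) pairs already read from the codon file."""
    tree = set(leaf_names(newick))
    aln = {name for name, _ in records}
    return {
        "missing_in_tree": sorted(aln - tree),
        "missing_in_alignment": sorted(tree - aln),
        "sequence_lengths": sorted({len(seq) for _, seq in records}),
    }

# Toy data for illustration only (not the attached files)
newick = "((sp_A:0.1,sp_B:0.2):0.3,sp_C);"
records = [("sp_A", "ATGAAA"), ("sp_B", "ATGAAG"), ("sp_D", "ATGAAA")]
report = compare_inputs(newick, records)
print(report)
```

Note that a set comparison like this does not detect a name that occurs more than once in the alignment.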

Could anyone help me figure out what I'm doing wrong? 

Thank you so much in advance,
Avehi 

codeml.ctl
new.newick
IR_1.codon

Janet Young

Dec 16, 2022, 4:43:26 PM
to PAML discussion group
It looks like Obicornis_bicornis is present twice in your alignment file. That is most likely the issue.
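
One way to check for repeated names, as a rough Python sketch. It assumes a sequential layout with a header line followed by one "name  sequence" record per line, which may not match every file codeml accepts. The functions `read_names` and `duplicated_names`, the name `Other_species`, and the short sequences are made up for illustration.

```python
from collections import Counter

def read_names(text):
    """Return sequence names, assuming a header line and then one
    'name  sequence' record per non-blank line."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    return [ln.split()[0] for ln in lines[1:]]

def duplicated_names(names):
    """Map each name that occurs more than once to its count."""
    counts = Counter(names)
    return {name: n for name, n in counts.items() if n > 1}

# Toy alignment for illustration only
example = """3 6
Obicornis_bicornis  ATGAAA
Other_species  ATGAAG
Obicornis_bicornis  ATGAAA
"""
dupes = duplicated_names(read_names(example))
print(dupes)  # {'Obicornis_bicornis': 2}
```

An empty result would mean every name in the file is unique under the layout assumed here.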

Avehi Singh

Dec 19, 2022, 11:24:31 PM
to PAML discussion group
Thank you so much! I changed the labels of the duplicate sequences and this worked!