error: edid 317 / 317 site patterns

90 views
Skip to first unread message

Yihang Zhao

unread,
Feb 16, 2025, 4:11:23 PMFeb 16
to PAML discussion group
Dear sir,
After a relatively successful launch of paml I have got error message at the very last steps before the analysis finished

BEBing (dim = 6).  This may take many minutes.
Trying to get 44.0M memory in lfunNSsites_ACD

Calculating f(x_h|w) for 120 w's
          120 / 120 sets.  w for branches:  2.8500 2.8500 lnfX = -339757.5318
Time: 48:09
Calculating the marginal likelihood f(X).
        log(fX) = -124829.687134  S = -124592.510060  -230.840122
Calculating f(w_k|X), posterior for site classes for each site.  Slow!
        did 290 / 317 site patterns  48:16
error: edid 317 / 317 site patterns  48:17(paml)

Could you please tell me what this error is?
  paml version: CODONML in paml version 4.10.8, November 2024

Thank you in advance,
Yihang Zhao

Sandra AC

unread,
Feb 19, 2025, 7:12:06 AMFeb 19
to PAML discussion group
Dear Yihan Zhao,

Thanks for your message! Could you please share your input files and control file so that we can further troubleshoot this issue?

All the best,
Sandy

Yihang Zhao

unread,
Feb 19, 2025, 3:20:43 PMFeb 19
to PAML discussion group
Hi

Please see the attached files for my input files and control file

Best
Yihang

Equal_branch_length_corrected_Ga_supertree_467_taxa_DNA_clustalo_backtrans_RAxML_strict_conv.tre
Ga_supertree_467_taxa_DNA_clustalo_backtrans.phy
codeml.ctl

Yihang Zhao

unread,
Feb 25, 2025, 11:37:47 AMFeb 25
to PAML discussion group
Hi

I just want to follow-up on this question and see if there's any update on this issue. Also, I think I forgot to mention this but the error message mentioned in this post is found in the terminal when paml finish its run, not found in the output file. And the paml run did produce an output file that seems normal in all aspects. Hope this information help!

Best
Yihang

On Wednesday, February 19, 2025 at 7:12:06 AM UTC-5 sandr...@gmail.com wrote:

Sandra AC

unread,
Feb 28, 2025, 4:47:09 AMFeb 28
to PAML discussion group
Ηi Yihang,

Thanks for your patience and apologies for the delay, busy times! I have checked your input files and the input tree file does not have a header. You should add the PHYLIP header following the notation "num_taxa  num_trees". If you have 462 taxa, then the PHYLIP header in your input tree file should be "462  1".

The message "did X/X site patterns" lets you know about how many site patterns have been parsed when reading a specific locus in your input sequence file. If you have checked the output file and the run did indeed complete, perhaps the last messages were not printed on the screen (did you use your laptop or did you run the job on a HPC?). I have formatted your input alignment file so that each sequence is in one line (just in case the format has affected the run, not sure what may have exactly caused to prompt this error on the screen). The following are the commands I have used:

# Read your input alignment and generate a
# tmp FASTA file
count=0
while read -r line
do
count=$(( count + 1))
new_line=$(echo $line )
if [[ $line =~ "_" ]]
then
new_line=$( echo $new_line | sed 's/^/\>/' )
fi
if [[ $count -eq 1 ]]
then
echo $new_line > aln.fasta
else
echo $new_line >> aln.fasta
fi
done < Ga_supertree_467_taxa_DNA_clustalo_backtrans.phy
# Get rid of the PHYLIP header and the second
# blank line and carriage characters
sed -i 's/\r//g' aln.fasta
sed -i '1,2d' aln.fasta
# Give first permissions to all my perl scripts
# in case you have not done so
chmod 775 *pl
# Call my perl script to format the FASTA
# file so that there is one line per sequence
./one_line_fasta.pl aln.fasta
# Remove first FASTA file and rename new
# one-line FASTA file
rm aln.fasta
mv aln_one_line.fa aln.fasta
# Get the number of taxa and length of
# sequence and run my script to convert
# the FASTA file into PHYLIP format
aln_name=`ls *.fasta`
a_noext=$( echo $aln_name | sed 's/\.fasta//' )
num=$( grep '>' $aln_name | wc -l )
len=$( sed -n '2,2p' $aln_name | sed 's/\r//' | sed 's/\n//' | wc --m )
perl FASTAtoPHYL.pl $aln_name $num $len
# Remove files that are no longer required
# to save space
rm aln.fasta log_lenseq.txt

The attached compressed file has your input alignment file, my "one_line_fasta.pl" and "FASTAtoPHYL.pl" perl scripts that you see I use in the code snippet above, and the output files that result from running the code snippet above, which you will need to re-run CODEML: "aln.phy" (the formatted input sequence file) and "tree.tree" (the input tree file with the PHYLIP header).

Could you please try to re-run CODEML with the formatted input files "aln.phy" and "tree.tree" and see if you do not get this error again? Please let us know if formatting has fixed the issue!

All the best,
Sandy
test_reformatting.zip

Yihang Zhao

unread,
Feb 28, 2025, 4:36:12 PMFeb 28
to PAML discussion group
Yes, the issue is fixed! Thank you very much!

Best
Yihang
Reply all
Reply to author
Forward
0 new messages