Hello,
I have collection of a few hundred genomes in individual FASTA files; some of them are finished genomes, some are in draft form.
I would like to build a phylogenetic tree for these genomes using Phylophlan and the "-u" option.
As far as I understood, Phylophlan requires the genomes not as nucleotides but as peptides.
What is the suggested way to create this input when it is not readily available, i.e., when ".faa" files are not available, e.g., from NCBI?
So far, I have used prodigal (with the "-a" option) to output the translated CDS sequences into per-genome .faa files and put these into the input/ folder as suggested in the Phylophlan documentation.
Phyloplan performed the usearch run and generated .b6o files for every genome. So far, so good, apparently.
However, the execution of the program crashed immediately after this with the following message:
Traceback (most recent call last):
File "/PATH/TO/PHYLOPHLAN/phylophlan.py", line 777, in <module>
gens2prots(inps, projn)
File "/PATH/TO/PHYLOPHLAN/phylophlan.py", line 301, in gens2prots
assert vv not in prots2ups
AssertionError
And I have no idea what might be the issue.
Looking forward to your input.
Thank you very much in advance.
Best,
Cedric