EPA-NG taxa in tree not found in reference MSA

31 views
Skip to first unread message

Kenta Renard

unread,
Mar 13, 2024, 6:11:33 PM3/13/24
to Phylogenetic Placement
Hi,

I am confused as to why the error message "The reference Tree contained taxa that could not be found in the reference MSA" pops up when I try my EPA-NG run. The reference MSA I used is the same one that I used to generate the reference tree. I don't think it's an issue of reading the tree since it is saying the taxa in the tree can't be found in the MSA. Both MSAs are FASTA format and I checked that the taxa are there.

Could you please help me troubleshoot this?

Best wishes,
Kenta

Alexandros Stamatakis

unread,
Mar 14, 2024, 12:41:27 AM3/14/24
to phylogeneti...@googlegroups.com
Well maybe you did some post-processing or there is an issue with the
taxon names. Did you use RAxML to infer the tree or some other tool?

Are you getting a more concrete error message as to which taxon is missing?

It will be hard to help here without the actual data.

Alexis
> --
> You received this message because you are subscribed to the Google
> Groups "Phylogenetic Placement" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to phylogenetic-plac...@googlegroups.com
> <mailto:phylogenetic-plac...@googlegroups.com>.
> To view this discussion on the web, visit
> https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com <https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com?utm_medium=email&utm_source=footer>.

--
Alexandros (Alexis) Stamatakis

ERA Chair, Institute of Computer Science, Foundation for Research and
Technology - Hellas
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology

www.biocomp.gr (Crete lab)
www.exelixis-lab.org (Heidelberg lab)

Kenta Renard

unread,
Mar 14, 2024, 7:03:09 AM3/14/24
to Phylogenetic Placement
Dear Alexandros,

Thank you for your reply. I used RAxML to infer the tree, yes. The error message says ALL the taxa found in the tree are not seen in the reference MSA. I have attatched both my query and reference MSAm, and the best model and tree file.

Best wishes,
Kenta
info4.raxml.bestTree
info4.raxml.bestModel
ref_seqs.fasta
query_seqs.fasta

Alexandros Stamatakis

unread,
Mar 15, 2024, 9:48:28 AM3/15/24
to phylogeneti...@googlegroups.com
Dear Kenta,

I assume the problem is the discrepancy between FASTA and tree file
names, e.g. you have in fasta:

WP_006909616.1 hypothetical protein [Cyanobium sp. PCC 7001]

and in thy Newick file only:

((WP_006909616.1:

so I guess the part "hypothetical protein [Cyanobium sp. PCC 7001]" in
the fasta file is the problem and I would guess that if you remove all
that meta-information it should work,

Alexis
> https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com <https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com> <https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com?utm_medium=email&utm_source=footer <https://groups.google.com/d/msgid/phylogenetic-placement/0bafd54f-7bb8-410b-8dcd-2b72ae4a523bn%40googlegroups.com?utm_medium=email&utm_source=footer>>.
>
> --
> Alexandros (Alexis) Stamatakis
>
> ERA Chair, Institute of Computer Science, Foundation for Research and
> Technology - Hellas
> Research Group Leader, Heidelberg Institute for Theoretical Studies
> Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
>
> www.biocomp.gr <http://www.biocomp.gr> (Crete lab)
> www.exelixis-lab.org <http://www.exelixis-lab.org> (Heidelberg lab)
>
> --
> You received this message because you are subscribed to the Google
> Groups "Phylogenetic Placement" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to phylogenetic-plac...@googlegroups.com
> <mailto:phylogenetic-plac...@googlegroups.com>.
> To view this discussion on the web, visit
> https://groups.google.com/d/msgid/phylogenetic-placement/fe97ce8c-66fb-4043-802a-b959e0d989e5n%40googlegroups.com <https://groups.google.com/d/msgid/phylogenetic-placement/fe97ce8c-66fb-4043-802a-b959e0d989e5n%40googlegroups.com?utm_medium=email&utm_source=footer>.
Reply all
Reply to author
Forward
0 new messages