Hi Alicia,
according to that message, your alignment is not actually an alignment. Did you check whether all sequences (references and queries) are aligned to each other, i.e., whether every sequence involved has the same length?
From the example sequence that you posted, it looks like there are no gaps in the sequence. This suggests that it is not aligned yet. If you haven't aligned your queries yet, you can use PaPaRa for this: https://sco.h-its.org/exelixis/web/software/papara/index.html
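If it helps, the length check can be sketched in a few lines of Python. This is only a minimal illustration, not part of RAxML, EPA-ng, or PaPaRa: the helper names (read_fasta, length_report, is_aligned) and the toy sequences are made up, and it assumes plain FASTA input.

```python
from collections import Counter
from io import StringIO


def read_fasta(handle):
    """Parse FASTA text into a list of (name, sequence) pairs."""
    records, name, chunks = [], None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:].strip(), []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def length_report(records):
    """Count how many sequences have each length (gaps included)."""
    return Counter(len(seq) for _, seq in records)


def is_aligned(records):
    """True only if every sequence has the same length."""
    return len(length_report(records)) == 1


# Toy data (made up): the query is shorter than the references and has no gaps.
example = StringIO(">ref1\nACG-TA\n>ref2\nAC--TA\n>query1\nACGT\n")
records = read_fasta(example)
print(length_report(records))
print(is_aligned(records))
```

For a real file you would pass an open file handle to read_fasta instead of the StringIO object. More than one entry in the length report means the file is not a valid alignment.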
Cheers
Lucas
--
You received this message because you are subscribed to the Google Groups "raxml" group.
To unsubscribe from this group and stop receiving emails from it, send an email to raxml+un...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi Alicia,
that sounds alarming; the results should not differ that much!
Unfortunately, Pierre, who is the developer of EPA-ng, is away for
a few days. When he returns, we will investigate this.
So, please be patient, we will get back to you once we know
what's going on with your data.
Best
Lucas
Hi Alicia,
to reiterate what I sent you in an accidental personal mail instead of to the list:
- by default, epa-ng uses its placement heuristic, whereas raxml
doesn't (use --no-heur with epa-ng to do the same)
- even so, the difference is strange
I've had some time today to look at your data, and I can indeed replicate the strange results you show.
I noticed that the 119 reference sequences, which are also at the
beginning of the papara_alignment.Sanabria1000_90_reduced.fasta
file, differ from the ones you supply with
ssu_globalEuk_ALtrimAl.fasta. This should not be the case! The
query sequences need to be aligned against a fixed reference, out
of which the reference tree was inferred.
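A check like this can be scripted. The following is a rough Python sketch, assuming both alignments have already been read into lists of (name, sequence) pairs and that the reference sequences keep the same names in both files. The function name compare_references and the toy records are hypothetical.

```python
def compare_references(ref_records, combined_records):
    """Compare each reference sequence with its same-named entry in the
    combined (reference + query) alignment.

    Returns (missing, changed): names absent from the combined alignment,
    and names whose aligned sequence is not identical.
    """
    combined = dict(combined_records)
    missing, changed = [], []
    for name, seq in ref_records:
        if name not in combined:
            missing.append(name)
        elif combined[name].upper() != seq.upper():
            changed.append(name)
    return missing, changed


# Toy data (made up): ref2 was re-aligned and ref3 was dropped.
reference = [("ref1", "ACG-TA"), ("ref2", "AC--TA"), ("ref3", "ACGGTA")]
combined = [("ref1", "ACG-TA"), ("ref2", "A-C-TA"), ("query1", "AC-GTA")]

missing, changed = compare_references(reference, combined)
print("missing:", missing)
print("changed:", changed)
```

If both lists come back empty, the references in the two files are identical; anything else means the queries were aligned against a different reference than the one supplied.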
So one explanation would be that the tree you supplied may be based on the "wrong" alignment. I pried apart papara_alignment.Sanabria1000_90_reduced.fasta into its query and reference parts and fed them separately to epa-ng, and the results are more sane. However, the tree log-likelihood is still weird (-70k compared to raxml's -35k), indicating that something is still wrong.
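For illustration, here is one way such a split could look in Python. This is only a sketch and not necessarily how it was done here: it assumes the records are already parsed into (name, sequence) pairs and that the references come first in the file (119 of them in this case). The names split_alignment and to_fasta and the toy records are hypothetical.

```python
def split_alignment(records, n_ref):
    """Split a combined alignment into (references, queries),
    assuming the first n_ref records are the references."""
    if not 0 < n_ref <= len(records):
        raise ValueError("n_ref must be between 1 and the number of records")
    return records[:n_ref], records[n_ref:]


def to_fasta(records):
    """Render (name, sequence) pairs as FASTA text."""
    return "".join(f">{name}\n{seq}\n" for name, seq in records)


# Toy data (made up): two references followed by two queries.
combined = [
    ("ref1", "ACG-TA"),
    ("ref2", "AC--TA"),
    ("query1", "AC-GTA"),
    ("query2", "A--GTA"),
]

references, queries = split_alignment(combined, n_ref=2)
print(to_fasta(references), end="")
print(to_fasta(queries), end="")
```

Each part can then be written to its own file and passed on separately as the reference and query alignment.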
As for the trimming: what matters most is that the queries are
aligned to the fixed reference, and that the number of sites for
the query and reference alignment are identical.
--
MSc Pierre Barbera
Phone: +49 6221 533 258
Fax: +49 6221 533 298
E-Mail: pierre....@h-its.org
HITS gGmbH
Schloss-Wolfsbrunnenweg 35
D-69118 Heidelberg
Amtsgericht Mannheim / HRB 337446
Managing Director: Dr. Gesa Schönberger
Scientific Director: Prof. Dr. Michael Strube