I want to use the Approximately Unbiased test to compare tree topologies. I want to compare the Maximum Likelihood tree to multiple other trees that have constraints on the monophyly of the taxa of interest.
I obtained four trees using likelihood methods, and used those as the input in IQ-TREE to run the AU test. I'm actually most interested in comparing trees #2 and #3 (alternative phylogenetic hypotheses) to tree #1 (the ML tree) shown below. Tree #4 is a tree in which I included monophyletic constraints that I knew would produce a tree that was much worse than the ML tree. So the fourth tree is expected to differ significantly.
I'm noticing a strange result with respect to the behavior of the AU test. I will share the results here, and if someone wishes to look at the input data files I'm happy to share them by email.
See belo509raxml_8loci_r1.phy.trees for trees with branch lengths.
Tree logL deltaL bp-RELL p-KH p-SH c-ELW p-AU
-------------------------------------------------------------------------
1 -42271.306 0.000 0.5485 + 0.5937 + 1.0000 + 0.5457 + 0.5115 +
2 -42272.563 1.257 0.4140 + 0.4063 + 0.7919 + 0.4154 + 0.4794 +
3 -42296.487 25.181 0.0375 - 0.0546 + 0.3303 + 0.0389 - 0.4705 +
4 -42736.708 465.401 0.0000 - 0.0000 - 0.0000 - 0.0000 - 1.0000 +
deltaL : logL difference from the maximal logl in the set.
bp-RELL : bootstrap proportion using RELL method (Kishino et al. 1990).
p-KH : p-value of one sided Kishino-Hasegawa test (1989).
p-SH : p-value of Shimodaira-Hasegawa test (2000).
c-ELW : Expected Likelihood Weight (Strimmer & Rambaut 2002).
p-AU : p-value of approximately unbiased (AU) test (Shimodaira, 2002).
Plus signs denote the 95% confidence sets.
Minus signs denote significant exclusion.
All tests performed 10000 resamplings using the RELL method.
So just to be sure that I understand this correctly, the AU test compares each of the user trees to the user tree that has the greatest logL. In other words, I am making three comparisons: T2-T1, T3-T1, and T4-T1. Is that correct?
Then I would interpret these results to say that T4 is significantly worse than T1 based on all tests except p-AU. For that test the p-value is 1.000. This seems strange to me. So I'm writing to ask whether this is an error in the way that I'm running the analysis, or interpreting the results, or whether there is a problem with the calculation of p-AU?
I have compared the results from IQ-TREE v1.6.6 on XSEDE to those obtained using the IQ-TREE web server v1.6.6. Using both platforms, I am experiencing the same problem. I tried to run the same dataset using v1.6.2 on XSEDE, however, it completes the tree inference analysis, and when it arrives at the user trees it gives me an error in the log file: "Tree 1ERROR: #leaves and taxa_set do not match!" (even though the same exact files do work in 1.6.6).
Thank you in advance for your help!