Dear Xavi (CC Lars Jermiin, in case you want to add something),
You are right that LG+G7 is the closest match to LG+R7, though they are still quite different.
Moreover, since you have such long sequences (20K aa), I suggest that you look at the composition test result (in the log file) and if many sequences failed the test, you would try the C10-C60 models as well (they are the ML version for PhyloBayes’ CAT model). You can use LG+C10+R7 model, for example, and look if such model provides better fit to the data. This is because -m TESTNEW by default does not include complex models due to computational reason. In fact you can instruct IQ-TREE to consider these models by e.g.:
iqtree … -m TESTNEW -madd LG+C10+R7,LG+C20+R7
And finally if the C10 model series takes too much time and memory during subsequent analysis, you can use the recently proposed site frequency model (see
http://www.iqtree.org/doc/Complex-Models/#site-specific-frequency-models)
Cheers, Minh
> --
> You received this message because you are subscribed to the Google Groups "IQ-TREE" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
iqtree+un...@googlegroups.com.
> To post to this group, send email to
iqt...@googlegroups.com.
> Visit this group at
https://groups.google.com/group/iqtree.
> For more options, visit
https://groups.google.com/d/optout.
--
Bui Quang Minh
Center for Integrative Bioinformatics Vienna (CIBIV)
Campus Vienna Biocenter 5, VBC5, Ebene 1
A-1030 Vienna, Austria
Phone: ++43 1 4277 74326
Email: minh.bui (AT)
univie.ac.at