Hi,
Of course, it is always good to find some research that would help to find the best modeling for your analyses. Nevertheless, nowadays, there are so many sources that it is easy to be misled sometimes. I have no idea what ModelFinder, IQTREE, and, as Shanshan says below, JMODEL do. I assume these are ML-based approaches. And they suggest the best model without extensive tree reconstruction. If so, and as I read in this group from more knowledgeable people, they can't suggest the best model for Bayesian analyses. That is why my approach is to learn the models from earlier studies. So, in the case of human mitochondrial DNA, I used HKY as it has been shown many times that the model fits mtDNA the best. Then, why use a more complex model?
Also, in this group, I read that GTR + I + G4 and Empirical can make a convergence problem. So, for Y chromosome analyses, I use GTR + G4. It has been shown many times that the model works well in the Y chr analyses. You may find the discussions about problems when using GTR + I + G4 and Empirical yourself. It is not an easy task :) Sorry.
By the way, you can use BModelTest, but it is only in Beast 2, to find the model for sites in the same analysis.
Best,
Hovhannes