Hello,
I've been running codeml on a large dataset and investigating possible positive selection along certain branches. I used the likelihood ratio test to compare my test model (model = 2, fix_omega=0, NSsites =2) with the null model (model = 2, fix_omega=1, NSsites=2). I used LR = 2*(lnL1-lnL2) then I calculated p-values with 1 d.f.
The results look ok apart from one branch that is under pos selection with high significance (p-value = 1.68E-12, lnL1= -179449.502514, lnL2 = -179474.413117, LR = 24.91) but has no sites identified by NEB or BEB analyses:
kappa (ts/tv) = 1.52404
dN/dS (w) for site classes (K=4)
site class 0 1 2a 2b
proportion 0.92529 0.07471 0.00000 0.00000
background w 0.06474 1.00000 0.06474 1.00000
foreground w 0.06474 1.00000 1.00000 1.00000
Bayes Empirical Bayes (BEB) analysis (Yang, Wong & Nielsen 2005. Mol. Biol. Evol. 22:1107-1118)
Positive sites for foreground lineages Prob(w>1):
There don't look to be convergence problems with this branch but I have tried starting omegas of <1 and >1 anyway. This doesn't change the BEB output. Is there a scenario where a foreground branch would be under positive selection but there are no individual sites that are significantly under PS? Or, is it more likely that this is a problem with my dataset?
Any help greatly appreciated, I am fairly new to PAML.
Thanks!
Milly