Hi Ryan and dadi-user group,
Thank you for all your help with my other questions directly so far and having all the detailed threads to peruse.
I have completed an analysis of three 1D models (1 - 3 epochs) for a species I was interested in seeing about historical population change because they are an alpine specialist and I was expecting declines since LGM. For each model, I ran 100 iterations of the dadi_pipeline (100 optimization rounds per iteration, Portik et al. 2017) and checked for convergence of the results. Only the one epoch model appeared to converge to a global optimum (albeit very poor), but the two and three epoch models didn't necessarily converge on the same parameters with their best models, though their parameters followed similar patterns, i.e. the best two epoch models all decreased in pop size, and the three epoch increased and then decreased. While the two epoch model had the highest likelihood, the theta values were extremely high and the calculations of T and Nu were magnitudes higher than I expected. L was already estimated at ~30,000,000 so even if that was underestimated it wouldn't be producing the expected T and Nu. I did realize though this species is part of a species-complex that has historically high levels of gene flow so maybe that factor is influencing the very high levels of theta. One aspect of this data set that I'm not sure would affect dadi is that males and females have high genetic differentiation, e.g. PCA picks up on sex differentiation over the very weak population structure, so I also ran dadi on just the males.
I hope that explanation sets up my questions well. I have attached my results I had compiled in markdown (models, residuals, LL, thetas, parameters, L) and my specific questions are:
1) Is it reasonable that a large species complex with gene flow would produce extremely high theta values when sampling just one species that is pretty small (abundance estimates ~95,000, effective size via SNeP ~1000)?
2) Even if a model isn't converging on a best set of parameters, if numerous models after 100 optimizations follow the same patterns (e.g. Nu2 decreases from Nu1), is that reasonable enough support for the model but not the specific set of parameters?
Thanks so much, I really appreciate the help.
Matt