--
You received this message because you are subscribed to the Google Groups "dadi-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dadi-user+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dadi-user/66312c61-08e7-4a7b-bfb6-c0f25354e2f8n%40googlegroups.com.
Hi Ryan,
Following our previous discussions, we performed demographic inference analyses for a set of species using ddRAD sequencing data. We fitted the following three-population demographic models:
split_nomig
split_symmig_adjacent
split_symmig_all
ancmig_adj_1
ancmig_adj_2
ancmig_adj_3
refugia_adj_1
refugia_adj_2
refugia_adj_3
For reference, I have attached the results for two species datasets: dadi-cli-sample-results
I had a few questions and would appreciate your input.
We are using the following equations to convert the population-scaled estimates from dadi into biological units:
Effective sequence length, L = Number of sequenced sites * (number of SNPs input in dadi / total SNPs detected)
N_ref = theta / (4*mut_rate*L)
Ne = nu*N_ref
M = m / (2*N_ref)
t = T*2*N_ref*gen_time
AIC = 2*Number_of_paramters - 2*Log(likelihood)
Are these fine? Would you recommend any modifications?
We were planning to use AIC as a metric for model selection. But as you can see from the table (Species A), the AIC values are very similar across several models, with no clear best-supported model. What would you suggest in such a case?
My concern is that the parameter estimates differ substantially among these competing models. For example, the estimated divergence times for three highlighted models are on the order of 1e5, 1e4, and 2e6 years, respectively.
You previously suggested examining residual plots to evaluate model fit. I have attached the residual plots of the same analyses here: dadi_sample_resedual_plots. Could you please advise on how best to assess model fit from these plots and whether any of the models appear clearly preferable based on the residual patterns, again taking into consideration the variability of estimated parameters?
Some models (highlighted in Species B) have not converged despite very long runtimes (~800 CPU hours). They seem to have less convincing residual plots, but their AICs are lower than those of the converged models. What would you recommend in such cases? Should we exclude them from further analyses of the results, wait for the models to converge, or some other suggestion?
Our sample sizes are relatively small, with approximately 5–10 individuals per population. Could the limited sample sizes be contributing to these issues?
Thanks and Regards,
Rayis.