Posterior Distributions: M looks great, θ does not

53 views
Skip to first unread message

George zaragoza

unread,
Aug 16, 2023, 3:14:28 PM8/16/23
to migrate-support
I'm revisiting an old dataset with a small number of SNPs (25) but large number of individuals (>700). I am assuming 3 subpopulations. I am using the following parameters to estimate M and θ:

updatefreq=0.200000 0.200000 0.200000 0.200000 #tree, parameter haplotype, timeparam updates

bayes-posteriorbins=1500 1500 1500 1500 1500

bayes-posteriormaxtype=TOTAL

bayes-file=NO

bayes-allfile=NO

bayes-all-posteriors=NO

bayes-proposals= THETA METROPOLIS-HASTINGS Sampler

bayes-proposals= MIG METROPOLIS-HASTINGS Sampler

bayes-proposals= DIVERGENCE METROPOLIS-HASTINGS Sampler

bayes-proposals= DIVERGENCESTD METROPOLIS-HASTINGS Sampler

bayes-proposals= GROWTH METROPOLIS-HASTINGS Sampler

bayes-priors= THETA * * UNIFORMPRIOR: 0.000000 10.000000 1.000000

bayes-priors= MIG * * UNIFORMPRIOR: 0.000000 1000.000000 100.000000

bayes-priors= RATE * * UNIFORMPRIOR: 0.000000 10000000000.000000 1000000000.000000

While M has looked great and given nice normal posterior distributions, θ has been either susceptible to what seems like local optimums (many small humps near a peak) or has been at the maximum end of the posterior distribution. What I've resorted to now is slightly increasing my upper bound of 
θ priors with the hope that my run time does not increase substantially. I was hoping to hear feedback on whether I should stick to this approach or if there's something I may not be considering when setting my parameters.

George zaragoza

unread,
Aug 19, 2023, 7:02:10 PM8/19/23
to migrate-support
Was revisiting this and realized a few more details would be good:
  • I'm running migrate v5.0.3
  • I'm running Migrate on a computer cluster using a single node with 16 CPUs, each with 16,000 MB of RAM
    • I am not using the parallel version of migrate as suggested by my cluster admin as I am only using one node
  • My estimated time of completion for a job with a SNP HapMap with 25 alleles and (corrected from above) 449 animals (i.e., 898 individual alleles/gametes) 15 days (though my actual time of completion was 15 days)
    • My actual time of completion was 8 days
  • Migrate usually runs until about ~0.5 "done" as below (maybe something to due with convergence? Most of my data is usually OK by this point):
    • 20:34:20   Sampling: prognosed end of run is 18:38 August 22 2023 [0.545455 done]
Hopefully this helps. With 16 CPUs I was hoping such runs would be quicker but I may just need to be patient.
Reply all
Reply to author
Forward
0 new messages