Dear Peter and other MIGRATE users,
I am attempting a "large-scale" SNP using MIGRATE-N.
The dataset consists of 3500 SNPS (RAD-SEQ) from 36 individuals grouped into 7 populations (population assignment inferred using STRUCTURE; these groups correspond to mtDNA haplotype groups that are between 2–5% divergent [ND2 gene]). There is no missing data at the population level.
Biological relevance: The populations are sampled from an island archipelago, 4 populations are found on large islands with recent connectivity (hypothesized during low sea levels during last glacial maximum), 3 populations are found on small “peripheral isolate” islands, these populations differ slightly in phenotype. I wish to estimate migration rates between all geographically close populations (the islands have a near-linear arrangement), to determine:
1. Do small island populations have small effective population sizes?
2. Are phenotypically divergent populations linked by gene flow?
3. If so, is gene flow to small islands unidirectional?
I am new to MIGRATE, so I've played around with settings and preliminary runs to familiarize myself with the program. I’ve used mostly default settings (BI, exponential priors), but I’ve used equal base frequencies and an empirical estimate of the ti/tv ratio (as previously suggested), and defined a migration matrix that includes only geographically relevant migration routes.
I used SNAPP to infer a population tree with the same dataset (attached, branch labels are thetas [scaled], all nodes have 1.0 posterior). The population tree is qualitatively similar to the mtDNA tree (reassuring). SNAPP estimated the thetas for each small island (populations 1, 2, and 5) an order of magnitude less that the large islands. Nucleotide diversity for populations 1, 2, and 5 are also an order of magnitude lower than other populations, so this all makes sense.
However, when I run MIGRATE (example attached, I know estimates of a couple Ms haven’t quite reached stationary), the estimates of theta for all populations (large and small islands) are similar (same order of magnitude) to one another. I know that directly comparing theta values from SNAPP and MIGRATE isn’t possible because the thetas in SNAPP are scaled differently, but I expected qualitatively more similar results.
I understand that running MIGRATE with SNP data is problematic. Do you have any ideas why estimates of theta from MIGRATE and SNAPP are so qualitatively different?
Many thanks,
-Pete
--
You received this message because you are subscribed to the Google Groups "migrate-support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to migrate-suppo...@googlegroups.com.
To post to this group, send email to migrate...@googlegroups.com.
Visit this group at http://groups.google.com/group/migrate-support.
For more options, visit https://groups.google.com/groups/opt_out.
<SNAPP_tree.jpg><outfile_SNP.pdf>
looking at the outfile it seems that your 'small' populations are smaller, but definitely not a magnitude smaller as SNAPP suggests,did you look at the 'allele frequency table'? You should see also a 10x difference in number of allele if snapp is correct,
in migrate version 3.5.4 here is a hidden option that you may want to try to see whether the snp model makes a difference,in the parmfile search for the option freqs-from-data
I also wonder whether many of your snps are coming from the same short read, if so then this will lead to an upwards bias (I am working a paper on this).