Dear Ryan and dadi users,
I started using dadi for my demographic analysis and everything worked great, but now i want to estimate my parameters uncertainties and i'm stuck...because to be honest, i don't understand at all the principle of Non-parametric bootstrap of my dataset from independant units of my data. Googling Non-parametric bootstrap didn't help so i hope some of you can answer my following questions:
1) I know the basic principles of a bootstrap : in my file with SNPs, resampling with replacement over my different SNPs, which each represent 1 unit. But i don't understand what you mean when you say ' instead of resampling from the SNPs, resample from regions you've sequenced'. Concretely how does this work? My SNPs are distributed over 9 different scaffolds, so i guess those would be my independant units. But it doesn't mean i sample only 1 SNP per scaffold right? otherwise i would have only 9 SNPs in each of my bootstraped dataset... If i sample several SNPs for each scaffold, then the results is the same as if i was sampling SNPs in the general file without taking in account the scaffold information in my opinion: i would have again SNPs that are potentially not independent because located on the same scaffold...
2) Must the bootstraped datasets have the same length (same number of SNPs) than the original dataset, or is it supposed to be a subset of those initial SNPs? Should the different boostraped datasets have the same total length between each other? Should they have the same number of SNPs from each of my scaffolds?
Thanks
Julie