Hi All,
May I have your help for me to understand dadi more?
1. Do I need to filter out snp data before running dadi? In detail, I always filter missing data, maf, hwe and linked sites.. et al to get the independent 'neutral sites'. However, in my mind, the maf and hwe filtering would have effects on rare allele, does it matter for dadi running? In all, which filtering for snp data do i need to preform before dadi running?
2. I usually use ANGSD to preform analysis but also I tried use GATK for genotype calling and plink to filtering. But I have 5~8 x coverage for my samples so that ANGSD would be more precise. Thus, Could I use ANGSD sfs estimation results directly to dadi? if so, anything else I need to take care?
3. Actually, my samples (>200 inds) from different locations should be one population according to Admixture and faststructure analysis, also, I found the Fst among different location are pretty small (the largest one =0.008). Thus, I would like to split the one population to few 'subpopulation' or 'cluster' according to PCA or Admixture? Could dadi precise estimate the 'subpopulation' demographic history ?
Thanks a lot
Best,
Yafei