Dear poppr community,
I hope this finds you well.
I wish to set my MLG cutoff threshold using filter_stats, but I am having issues wrapping my head around my current output.
I have a VCF file with SNP data that I converted into a SNPclone object, and then ran the following commands in R:
thresholds <- filter_stats(gl, distance = bitwise.dist, plot = TRUE)
thresholds
My plot looks as follow:
My questions are as follows:
- I am unsure if I am fully understanding how this plot is made and what it represents, as I would expect that as the genetic distance cut-off increases, the number of MLGs called decreases. However, there is an unexpected second peak in the histogram, close in size to the first one, at around a genetic distance of 0.025.
- From reading the online tutorial, I would interpret the grey histogram in the background to represent the total number of MLGs called at a given genetic distance cutoff. Is the histogram representing cumulative MLGs called across all three clustering methods, or a mean/median amount for all three clustering approaches?
Thank you in advance for your time and help. I really appreciate it.
Best regards,
Victoria