Hi Zhian,
I have a genomic database of thousands of samples across a species. Within the database are a handful of populations of individuals that are clonal cuttings of certain genotypes, some populations that are full siblings to each other and some populations that are half-siblings to each other. The rest (and majority) of the database are considered unrelated random individuals collected from across the species.
I am trying to identify the clonal threshold in the database to determine if two randomly chosen samples are compared to each other, I can reliably identify them as clonal.
The input data is a vcf with thousands of SNPs per individual, thousands of individuals.
I am stumped on the correct approach here based on how poppr functions. I think the approach should be turn the vcf into a genclone object, isolate the clonal populations only into its own genclone object from the rest of the database, and do mlgfilter and then cutoff predictor on the clonal populations to identify the clonal threshold?
Lastly, is removing the rest of the database when trying to identify a cutoff biasing results in some way?
Thank you for your time,
Kerin