Hi Natalie,
For most datasets, we would recommend the default 'marukilow' model, it
takes a Bayesian approach (incorporating information about allele
frequencies of the population at each site) and works quite well. The
'marukihigh' model can call more than two alleles per site, but does not
use a Bayesian approach. The 'snp' model is the model by Hohenlohe and
collaborators and it was the default model in Stacks v1. I recommend
Maruki + Lynch's paper to fully understand the new models:
Maruki, T., & Lynch, M. (2017). Genotype Calling from Population-Genomic
Sequencing Data. G3: Genes|Genomes|Genetics, 7(5), 1393–1404.
To determine the level of coverage in your dataset, just average the
coverage of each individual as reported by ustacks (or
denovo_map.pl
will create a table of all coverages for the dataset).
Best,
julian
Natalie B wrote:
> Hi,
>
> I’m quite new to Stacks, using v2.0 and I’m trying to understand the new
> --model option setting in gstacks.
>
> --model — model to use to call variants and genotypes; one of marukilow
> (default), marukihigh, or snp
>
> I think 'marukilow' would be used for datasets with low coverage,
> 'marukihigh' for high coverage, but I’m not sure what the ‘snp’ option
> would be used for?
>
> Also how can I determine the level of coverage in my dataset?I have 531