Hi Zjon,
You are correct that detailed information per site is only recorded on sites found to be variable in the metapopulation. We do record all the site depths internally, but to record all the details of the model for every nucleotide would end up being a very large amount of data that has to be stored in the catalog. In principle, the depth could be output into the --vcf-all output, but what is your use case that would require this type of data?
Best,
julian
Hi Julian,
Thanks for getting back to me. We were wanting to apply further filtering on the resultant vcf file (including both a minimum and maximum depth cutoff, and genotype quality filtering) and then use that file for calculating nucleotide diversity as well as heterozygosity using all sites. Our concern is that if I filter the variable sites but not the linked invariant sites in each locus, we will get a biased estimate of nucleotide diversity. We also wanted to include invariant stacks for these calculations which currently have no depth information at all.
Many thanks,
Zjon