Hi Inbar,
Best,
julian
--
Stacks website: http://catchenlab.life.illinois.edu/stacks/
---
You received this message because you are subscribed to a topic in the Google Groups "Stacks" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/stacks-users/4EwlD0lwR38/unsubscribe.
To unsubscribe from this group and all its topics, send an email to stacks-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/stacks-users/BN7PR11MB25461FEFD24BF7B029FA68D9A7C89%40BN7PR11MB2546.namprd11.prod.outlook.com.
Hi Inbar,
The catalog is not affected by the filters you provide. The catalog always contains everything found in the dataset. Instead, the flag you specified is passed to populations and the populations-specific output will respect the filtering parameters you asked for. More generally, the core pipeline is designed to be run once (given an optimized set of de novo assembly paramters) and then populations is designed to be run multiple times, with different population maps and/or filters or export formats. I suggest you take a look at our protocol with explains a good bit of this strategy: https://link.springer.com/protocol/10.1007/978-1-0716-2313-8_7.
Best,
julian
From:
stacks...@googlegroups.com <stacks...@googlegroups.com> on behalf of Inbar Maayan <ima...@g.harvard.edu>
Date: Friday, March 3, 2023 at 9:04 AM
To: Stacks <stacks...@googlegroups.com>
Subject: Re: [stacks] Global minimum samples in ref_map.pl / Phasing in catalog.fa.gz?
Hi Julian,
I've tried running ref_map.pl with the -X "populations:--min-samples-overall 0.15" addition (I have 267 individuals in my dataset so I'm shooting for a minimum of 40, which is about 15%), but when I look at my catalog.fa.gz there are still many loci with NS<40: