Some general tips:
- plink2 loads the psam and pvar files near the beginning of typical invocations. However, it may not need to load more than the pgen's header; see
the documentation on --read-freq/--error-on-freq-calc for a way to avoid the sometimes-expensive "Calculating allele frequencies..." step.
- From the --glm documentation:
"If you have multiple quantitative phenotypes with either no missing values, or missing values for the same samples, analyze them all in a single --glm run! PLINK 2.0's linear regression 'only' tends to be a few hundred times as fast as PLINK 1.9 when you analyze one quantitative phenotype at a time. But --glm also has a quantitative-phenotype-group optimization that can multiply the speedup by another factor of ~10."
- It doesn't come up much, but there is a
--loop-cats flag for rerunning the same analysis on a bunch of disjoint categories.