Hi Robbie,
I was running STITCH with different combinations of cores and RAM per core (256Gb total in all cases) on 1436 samples and 200,000 variants to find a good optimum for my use case in terms of duration as well as job wait time in the SLURM queue. Here is a summary table of the results. One this that was unexpected was the variation in the size of the vcf.gz output files (510-522Mb). Is this expected because some dosage values and genotypes calls are going to be different in different runs of STITCH? Thank you for your help.
Cores RAM/core (Gb) Hours vcf.gz size (Mb) Total RAM used (Gb)
1 256 5.74 522 66
2 128 3.91 515 100
4 64 2.57 521 119
8 32 1.90 521 122
16 16 1.42 517 142
32 8 1.30 516 236
64 4 NA NA 256
128 2 2.22 510 187