Dear Peter,
Thank you for explaining the rationale behind this change. I think this would be worth adding to the "Recent changes" page of the docs, for the sake of users who might not be monitoring this forum.
I might have further questions about the parallelism implementation in the future. I still need to reread the docs, but the interactions between different MPI/GA/disk vs. non-disk(?) implementations for all the different subprograms of Molpro feel like a bit of a web to me right now.
> The helper processes of course consume processor cycles. On modern hardware, and especially with the typical high data-movement characteristics of most quantum chemistry algorithms, it should usually be the case that processing cycles are not the resource type that limits execution speed.
I am curious, do you mean the data movement between MPI processes? So far, we have only used parallelism within a single host, so no networking required. Data movement ought to be extremely fast within a single host, unless something is misconfigured or buggy. So I am surprised that would be the rate-limiting step.
Or do you mean RAM bandwidth (or disk bandwidth) under data movement?
Thanks,
Tibor