On 11/15/23 04:09, Abbas Ballout wrote:
>
> If I run mpirun -np2 the assemble times are 25.8 seconds and the MUMPS
> solve times are 51.7 seconds.
> If I run mpirun --cpu-set 0-1 -np 2, the assemble times are 26 seconds
> (unchanged) but the solve time are at 94.9 seconds!
> Is this normal and expected?
What happens if you use `--cpu-set 0,2` or other combinations?
Cores have memory channels that they often share with other cores. I
wonder whether, for example, cores 0 and 1 share a memory channel and
consequently step on each other's toes during the direct solve (which is
memory bound). Or they share SIMD execution units. It would be
interesting to see what happens if you choose other sets of cores to use.
Best
W.