Hi all,
I had run OpenMC using 2 computers, a local
computer with AMD Ryzen 9 3900X (12 cores, 24 threads) and a remote virtual
machine with Intel(R) Xeon(R) Silver 4114 (16 cores, 16 threads). The
implemention of MPI is OpenMPI. It took 50 seconds for a simple bundle assembly.
The command line I used is “mpirun -use-hwthread-cpus --map-by slot:PE=8 --hostfile /home/yuejin/Cloud/mpi_config_file openmc”.
In the host file, number of slots of local node and remote node are 3 and 2 respectively.
If I run OpenMC on local computer alone with 1 MPI process * 24 OpenMP threads, total time elapsed is 80 s. So the timings are resonable.
I’m new to MPI. I wonder if there is a better way to write my command line? Is there anything that I missed in arguments which can affect the performance? Maybe there is a way to set (1 MPI * 24 threads + 1 MPI * 16 threads) instead of (3+2)MPI*8 threads that I set? Which one is better?
Secondly, I didn’t find how to add MPI arguments in openmc.depletion.Integrator. Can anyone show me an example how to run depletion in parrellel using two computers?
Regards,
Yue.