Hi,
I'm running some small organic molecule calculations on a desktop system.
CPU is intel core i9 14900k cpu, 8 performance cores (with hyperthreading) and 16 efficiency cores (no hyperthreading)
OS ubuntu 24.4lts,
NWchem version: nwchem-openmpi/noble,now 7.2.2-1build3 amd64 , installed from the standard Ubuntu repository with apt
I get strange results with different numbers of processors, and am curious whether this is expected. Typical molecules are a few tens of second row elements.
With strychnine
(C21H22N2O2),
specifically, using input as follows:
memory stack 1500 mb heap 1500 mb global 3000 mb
echo
start molecule
title "strichnine_nmr.nw"
echo
start
charge 0
<coordinates>
basis
* library 6-311g*
end
cosmo
do_cosmo_smd true
solvent chcl3
end
dft
xc mpw91 0.75 HFexch 0.25 perdew91
mult 1
end
task dft energy
property
shielding
end
task dft property
If I run with nwchem.openmpi <inputfile> , the timings are:
Total times cpu: 16333.7s wall: 6454.8s
(you can see during the calculation, that there is one process active but the cpu usage goes above 100% ie it's really using more than one core)
if I use /usr/bin/mpirun.openmpi -np 2 nwchem.openmpi <inputfile>
then there are two processes, again sometimes using >100%. Timings are:
Total times cpu: 4034.3s wall: 3866.3s
With -np 4:
Total times cpu: 6220.2s wall: 3602.7s
With -np 8
Total times cpu: 3407.0s wall: 2888.1s
So the time is substantially reduced with two processes but not so much after that.
However, with smaller molecules, more processes can take longer. With coumarin (C9H6O2) and the same level of theory as above, I get:
-np 1: Total times cpu: 3407.0s wall: 2888.1s
-np 2:Total times cpu: 231.0s wall: 205.2s
-np 4: Total times cpu: 1347.3s wall: 593.8s
-np 8: Total times cpu: 346.1s wall: 244.2s
So 2 processes is optimal and more may be substantially worse.
I get similar results with energy calculations.
Is this expected behaviour for systems of this sort of size? Is there anything I'm doing wrong with running the calculations? Any thoughts gratefully received!
Output files are attached for reference
Pete