My question was related to a weird performance of NWChem running in the cluster with muti-nodes.
If I used the slurm script
below.to run nwchem in a cluster with different node numbers, I got different outcomes.
******************************************************************************************
#!/bin/bash
#
#SBATCH --job-name=jobst1
#SBATCH --output=jobst1-%j.out
#SBATCH --error=jobst1-%j.err
#SBATCH --nodes=3
#SBATCH --ntasks-per-node=10
#SBATCH --partition=regular
module purge
mpirun nwchem TiOH6-2-m06-dzvp2 > TiOH6-2-m06-dzvp2.out
exit
**********************************************************************************************
Although the calculations appear to work fine, the outcomes are unstable. If I use multi-node, I may get different results. I am wondering if creating and loading nwchem modules are important for solving this problem.
The following is an input example.
******************************************************************************************************
title "Ti2O4 model ECP "
start TiOH6-2-m06-dzvp2
echo
charge -2
geometry units angstroms
symmetry C1
Ti 7.197215000000 12.356000000000 10.655614000000
O 5.335286000000 12.157686000000 10.716351000000
H 5.118759000000 11.434242000000 10.347667000000
O 7.075484000000 12.897193000000 8.891045000000
H 6.451182000000 12.480796000000 8.512770000000
O 7.027445000000 14.110058000000 11.206510000000
H 6.403972000000 14.478761000000 10.781351000000
O 7.366985000000 10.601942000000 10.104719000000
H 7.990458000000 10.233239000000 10.529878000000
O 7.318946000000 11.814807000000 12.420184000000
H 7.943248000000 12.231204000000 12.798459000000
O 9.059144000000 12.554314000000 10.594877000000
H 9.275671000000 13.277758000000 10.963562000000
zcoord
cvr_scaling 1.2
# bond 1 2
end
end
dft
odft
mult 1
vectors input atomic output TiOH6-2-m06-dzvp2.movecs
XC m06
grid xfine
iterations 200
mulliken
end
driver
maxiter 80
end
basis "ao basis" cartesian print
Ti library "DZVP2 (DFT Orbital)"
H library "DZVP2 (DFT Orbital)"
O library "DZVP2 (DFT Orbital)"
END
task dft optimize
******************************************************************************************************************************
If I used one node (or longin node, or a workstation), the final results (toward an intermolecular hydrogen bond species):
----------------------
Optimization converged
----------------------
Step Energy Delta E Gmax Grms Xrms Xmax Walltime
---- ---------------- -------- -------- -------- -------- -------- --------
@ 36 -1304.38794707 2.7D-06 0.00003 0.00001 0.00058 0.00131 2289.6
ok ok ok ok
If I used multi-node (for example 3 nodes with 10 cores/node), the final results (toward an intramolecular hydrogen bond species):
----------------------
Optimization converged
----------------------
Step Energy Delta E Gmax Grms Xrms Xmax Walltime
---- ---------------- -------- -------- -------- -------- -------- --------
@ 66 -1304.39789550 -8.9D-08 0.00003 0.00000 0.00051 0.00169 4995.3
ok ok ok ok