Dear Dirac experts,
I have a very large DC-CCSD(T) calculation (224 correlated electrons!) that fails just after the MP2 step. I've included most of the last part of the output below. My first guess is that we haven't given it enough memory: the output says it requires about 115 GB of RAM, and the job was allocated 120 GB. Does this look familiar to anyone? This version was compiled with 64-bit integers (GNU compilers) and OpenMPI (64-bit) on the Cray at NERSC. The test cases pass, and I've run smaller CCSD(T) jobs successfully.
regards,
-Kirk
MP2 results
SCF energy : -109682.222279765090207
MP2 correlation energy : -3.981023068481005
Total MP2 energy : -109686.203302833571797
T1 diagnostic : 0.000003526477619
CCSD options :
Maximum number of iterations : 30
Maximum size of DIIS space : 8
Convergence criterium : 0.1E-11
DIRAC pam run in /global/cscratch1/sd/rthomas/malli/Rollin.input
==== below this line is the stderr stream ====
Program received signal SIGBUS: Access to an undefined portion of a memory object.
Backtrace for this error:
#0 0x155553c563df in ???
#1 0x1023229 in readdi_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/gp/io.F:47
#2 0x122ae03 in waio_intio_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/waio.F:305
#3 0x122c2b2 in waio_intio_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/waio.F:227
#4 0x122c2b2 in master.0.rread
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/waio.F:415
#5 0x122c2b2 in rread_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/waio.F:398
#6 0x120c2d6 in getvovo_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/ccgetv.F:390
#7 0x11e26a9 in amplitude_equation_t1_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/cceqn_t1_amplitudes.F:156
#8 0x11e0b83 in cceqn_driver_amplitudes_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/cceqn_driver_amplitudes.F:107
#9 0x6bd689 in ccener_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/ccdriv.F:2575
#10 0x6d573b in ccmain_
at /global/cscratch1/sd/rthomas/malli/Dirac64/src/relccsd/ccdriv.F:904
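The memory figures quoted in the message above (about 115 GB required, 120 GB allocated) can be put side by side with a small calculation. This is only a sketch of the arithmetic: the function name `memory_headroom` is hypothetical, the two numbers are the ones reported in the message, and nothing here diagnoses the SIGBUS itself.

```python
def memory_headroom(required_gb: float, allocated_gb: float) -> tuple[float, float]:
    """Return (headroom in GB, headroom as a fraction of the allocation).

    Hypothetical helper for illustration; not part of DIRAC or pam.
    """
    if allocated_gb <= 0:
        raise ValueError("allocated_gb must be positive")
    headroom_gb = allocated_gb - required_gb
    return headroom_gb, headroom_gb / allocated_gb


# Figures taken from the message: ~115 GB reported as required, 120 GB allocated.
headroom_gb, fraction = memory_headroom(required_gb=115.0, allocated_gb=120.0)
print(f"headroom: {headroom_gb:.1f} GB ({fraction:.1%} of the allocation)")
```

With these inputs the margin is 5 GB, roughly 4% of the allocation. Anything else resident on the node (operating system, MPI buffers, I/O caches) has to fit into that margin, which is an assumption about the node and not something the output states.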
--
You received this message because you are subscribed to the Google Groups "dirac-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dirac-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dirac-users/792F4197-DA43-41C6-BB32-5E212AAAE791%40wsu.edu.
On 20 Oct 2022, at 11:07, 'Visscher, L. (Luuk)' via dirac-users <dirac...@googlegroups.com> wrote:
Dear Kirk,
Dear Luuk,
Thanks for your reply. I also guessed that it perhaps just needed a bit more memory. Unfortunately, 120 GB per node is all we can request at NERSC, so it seems this job is simply too large to run there. Building DIRAC with ExaCorr is on my to-do list at NERSC, since their new machine relies heavily on GPUs.
best regards,
-Kirk
From: "'Visscher, L. (Luuk)' via dirac-users" <dirac...@googlegroups.com>
Reply-To: "dirac...@googlegroups.com" <dirac...@googlegroups.com>
Date: Thursday, October 20, 2022 at 2:07 AM
To: "dirac...@googlegroups.com" <dirac...@googlegroups.com>
Subject: Re: [dirac-users] crash in relccsd