########## START ITERATION NO. 2 ########## Wed Feb 20 14:57:27 2013
* GETGAB: label "GABAO1XX" not found; calling GABGEN.
SCR scr.thr. Step1 Step2 Coulomb Exchange WALL-time
QM-QM nuclear repulsion energy : 9.055003146800
Application 873572 exit codes: 134
Application 873572 resources: utime ~2s, stime ~0s
==== below this line is the stderr stream ====
Rank 1 [Wed Feb 20 14:57:27 2013] [c3-0c2s12n2] Fatal error in PMPI_Bcast: Other MPI error, error stack:
PMPI_Bcast(1682)......: MPI_Bcast(buf=0x2aaab2596968, count=1, dtype=0x4c000829, root=0, MPI_COMM_WORLD) failed
MPIR_Bcast_impl(1488).:
MPIR_Bcast_intra(1247):
MPIR_SMP_Bcast(1156)..: Failure during collective
forrtl: error (76): Abort trap signal
Image PC Routine Line Source
libc.so.6 00002AAAB0562B55 Unknown Unknown Unknown
libc.so.6 00002AAAB0564131 Unknown Unknown Unknown
libmpich_intel.so 00002AAAAEC75F36 Unknown Unknown Unknown
libmpich_intel.so 00002AAAAEC4B2ED Unknown Unknown Unknown
libmpich_intel.so 00002AAAAEC4B456 Unknown Unknown Unknown
libmpich_intel.so 00002AAAAECDCD90 Unknown Unknown Unknown
libmpich_intel.so 00002AAAAEC55F9A Unknown Unknown Unknown
dirac.x 00000000006540A6 interface_to_mpi_ 444 interface_to_mpi.F90
dirac.x 0000000000A2B188 hernod_ 1391 herpar.F
dirac.x 00000000004AAB67 dirnod_ 1141 dirac.F
dirac.x 000000000047F5BC MAIN__ 157 main.F90
dirac.x 000000000047EEEC Unknown Unknown Unknown
libc.so.6 00002AAAB054EC36 Unknown Unknown Unknown
dirac.x 000000000047EDE9 Unknown Unknown Unknown
_pmiu_daemon(SIGCHLD): [NID 00754] [c3-0c2s12n2] [Wed Feb 20 14:57:27 2013] PE RANK 1 exit signal Aborted
[NID 00754] 2013-02-20 14:57:27 Apid 873572: initiated application termination
--
You received this message because you are subscribed to the Google Groups "dirac-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dirac-users...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
-- Stefan Knecht Laboratory of Physical Chemistry - The Reiher Group - ETH Zuerich HCIG 230 Wolfgang-Pauli-Str. 10 CH-8093 Zuerich Schweiz phone: +41 44 633 22 19 fax: +41 44 633 15 94 email: stefan...@phys.chem.ethz.ch web: http://www.theochem.uni-duesseldorf.de/users/stefan/index.htm http://www.reiher.ethz.ch/people/knechste
Dear Stefan,
I have tried with varying number of processes, in one node and over multiple nodes, with the same effect. The calculation, from which that output was, used just 2 processes, i.e., a minimal parallel test, both in the same node. The pam command was
pam --mb=500 --scratch=${SCRATCH} --basis=${BASISDIR} --inp=bss_rkb+mfsso.spsp.inp --mol=H2O.mol --mpi=2
where the scratch variable points to a shared directory visible to all nodes, and basisdir variable to the Dirac basis directories. The pam script was modified to use 'aprun' instead of 'mpirun', but otherwise it's stock stuff.
-- User set math libraries: -L/opt/intel/composer_xe_2013.1.117/mkl//lib/intel64 -Wl,-rpath=/opt/intel/composer_xe_2013.1.117/mkl//lib/intel64 -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -mkl=sequential
-- mpi.mod matches current compiler, setting -DUSE_MPI_MOD_F90
-- MPI-2 support found
CMake Warning:
Manually-specified variables were not used by the project:
ENABLE_BLAS
ENABLE_LAPACK
FC=ftn CC=cc CXX=CC cmake -DENABLE_MPI=ON -DENABLE_SGI_MPT=OFF -DENABLE_OPENMP=OFF -DENABLE_BLAS=ON -DENABLE_LAPACK=ON -DENABLE_TESTS=OFF -DENABLE_64BIT_INTEGERS=OFF -DEXPLICIT_LIBS="-L/opt/intel/composer_xe_2013.1.117/mkl//lib/intel64 -Wl,-rpath=/opt/intel/composer_xe_2013.1.117/mkl//lib/intel64 -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -mkl=sequential" -DCMAKE_BUILD_TYPE=Release /homeappl/home/juhorouk/appl_sisu/DIRAC-12.3
-- The Fortran compiler identification is Intel
-- The C compiler identification is Intel 13.0.0.20121010
-- The CXX compiler identification is Intel 13.0.0.20121010
-- Check for working Fortran compiler: /opt/cray/craype/1.00/bin/ftn
-- Check for working Fortran compiler: /opt/cray/craype/1.00/bin/ftn -- works
-- Detecting Fortran compiler ABI info
-- Detecting Fortran compiler ABI info - done
-- Checking whether /opt/cray/craype/1.00/bin/ftn supports Fortran 90
-- Checking whether /opt/cray/craype/1.00/bin/ftn supports Fortran 90 -- yes
-- Check for working C compiler: /opt/cray/craype/1.00/bin/cc
-- Check for working C compiler: /opt/cray/craype/1.00/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /opt/cray/craype/1.00/bin/CC
-- Check for working CXX compiler: /opt/cray/craype/1.00/bin/CC -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Found Git: /usr/bin/git
-- Found MPI_C: /opt/cray/craype/1.00/bin/cc
-- Found MPI_CXX: /opt/cray/craype/1.00/bin/CC
-- Found MPI_Fortran: /opt/cray/craype/1.00/bin/ftn
-- Performing Test MPI_COMPATIBLE
-- Performing Test MPI_COMPATIBLE - Success
-- Performing Test MPI_COMPILER_MATCHES
-- Performing Test MPI_COMPILER_MATCHES - Success
-- Performing Test MPI_ITYPE_MATCHES
-- Performing Test MPI_ITYPE_MATCHES - Success
-- Performing Test MPI_2_COMPATIBLE
-- Performing Test MPI_2_COMPATIBLE - Success
-- Configuring done
-- Generating done
-- Build files have been written to: /homeappl/home/juhorouk/appl_sisu/DIRAC-12.3/sisu-intel-mklseq
configure step is done
now you need to compile the sources
Dear Stefan,
I just tried to comment out the line you mentioned, re-ran setup (creating a new build directory) and compiled the program. I still get the same error with the test calculation; should I expect to see something different, even if it's an error (it still points out the same routines/files in the stderr output)?
diff --git a/src/abacus/herpar.F b/src/abacus/herpar.F index f6583f9..c8731eb 100644 --- a/src/abacus/herpar.F +++ b/src/abacus/herpar.F @@ -1390,7 +1390,7 @@ C IF(NFMAT.GT.0) THEN CALL MEMGET('REAL',KFMAT,LFMAT,WORK,KFREE,LFREE) CALL MEMGET('INTE',KIRD,NDMAT,WORK,KFREE,LFREE) - CAll interface_mpi_BCAST(WORK(KIRD),NDMAT,MPARID, + call interface_mpi_bcast_i1_work_f77(WORK(KIRD),NDMAT,MPARID, & global_communicator) ELSE KFMAT = KFREE @@ -1400,12 +1400,12 @@ C KIFC = KFREE ELSEIF (ITYPE.EQ.2) THEN CALL MEMGET('INTE',KIFC,NDMAT,WORK,KFREE,LFREE) - CAll interface_mpi_BCAST(WORK(KIFC),NDMAT,MPARID, - & global_communicator) + call interface_mpi_bcast_i1_work_f77(WORK(KIFC),NDMAT,MPARID, + & global_communicator) ELSE CALL MEMGET('INTE',KIFC,NFMAT,WORK,KFREE,LFREE) - CAll interface_mpi_BCAST(WORK(KIFC),NFMAT,MPARID, - & global_communicator) + call interface_mpi_bcast_i1_work_f77(WORK(KIFC),NFMAT,MPARID, + & global_communicator)