The attempted run was executed on 8 compute nodes with Intel Xeon E5-2660 v3 CPUs (each node has 40 MPI processing cores and 256 GB of RAM), and the interconnect between the nodes is 40 Gb Ethernet. We are running the latest version of Open MPI, and other MPI-based programs run successfully. Looking at the logs, I see nothing to indicate that the job fails (other than it exceeding the allotted walltime), but I also see nothing to indicate that the job is progressing.
What output should I expect to see that would indicate whether the job is running successfully or failing?
Thank you.