#echo commands to stdout
set -x
export I_MPI_JOB_RESPECT_PROCESS_PLACEMENT=0
#activate dedalus
source /home/nanders4/dedalus/bin/activate
#run mpi program
mpirun -np $SLURM_NTASKS python3 rayleigh_benard.py
#copy output to $HOME
srun -N $SLURM_NNODES --ntasks-per-node=28 \
sh -c 'cp $LOCAL/* /home/nanders4/DedalusExamples/'
==============================================
But when I increase the number of nodes to 2, I get the following error for processes 28-55 (these are the processes run on the second node):
2018-06-19 14:36:46,995 __main__ 28/56 ERROR :: Exception raised, triggering end of main loop.
For the 1-node case, several snapshots folders were output, each coming from different processes, but for the 2-node case, I only got snapshots_s1.
What do I need to do to allow a Dedalus script, say the 3D Rayleigh-Benard example, to use several nodes?
-Nick
--
You received this message because you are subscribed to the Google Groups "Dedalus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dedalus-users+unsubscribe@googlegroups.com.
To post to this group, send email to dedalu...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dedalus-users/70b7dfcd-7d0d-4eb0-88b5-bc8321c4f309%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
What might be helpful is that farther down, after the processes have failed to start, there's this:
Traceback (most recent call last):
  File "rayleigh_benard.py", line 136, in <module>
    solver.step(dt)
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/solvers.py", line 483, in step
    self.timestepper.step(self, dt)
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/timesteppers.py", line 111, in step
    evaluator.evaluate_scheduled(**evaluator_kw)
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/evaluator.py", line 107, in evaluate_scheduled
    self.evaluate_handlers(scheduled_handlers, wall_time=wall_time, sim_time=sim_time, iteration=iteration, **kw)
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/evaluator.py", line 153, in evaluate_handlers
    handler.process(**kw)
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/evaluator.py", line 544, in process
    file = self.get_file()
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/evaluator.py", line 418, in get_file
    self.create_current_file()
  File "/home/nanders4/dedalus/src/dedalus/dedalus/core/evaluator.py", line 458, in create_current_file
    file = h5py.File(str(self.current_path), 'w-')
  File "/home/nanders4/dedalus/lib/python3.6/site-packages/h5py/_hl/files.py", line 312, in __init__
    fid = make_fid(name, mode, userblock_size, fapl, swmr=swmr)
  File "/home/nanders4/dedalus/lib/python3.6/site-packages/h5py/_hl/files.py", line 146, in make_fid
    fid = h5f.create(name, h5f.ACC_EXCL, fapl=fapl, fcpl=fcpl)
  File "h5py/_objects.pyx", line 54, in h5py._objects.with_phil.wrapper
  File "h5py/_objects.pyx", line 55, in h5py._objects.with_phil.wrapper
  File "h5py/h5f.pyx", line 98, in h5py.h5f.create
OSError: Unable to create file (unable to open file: name = '/home/nanders4/DedalusExamples/snapshots/snapshots_s1/snapshots_s1_p45.h5', errno = 2, error message = 'No such file or directory', flags = 15, o_flags = c2)
It's repeated many times with different values for X in ...snapshots_s1_pX.h5, ranging between 28 and 55, but I think this is more a result of the failed processes than the cause.
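For reference, errno 2 on an exclusive create ('w-') means that, from the failing rank's point of view, a directory component of the path could not be found when h5py tried to create the file. One way to tell a directory that is merely slow to appear on the second node from one that is never visible there is to poll for it before writing. The following is only a minimal sketch under that assumption: the helper name wait_for_directory and its default timeout are hypothetical, and it is not part of Dedalus or h5py.

```python
import os
import time


def wait_for_directory(path, timeout=30.0, poll_interval=0.5):
    """Poll until `path` is visible as a directory.

    Hypothetical diagnostic helper (not a Dedalus or h5py function).
    Returns True as soon as the directory is visible to this process,
    or False if it is still missing after `timeout` seconds.
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.isdir(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_interval)
```

Calling this on every rank with the snapshots_s1 directory, just before the first write, would show whether ranks on the second node ever see the directory. If it returns False there, the directory is probably not on storage shared by both nodes.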
...
18-06-19 15:18:41,959 pencil 0/32 INFO :: Building pencil matrix 128/128 (~100%) Elapsed: 6s, Remaining: 0s, Rate: 2.3e+01/s
2018-06-19 15:18:41,963 __main__ 0/32 INFO :: Solver built
2018-06-19 15:18:56,154 __main__ 0/32 INFO :: Initialization time: 19.888969
2018-06-19 15:18:56,154 __main__ 0/32 INFO :: Starting loop
2018-06-19 15:19:03,462 __main__ 30/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 31/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 29/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 28/32 ERROR :: Exception raised, triggering end of main loop.
Traceback (most recent call last):
...as before
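To confirm that the failing ranks all sit on the second node, the ERROR lines can be grouped by node index. This is a small sketch that assumes ranks are placed in consecutive blocks of 28 per node (taken from --ntasks-per-node=28 in the batch script); the function name failing_ranks_by_node is made up for illustration.

```python
import re

# Matches lines such as "... __main__ 30/32 ERROR :: Exception raised, ..."
LOG_LINE = re.compile(r"__main__ (\d+)/(\d+) ERROR ::")


def failing_ranks_by_node(log_text, tasks_per_node=28):
    """Group the ranks that logged an ERROR by node index.

    Assumes block placement: ranks 0..tasks_per_node-1 on node 0,
    the next tasks_per_node ranks on node 1, and so on.
    """
    nodes = {}
    for match in LOG_LINE.finditer(log_text):
        rank = int(match.group(1))
        nodes.setdefault(rank // tasks_per_node, []).append(rank)
    return {node: sorted(ranks) for node, ranks in nodes.items()}


sample = """\
2018-06-19 15:19:03,462 __main__ 30/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 31/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 29/32 ERROR :: Exception raised, triggering end of main loop.
2018-06-19 15:19:03,462 __main__ 28/32 ERROR :: Exception raised, triggering end of main loop.
"""

print(failing_ranks_by_node(sample))  # {1: [28, 29, 30, 31]}
```

With the 32-process log above, every failing rank maps to node index 1 and none to node 0, which matches the pattern seen in the 56-process run.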
That fixed everything, thank you!