When running ZMQ on my cluster, it finishes running the segments in the first iteration, but before it's about to start the next iteration, I get the following error:
-- ERROR [w_run] -- Traceback (most recent call last):
File "/projects/bbpa/westpa_source/src/westpa/cli/core/w_run.py", line 62, in run_simulation
sim_manager.run()
File "/projects/bbpa/westpa_source/src/westpa/core/sim_manager.py", line 777, in run
self.prepare_iteration()
File "/projects/bbpa/westpa_source/src/westpa/core/binning/mab_manager.py", line 177, in prepare_it
eration
self.work_manager.submit(wm_ops.prep_iter, args=(self.n_iter, segments)).get_result()
File "/projects/bbpa/westpa_source/src/westpa/work_managers/core.py", line 343, in get_result
raise self._exception.with_traceback(self._traceback)
westpa.work_managers.zeromq.core.ZMQWorkerMissing: no workers available
What could be the cause? I'm trying to debug it, but it's hard given I didn't write the code. Darian, you also seem to have encountered the error here:
Any idea how to fix it? I've tried re-running it multiple times, to no avail.
Thanks,
Razvan