Can you attach one of your output log files? I've been running into I think the same, or a very similar problem, also running the NaCl tutorial with gromacs, but in my case there's always just one remaining worker on each node which times out and does not complete its trajectory.
As a somewhat unsatisfying workaround, see if you can just restart the job and have it complete the iteration it crashed during. In my case, even though each iteration crashes with one worker per node incomplete, re-submitting the job to the scheduler sees WESTPA finishing those, then continuing on to the next iteration, and crashing in the same way. By chaining N jobs for N iterations, I'm able to run to completion.
Of course, this hack is neither ideal nor an actual solution, and I'm spending some time debugging. I'll update with the results of that.
As an aside, in my case I also saw (in the single test run I've completed so far) that at around iteration 66, it actually stopped crashing, and ran successfully to completion. I'd be interested to know if, with the above workaround, you see the same thing.
-John Russo