Shot in the dark, but check your ulimits in both situations. This smells like a stack overflow problem.
A hack I've used in the past is to put a sleep 30 at the top of my test and then quickly attach GDB to the test process to catch the failure in the act.
Depending on how bazel is setup, you are running under one of the sandboxes. They provide different levels of sandboxing. --sandbox_debug will print out a ton more details about how that is being run, and should give you enough info to actually reproduce the failure.
Good luck!
Austin