Hi Bob,
Well, I can say that random 500s are a pretty rare occurrence in current disco builds, and really points to something being wrong in the installation. I would:
- figure out what version of disco is running
- if getting 500s on the console, try `ddfs` and `disco` commands from command line and see if errors occur there as well
- capture master logs during 500s
- determine if there are any system resource issues (disk space, cpu, network issues)
- determine if possible to upgrade to latest version
- inspect master logs to find tracebacks to help diagnose root issues
- inspect/capture logs during GC to determine if that is breaking
- do detailed memory inspection during GC to determine if an OOM issue exists on any of the nodes
please let us know if you can get some more information!
tim