Listing the queues from the command line or viewing them in the web admin interface crashes RabbitMQ. I haven't found any reports of similar problems after Googling for a few hours, so here goes...
The problem was first noticed this Monday; it was not present on Friday. The server admins have patched and upgraded both the VM host and Windows. I do not know whether this has anything to do with the problem, but I thought it was worth mentioning. I am responsible for RabbitMQ and the server admins are responsible for Windows Server, so I'm on my own.
I have a 3-node cluster, and I have crashed 2 of the nodes. I haven't tried it on the last node.
The nodes run RabbitMQ 3.9.6 on Erlang 24.0.
I upgraded one of the nodes to 3.9.8 in case that would help, but no joy.
The nodes are running on Windows Server 2019 (1809). There is plenty of RAM and disk space available.
I can crash RabbitMQ by opening the Queues tab and then clicking "Total" to sort the queues. The UI then shows an error message:
"Error: could not connect to server since 2021-11-08 10:50:54."
The log file then contains two notices:
2021-11-08 10:51:00.267000+01:00 [notice] <0.229.0> Logging: configured log handlers are now ACTIVE
2021-11-08 10:51:03.595000+01:00 [notice] <0.44.0> Application mnesia exited with reason: stopped
The rest of the log is related to RabbitMQ starting up. I haven't compared the startup log of today to Friday (yet).
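For that comparison, something like the following might do. It is only a minimal sketch: it assumes both logs use the line format quoted above (timestamp, level, Erlang pid, message), strips the timestamp and replaces the pid so that only real differences in the startup sequence show up, and the function names are my own.

```python
import difflib
import re

# Leading timestamp in the format seen in the quoted log lines,
# e.g. "2021-11-08 10:51:00.267000+01:00 ".
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+[+-]\d{2}:\d{2} ")
# Erlang process identifiers such as <0.229.0>.
PID = re.compile(r"<\d+\.\d+\.\d+>")


def normalize(line):
    """Drop the timestamp and mask the pid so two runs become comparable."""
    line = TIMESTAMP.sub("", line.rstrip("\n"))
    return PID.sub("<pid>", line)


def diff_startup_logs(old_lines, new_lines):
    """Return a unified diff of two startup logs after normalization."""
    old = [normalize(line) for line in old_lines]
    new = [normalize(line) for line in new_lines]
    return list(difflib.unified_diff(old, new, "friday", "today", lineterm=""))
```

To use it, read the startup portion of each log into a list of lines (for example with `open(path).readlines()`, where `path` is wherever the two log files happen to live) and print the result of `diff_startup_logs` line by line. An empty result means the two startups logged the same messages.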
There are a few warnings; I'm not sure whether they're relevant...
2021-11-08 10:51:04.285000+01:00 [warning] <0.229.0> Feature flags: the previous instance of this node must have failed to write the `feature_flags` file at `c:/Users/XXX/AppData/Roaming/RabbitMQ/db/rabbit@NORENAPP001V-feature_flags`:
2021-11-08 10:51:04.285000+01:00 [warning] <0.229.0> Feature flags: - list of previously disabled feature flags now marked as such: [empty_basic_get_metric]
There is no Erlang crash dump file to be found, at least not on the C: drive.
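In case it is useful for checking other drives, here is a rough sketch of such a search. It assumes the dump has the default name `erl_crash.dump` (the name and location can be overridden with the `ERL_CRASH_DUMP` environment variable, in which case this would miss it), and the function name is my own.

```python
import os


def find_crash_dumps(root, name="erl_crash.dump"):
    """Walk `root` and return the paths of all files called `name`.

    The comparison ignores case, since Windows file names are
    case-insensitive. Directories that cannot be read are skipped
    silently, which is the default behaviour of os.walk.
    """
    wanted = name.lower()
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            if filename.lower() == wanted:
                hits.append(os.path.join(dirpath, filename))
    return hits
```

Calling `find_crash_dumps("D:\\")` would search a D: drive, for example. An empty list only means no file with that name was found in the directories the current user is allowed to read.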
I really hope someone can help me with this!
Cheers
Henning