Here's the RabbitMQ Cluster Details
Node Count: 3
Version: 3.8.1
Infrastructure: AWS EC2 (each node in different availability zone)
We occasionally notice one of the rabbitmq nodes crashing and restarting, which requires us to synchronize the queue manually.
Further to that, after we synchronize all the queues, we also need to restart many (well, not all but some) of our micro services (spring boot that has rabbitmq queue listeners) so that listeners make the connections with queue and resume processing messages. (Note: We generally notice some of the queues have 0 consumers so we pick those micro services for restart.)
This issue happens every once in 2 weeks. Attached are some of the error logs at the time of crash.
Any help in further troubleshooting this is appreciated.
Please let me know, if any further details are needed.