x yang
unread,Jun 3, 2024, 9:30:56 PMJun 3Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to rabbitmq-users
I am experiencing an issue with my RabbitMQ cluster, which consists of two nodes running version 3.9.7 in autoheal mode.
When a network partition occurs and the network is restored around the nettick_timeout , messages in certain queues start to backlog and are no longer delivered to consumers. Packet capture analysis shows that the corresponding connections are only sending heartbeats to each other.
More information:
1. On the web management page, the queues show non-empty consumers. However, using the rabbitmqctl list_queues consumers command reveals that the queues actually have no consumers.
2. The affected queues share a common characteristic: they use both mirroring and durability attributes.
3. RabbitMQ logs indicate that autoheal triggered a node restart, and there were mnesia database inconsistency warnings before the restart.
This issue is quite critical because there are no clear symptoms indicating cluster or queue anomalies. Has anyone else encountered a similar issue, or is there a known solution without upgrading the RabbitMQ version?
Any insights or solutions are greatly appreciated.
Thank you!