RabbitMQ 3.9.4 Erlang 24.0.5 Single Node Memory explosion

60 views
Skip to first unread message

b...@solute.de

unread,
Aug 25, 2021, 6:48:36 AM8/25/21
to rabbitmq-users
Hey Team,

we discover since some weeks now a suspicious behavior in rabbitmq. Somtimes, we can't find a direct trigger, one of our 3 Nodes explode in Memory Usage and didn't recover from these state.

After a restart of the service `sudo systemctl restart rabbitmq-server.service` the server is fully responsive and behaves normaly for around a week. We don't think this problem lay in the hardware because it didn't happen on one of the nodes all the time. It happens on randomly any of the 3 nodes in sometime.

One exception to "any of the 3 nodes" could we find. If the node don''t have any mirror queue slave operations it seems the node doesn't raise in memory so far.

Following data could be collected:
Screenshot 2021-08-25 at 11-45-38 RabbitMQ Management.pngScreenshot 2021-08-25 at 11-44-50 RabbitMQ Management.pngScreenshot 2021-08-25 at 11-44-28 RabbitMQ Management.pngScreenshot 2021-08-25 at 11-43-48 RabbitMQ Management.pngScreenshot 2021-08-25 at 11-42-24 RabbitMQ Management.png

Memory Usage:
Screenshot 2021-08-25 at 12-05-03 AuStoreQueues NG - Grafana.png

One thing to `ha.delete_offer.dead` this queue is consumed every few hours and republished to `ha.delete_offer`. On `ha.delete_offer` is a dead-letter-exchange which sends the message back to `ha.delete_offer.dead` a lock mechanism ensures this loop don't happen to the same time. Every message which looped for some few(3-4) days will be ignored and not looped again.

Screenshot 2021-08-25 at 12-14-36 AuStoreQueues NG - Grafana.png

Could you help here to find out whats going on in our cluster?

Sincerely,

Bjarne

b...@solute.de

unread,
Aug 25, 2021, 6:55:47 AM8/25/21
to rabbitmq-users
Hey,

I have forgotten to add some logs, you find them as attachment.

Sincerely,

Bjarne
2021-08-25_austorequeues01f_logs.log.gz
Reply all
Reply to author
Forward
0 new messages