Hi All,
We have many applications running in production and sometimes we see below tow serious issues which lead to loss of business (sometimes).
- Consumer gets dropped from Rabbit queue.
- Consumer stops processing messages from the queue when there are millions of messages in the queue.
Description:
1. In the first case, this happens suddenly and then our support team needs to get the notification about the transaction stuck then we re-bounce the TC instance and then again our war (interface) comes to normal situation and starts processing the message.
I have done some analysis and seen some waiting threads into thread dump between rabbit and other resource and one more issue noticed was memory unavailability.
Though we have been working to find root cause of this serious issue but still we don't have accurate information and no perfect solution available yet.
2. - Consumer component stops processing the messages from the queue.
In this case we have a transaction queue and a dead letter queue, consume polls messages from transaction queue and if the message is corrupted (or consumer did not accept the message after validation) then message goes to dead letter queue and based on policy message stays there for 30 mins and then message is flushed out from DL queue.
We are looking for the solution for these two serious problems if anyone have noticed?
--
Regards,
Sunil