Hi again,
I'm tracking down a problem in my monitoring stack.
The problem is that forwarding events to another Riemann instance on the network
seems to fail after the receiving server crashes, and it never recovers.
I made a minimal config that reproduces the problem, run from a docker-compose file.
You can find it here:
There's also a README with instructions.
It boils down to this: if Riemann A sends to Riemann B using forward with a tcp-client, and Riemann B dies and comes back, Riemann A never resumes forwarding, even when using an async-queue.
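
To make the shape of the setup concrete, here is a minimal sketch of what the forwarding side (Riemann A) might look like. It is illustrative only: the host name `riemann-b`, the port, the queue name `:forward`, and the queue sizes are placeholder assumptions, and the actual config is the one in the repository.

```clojure
; riemann.config for Riemann A (sketch, not the exact config from the repo)

; Client pointing at Riemann B. Host and port are assumed placeholders.
(let [downstream (tcp-client :host "riemann-b" :port 5555)]
  (streams
    ; Wrap the forward in an async queue so that a slow or dead
    ; downstream does not block the main stream threads.
    ; Queue name and sizes are assumed values for illustration.
    (async-queue! :forward {:queue-size     1000
                            :core-pool-size 1
                            :max-pool-size  4}
      (forward downstream))))
```

With a config along these lines, the behaviour I see is that events stop arriving at Riemann B once it has been restarted.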
Thanks so much in advance if you can have a look!