Hi,
We're running a redundant Prometheus (2.3.1) / Alertmanager (0.15) setup. If we restart (not reload) Prometheus, alerts that are configured with the "for" option drop back into the "pending" state. Alertmanager then treats those alerts as resolved and sends out resolved webhooks. Once the "for" duration has elapsed, the alerts fire again and Alertmanager sends the firing notifications again too.
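To illustrate what I think is happening, here is a toy model in Python. It is only a sketch under my own assumptions: the class and method names are made up, it is not Prometheus code, and it assumes the "for" timer is held purely in memory and is lost on restart.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ToyRuleEvaluator:
    """Hypothetical model of one alerting rule with a 'for' duration."""

    for_seconds: int
    # Time at which the condition first became true. Assumed to live in
    # memory only, so it does not survive a restart.
    active_since: Optional[int] = None

    def evaluate(self, now: int, condition_true: bool) -> str:
        """Return the alert state at time `now` (seconds)."""
        if not condition_true:
            self.active_since = None
            return "inactive"
        if self.active_since is None:
            self.active_since = now
        if now - self.active_since >= self.for_seconds:
            return "firing"
        return "pending"

    def restart(self) -> None:
        """Model a process restart: the in-memory timer is forgotten."""
        self.active_since = None


def simulate() -> List[str]:
    """Condition stays true throughout; only the restart changes the state."""
    rule = ToyRuleEvaluator(for_seconds=300)
    states = [
        rule.evaluate(now=0, condition_true=True),
        rule.evaluate(now=300, condition_true=True),
    ]
    rule.restart()
    states.append(rule.evaluate(now=360, condition_true=True))
    states.append(rule.evaluate(now=660, condition_true=True))
    return states


if __name__ == "__main__":
    print(simulate())
```

In this model the underlying condition never clears, yet the alert leaves the firing state for a full "for" period after the restart, which matches the resolved/firing flapping we see.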
As we have quite a few alerts, this causes pain and grief for our operations teams. I'm wondering how we can prevent this (I'm assuming by staggering the restarts, so that one instance has stabilized before the other is restarted), and I'd like to hear what other people do about their alerts during restarts.
Thanks,
Pieter