resendDelay and AlertManager failures

11 views
Skip to first unread message

Callum Jones

unread,
Jul 15, 2021, 8:41:34 PM7/15/21
to Prometheus Users
Hi there,

I was wondering if there is any documentation on how Prometheus Rulers handle the complete loss or unavailability of AlertManager nodes (basically when the notify function is unable to get a successful response from any AM node).

I see in alerting.go that there is the notion of resendDelay, my read from this is that the Ruler will keep notifying about an active alert (regardless of success) every <resendDelay> until it expires which is 4x <resendDelay> and then it would wait until the next evaluation period?

Thanks!
Callum
Reply all
Reply to author
Forward
0 new messages