Daniel Trüssel
unread,Mar 23, 2020, 5:34:05 AM3/23/20Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Prometheus Users
Hey
This is the alert based on blackbox exporter
'avg_over_time(probe_duration_seconds{env="Prod"}[15m]) > 1'
It is when the performance gets slow, we need to wake up and try to
solve the root cause.
When we restart web server, this alert temporary resolved, but later
fires again.
Is there a hack to send the resolved notification in delay? For example
only close alert if the situation is fine for 2 hours?
I not wish the alert to fire, close, fire, close,..
We see this behavior only with slow performance alerts, others which
check things like HTTP status are fine.
what possible solutions?
kind regards
Daniel