Good morning,
I want to create a condition for an alert. My monitoring produces a lot of ups and downs when something is wrong, so I am receiving lots of mails: when something fails, it tries to reboot, comes back up, and then goes off again for a while.
My current alert looks like this:
- alert: smarttools_unavailable
  expr: probe_http_status_code{job="smarttools_urls"} <= 199 OR probe_http_status_code{job="smarttools_urls"} >= 300
  for: 5m
  annotations:
    title: SmartTools unavailable
    summary: HTTP failure {{ $labels.platform }}
    description: "HTTP status = {{ $value }}"
Is there any way to specify that the alert must stay off for 60 minutes or so before it triggers again?
Thanks!