Hi,I have configured alertmanager to send alerts on prometheus. Following is my setup details:1. I have a target "snmp" specified in prometheus.yml. I am shutting the machine in order to receive an alert which I am able to on alertmanager as expected.2. Now I am switching on my machine and expecting a resolved alert for the earlier alert. However, I still see the earlier "critical" alert.Am I doing something wrong here? Do I need to add a rule for resolved serverity too in alert.rules?
Following is my alert.rules file in prometheus:ALERT system_downIF up == 0FOR 1mLABELS { severity="critical" }ANNOTATIONS {summary = "is down",description = "SNMP has been unreachable for more than 1 minute.",}My simple.yml file (I have enabled snmptrapper_webhook as well)global:# The smarthost and SMTP sender used for mail notifications.smtp_smarthost: 'smtp.gmail.com:587'smtp_from: 'tes...@gmail.com'smtp_auth_username: 'xxxx'smtp_auth_password: 'yyy'receivers:- name: 'team-X-mails'email_configs:- to: 'y...@gmail.com'- name: "webhook"webhook_configs:- url: 'http://192.168.0.221:9099'send_resolved: true
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/CAKH5-WFgFL%2BLDaFbCpQZ%2B2-697ApRn55u3Y1DkpjgNFLbWJSbQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
Thanks Brian.In order to reduce notification spam, the alertmanger only sends out a notification for a group of alerts every 5 minutes by default. If you wait 5 minutes you should get it.>>> So is this 5 mins configurable if I wish to receive the notification earlier?
On Thu, Jun 22, 2017 at 12:46 PM, Brian Brazil <brian.brazil@robustperception.io> wrote:
I have the following configuration in my simple.yml, still I am getting resolved trap only after 30 mins. Is this the intended behavior?
No, I have configured the evaluation_interval as 15s in prometheus.yml# my global configglobal:scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.# scrape_timeout is set to the global default (10s).# Attach these labels to any time series or alerts when communicating with# external systems (federation, remote storage, Alertmanager).external_labels:monitor: 'codelab-monitor'
On Fri, Jun 23, 2017 at 12:01 PM, Brian Brazil <brian.brazil@robustperception.io> wrote:
Can someone please help with this? Am I missing the configuration somewhere?