--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/c50fd9bf-b0f1-4ebf-8fc8-67cb4dcf6de6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You can configure the scrape timeout. It defaults to 10s iirc. At SoundCloud we used to have some slow jmx scrapes which took upwards of 30s and Prometheus didn't care too much. I don't expect this to be an issue. Just make sure to have timeouts in place when you execute the scripts.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/fcb38fa6-0d8d-4273-98e3-cd7f55c132fb%40googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/fcb38fa6-0d8d-4273-98e3-cd7f55c132fb%40googlegroups.com.
One problem: whilst I could keep the plugin status as a value 0/1/2/3, and any perfdata metrics if they are generated [^1], I would have to throw away the plugin text output.I have a recent real world example to illustrate this.Last night, it appears that the OCSP responder for a well-known commercial certificate authority went down. Since we run check_ssl_certificate for each server, I got a zillion separate nagios alerts containing a critical status for each certificate individually; and a couple of hours later, a zillion resolved mails.If I had routed this through prometheus' alert manager, these alerts could have all been grouped into a single mail. Yay!!However the plugin output contained text information such as:SSL_CERT CRITICAL www.example.com: Responder Error: unauthorized (6)
SSL_CERT CRITICAL uktest1.example.com: Response Verify Failureand having that information available in the alert was invaluable in finding the problem.The normal plugin output is variable, e.g.SSL_CERT OK - X.509 certificate 'www.example.com' from 'XXX Authority' valid until Jun 29 10:01:02 2019 GMT (expires in 653 days)so is not suitable as a prometheus label.So how could I work around this? I was thinking:- stash the plugin text output somewhere outside of prometheus, e.g. consul- add this text as a synthesized label when passing alerts onto alertmanager (e.g. <alert_relabel_configs> with __meta_consul_tags)
Another approach might be a proxy between prometheus and alertmanager which adds extra labels from a lookup.Any other ideas gratefully received. Maybe this is just the wrong tool for the job, but I'm not aware of anything else which has the alert-grouping capability of alertmanager.Regards,Brian.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/082dd77c-876f-4a39-b1c1-904fdc11c761%40googlegroups.com.