Hi, AllI'm trying to find a solution to alert server reboot event with node_exporter, the "up" or "node_boot_time" metrics seems hard to achieve this, any ideas? Usually reboot a server can be done within one minute.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/61560c30-20d9-497b-b7bf-e8d0286c7012%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You could alert on changes in node_boot_time.ALERT NodeRebootedIF changes(node_boot_time[1h]) > 0You would also want to combine that with a down alert like this:ALERT NodeDownIF up{job="node"} == 0FOR 5mThis way you get both failure modes, a totally down node, and a node that rebooted.
On Tue, May 2, 2017 at 10:53 AM, Yong Zhang <hisca...@gmail.com> wrote:
Hi, AllI'm trying to find a solution to alert server reboot event with node_exporter, the "up" or "node_boot_time" metrics seems hard to achieve this, any ideas? Usually reboot a server can be done within one minute.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/65ef14ff-7a92-40a6-b7ca-268260b3e307%40googlegroups.com.
You could alert on changes in node_boot_time.ALERT NodeRebootedIF changes(node_boot_time[1h]) > 0You would also want to combine that with a down alert like this:ALERT NodeDownIF up{job="node"} == 0FOR 5mThis way you get both failure modes, a totally down node, and a node that rebooted.
On Tue, May 2, 2017 at 10:53 AM, Yong Zhang <hisca...@gmail.com> wrote:
Hi, AllI'm trying to find a solution to alert server reboot event with node_exporter, the "up" or "node_boot_time" metrics seems hard to achieve this, any ideas? Usually reboot a server can be done within one minute.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
You could alert on changes in node_boot_time.ALERT NodeRebootedIF changes(node_boot_time[1h]) > 0You would also want to combine that with a down alert like this:ALERT NodeDownIF up{job="node"} == 0FOR 5mThis way you get both failure modes, a totally down node, and a node that rebooted.
On Tue, May 2, 2017 at 10:53 AM, Yong Zhang <hisca...@gmail.com> wrote:
Hi, AllI'm trying to find a solution to alert server reboot event with node_exporter, the "up" or "node_boot_time" metrics seems hard to achieve this, any ideas? Usually reboot a server can be done within one minute.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/02540380-b387-439a-a3a3-e3cc0836913a%40googlegroups.com.
* Cloud provider? Tencent Cloud of China
* Custom VM setup? Cloud image (Ubuntu 16.04)* Do you run an NTP client? (Yes, and time sync from default time servers, is this the cause?)
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/5ef4d4a9-d20f-4475-b1b5-890a34ba13a7%40googlegroups.com.