Generally speaking, the way to do HA is to run two or more identical Prometheus servers and have each of them scrape every target, including each other.
This way you still get an `up` alert from the surviving server if either one goes down.
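
As a rough sketch of what such an `up` alert could look like, assuming the Prometheus servers are scraped under a job named `prometheus` (the job name, group name, alert name, and the 5m duration are all placeholders, not values from a real setup):

```yaml
# rules/meta-monitoring.yml (hypothetical file name)
groups:
  - name: meta-monitoring
    rules:
      - alert: PrometheusDown
        # up is 0 whenever the last scrape of a target failed.
        expr: up{job="prometheus"} == 0
        # Wait a few minutes so a single missed scrape does not page anyone.
        for: 5m
        labels:
          severity: page
        annotations:
          summary: "Prometheus server {{ $labels.instance }} is not being scraped successfully"
```

Since every server loads the same rule and scrapes its peers, whichever server is still running fires the alert.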
For my machines, I configure each server to scrape its local targets plus every other Prometheus server, including the ones in other datacenters.
That gives me multi-way meta-monitoring, with HA in each datacenter.
Finally, I run at least two Alertmanagers in each datacenter, all joined into a single large mesh. Each Prometheus discovers all of the Alertmanagers so it can send alerts to every one of them.
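
For illustration, a minimal `prometheus.yml` along those lines might look like the following. This assumes two datacenters with two servers each and uses static targets for brevity; all hostnames and job names are made up, and a real deployment would more likely use service discovery than hard-coded lists.

```yaml
# Sketch for one Prometheus server in dc1. Hostnames are hypothetical.
rule_files:
  - rules/*.yml

scrape_configs:
  # Meta-monitoring: every Prometheus server, in every datacenter.
  - job_name: prometheus
    static_configs:
      - targets:
          - prom-a.dc1.example.com:9090
          - prom-b.dc1.example.com:9090
          - prom-a.dc2.example.com:9090
          - prom-b.dc2.example.com:9090

  # Local targets only: each server scrapes the machines in its own datacenter.
  - job_name: node
    static_configs:
      - targets:
          - host1.dc1.example.com:9100
          - host2.dc1.example.com:9100

alerting:
  alertmanagers:
    # Every Alertmanager in the mesh, not just the local ones.
    - static_configs:
        - targets:
            - am-a.dc1.example.com:9093
            - am-b.dc1.example.com:9093
            - am-a.dc2.example.com:9093
            - am-b.dc2.example.com:9093
```

The second server in the same datacenter would carry an identical file, which is what makes the pair redundant. Both will fire the same alerts, and the clustered Alertmanagers deduplicate the resulting notifications.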
Hope that helps.
Nicholas