On 16/03/2021 09:04, Tom Liefheid wrote:
> Thanks for your answers,
>
> In my current setup, running prometheus in HA, i have 1 instance who
> can't scrape apps, but the other one can. I want to find out which one
> isn't able to scrape the apps, so i can restart it. i don't see
> anything in the logs that reflect the issues. it would be nice if we
> could 'translate' the output of the /targets page to some kind of
> metric, if that makes sense
All scrapes automatically produce the "up" metric, so a value of 0 would
indicate a failure (as you would see with red sections of the target
page). You should see labels for the job/target which is failing. It can
be a useful metric to alert on, and then look at logs/the target page to
try to figure out why the scrape is failing.
--
Stuart Clark