Connection Timeout error

85 views
Skip to first unread message

ritesh patel

unread,
Jun 22, 2022, 7:56:32 AM6/22/22
to promethe...@googlegroups.com
Hello Team,

While adding 2000 hosts as a prometheus scrape targets. Only 1375 hosts show up. Rest are 625 show down with connection timeout error. 

What is the reason for that? Any idea? 

Thanks and regards
Ritesh patel 

Brian Candler

unread,
Jun 22, 2022, 8:21:53 AM6/22/22
to Prometheus Users
What happens if you scrape just the 625 targets? Or if you scrape just one or two targets, taken from the set of 625 problematic ones?

- if they still show as down, then it's a problem with those targets.  Pick one, make a test direct scrape using curl from the prometheus server, and debug the issue.  Could be something like a firewall between your prometheus server and the target.

- if the targets show as up, but down when go down when you are scraping 2000 hosts, then maybe you don't have enough capacity on your central prometheus server to get round everything.  Increase your scrape interval, or spread the targets between multiple prometheus servers, or reduce the number of metrics being exported from each target.

Reply all
Reply to author
Forward
0 new messages