Hello
Could someone help me resolve an issue in my configuration?
I have three Prometheus jobs that use the proxy_url parameter, pointing to a PushProx deployment located within the same AWS EKS cluster as Prometheus.
PushProx is also externally accessible via an NGINX ingress controller to collect metrics from hosts that are not directly reachable by Prometheus
I am currently facing an issue where Prometheus cannot scrape metrics from certain hosts, even though their metrics endpoints are accessible. The affected hosts appear to be random.
I have already tried increasing the scrape_interval and scrape_timeout settings in Prometheus, as well as the scrape.default-timeout parameter for PushProx. Unfortunately, these changes have not resolved the issue.
Also I checked pod resources like CPU and memory, they also not at the max level
Below are logs from the PushProx proxy pod
ts=2024-11-15T16:04:04.346Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.44.44.161:9000/metrics\": context canceled" url=http://10.44.44.161:9000/metrics
ts=2024-11-15T16:04:04.363Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.44.57.96:9000/metrics\": context canceled" url=http://10.44.57.96:9000/metrics
ts=2024-11-15T16:04:04.463Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.44.57.100:9000/metrics\": context canceled" url=http://10.44.57.100:9000/metrics
ts=2024-11-15T16:04:04.539Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.44.66.66:9000/metrics\": context canceled" url=http://10.44.66.66:9000/metrics
ts=2024-11-15T16:04:04.634Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.10.8.61:9000/metrics\": context canceled" url=http://10.10.8.61:9000/metrics
ts=2024-11-15T16:04:04.664Z caller=main.go:179 level=error msg="Error scraping:" err="Timeout reached for \"http://10.44.66.47:9000/metrics\": context canceled" url=http://10.44.66.47:9000/metrics