level=error ts=2018-05-16T08:39:32.143676493Z caller=federate.go:163 component=web msg="federation failed" err="write tcp 192.168.243.145:9090->10.0.0.12:33494: write: broken pipe"
level=error ts=2018-05-16T08:40:32.146520927Z caller=federate.go:163 component=web msg="federation failed" err="write tcp 192.168.243.145:9090->10.0.0.12:33504: write: broken pipe"
level=error ts=2018-05-16T08:41:32.154047845Z caller=federate.go:163 component=web msg="federation failed" err="write tcp 192.168.243.145:9090->10.0.0.12:33506: write: broken pipe"
level=error ts=2018-05-16T08:42:32.145742427Z caller=federate.go:163 component=web msg="federation failed" err="write tcp 192.168.243.145:9090->10.0.0.12:33508: write: broken pipe"
I saw this caused by timeout on federation by other Prometheus.
Is that your case? If yes you will need to shard your instances or lower the ammount of federated data.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/f5ef63e2-4612-4c07-ae3c-00e76ff3042a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to a topic in the Google Groups "Prometheus Users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/prometheus-users/xlKc_fp4r3k/unsubscribe.
To unsubscribe from this group and all its topics, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
Might be a timeout. It says "Broken Pipe", same as the original example:level=error ts=2018-05-16T08:39:32.143676493Z caller=federate.go:163 component=web msg="federation failed" err="write tcp 192.168.243.145:9090->10.0.0.12:33494: write: broken pipe"Different time and sockets, of course. I can't post my own errors since I've increased the parameter 'scrape_interval' from 5s to 10s, and now it's working just fine, but it would return errors again if I add more jobs...Do you have any idea what causes it? Is there a limit of how many jobs can be federated?Thanks in advance,Erez
On Fri, Jul 13, 2018 at 8:23 AM, Martin Chodúr <m.ch...@seznam.cz> wrote:
Hi,
I saw this caused by timeout on federation by other Prometheus.
Is that your case? If yes you will need to shard your instances or lower the ammount of federated data.
--
You received this message because you are subscribed to a topic in the Google Groups "Prometheus Users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/prometheus-users/xlKc_fp4r3k/unsubscribe.
To unsubscribe from this group and all its topics, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/f5ef63e2-4612-4c07-ae3c-00e76ff3042a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-use...@googlegroups.com.
To post to this group, send email to promethe...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/CANgX1N7mdGO6txpO08%3DE%2BAsbxPZNZ__HR6vYiOg9sANJC%3DPhkw%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/CABbyFmrT5JPFcQv1tFB4ifJvrF0hw--upZb2PVDj6A-cSDZVTg%40mail.gmail.com.