Hey all,
Part of the production story of Zipkin is operations and monitoring of Zipkin itself. For a while now Zipkin has directly exposed a bunch of metrics in the Prometheus format (on the /prometheus endpoint, in case you were wondering). The next step is to provide an example Prometheus setup and a dashboard to consume that data.
- Is the dashboard "correct"? As in, are the labels saying what the metrics actually are? I built it to the best of my ability, but I'd love more eyes on it. Also, are the descriptions easy to understand? Do the graphs make sense? Would other groupings of data series make more sense?
- (Ideally after 1. is taken care of) If you happen to already run Zipkin + Prometheus in production, I'd love to upload a screenshot of the dashboard with actual, real data - the current screenshot is just the idling example docker-compose setup.
Looking forward to your thoughts.