Grafana dashboards

Thomas Dräbing

unread,

Jan 14, 2020, 4:27:09 AM1/14/20

to Repo and Gerrit Discussion

Dear all,

we plan to move some of our monitoring to the Prometheus/Grafana-stack. Among the dashboard collection published on the Grafana homepage, I couldn't find any existing dashboards for Gerrit metrics [1]. Before I start to create new dashboards from scratch, I wanted to ask whether somebody has Grafana dashboards for Gerrit metrics and is willing to share them with the community. Having a solid base to start from would be of great help (not only for me, I guess). Thus, help would be greatly appreciated!

Thanks and best regards,

Thomas

[1] https://grafana.com/grafana/dashboards?search=gerrit

Luca Milanesio

unread,

Jan 14, 2020, 8:37:19 AM1/14/20

to Thomas Dräbing, Luca Milanesio, Repo and Gerrit Discussion

On 14 Jan 2020, at 01:27, Thomas Dräbing <thomas....@gmail.com> wrote:

Dear all,

we plan to move some of our monitoring to the Prometheus/Grafana-stack. Among the dashboard collection published on the Grafana homepage, I couldn't find any existing dashboards for Gerrit metrics [1]. Before I start to create new dashboards from scratch, I wanted to ask whether somebody has Grafana dashboards for Gerrit metrics and is willing to share them with the community. Having a solid base to start from would be of great help (not only for me, I guess). Thus, help would be greatly appreciated!

We have one for Gerrit multi-site, which includes also replication and split-brain metrics.

See some screenshots at [2]

Luca.

[2] https://imgur.com/a/JMoFOg6

Thanks and best regards,
Thomas

[1] https://grafana.com/grafana/dashboards?search=gerrit

--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en

---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/dc9d5de4-1e9b-4b98-87b5-28fe61985a57%40googlegroups.com.

Fabio Ponciroli

unread,

Jan 14, 2020, 8:40:25 AM1/14/20

to Luca Milanesio, Luca Milanesio, Thomas Dräbing, Repo and Gerrit Discussion

@Luca Milanesio we could extract the multi-site part and just publish the core metrics. WDYT?

To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/23AF7B2A-739B-4D1B-9872-31D281029345%40gmail.com.

Luca Milanesio

unread,

Jan 14, 2020, 8:52:28 AM1/14/20

to Fabio Ponciroli, Luca Milanesio, Thomas Dräbing, Repo and Gerrit Discussion

On 14 Jan 2020, at 05:39, Fabio Ponciroli <pon...@gmail.com> wrote:

@Luca Milanesio we could extract the multi-site part and just publish the core metrics. WDYT?

Sure, that would be a start.

Luca.

Thomas Dräbing

unread,

Jan 14, 2020, 9:02:14 AM1/14/20

to Luca Milanesio, Fabio Ponciroli, Repo and Gerrit Discussion

Hi Luca, hi Fabio,

if you could share the dashboard, that would be really awesome!

Maybe we can version the json-files describing the dashboards somewhere? Then it would be easy to adapt to new metrics etc.

I will of course also happily share what we did for our Prometheus/Grafana setup, as soon as it is ready.

Best,

Thomas

Luca Milanesio

unread,

Jan 14, 2020, 10:33:05 AM1/14/20

to Thomas Dräbing, Luca Milanesio, Fabio Ponciroli, Repo and Gerrit Discussion

On 14 Jan 2020, at 06:01, Thomas Dräbing <thomas....@gmail.com> wrote:

Hi Luca, hi Fabio,

if you could share the dashboard, that would be really awesome!

I believe the best would be to have a docker-compose.yaml that already contains the components we need and pre-configured:

1. Prometheus

2. Grafana

With regards to the Grafana dashboard, it should be shared on http://snapshot.raintank.io/info/ correct?

Luca.

Thomas Dräbing

unread,

Jan 14, 2020, 10:56:52 AM1/14/20

to Luca Milanesio, Fabio Ponciroli, Repo and Gerrit Discussion

On Tue, 14 Jan 2020 at 16:33, Luca Milanesio <luca.mi...@gmail.com> wrote:

On 14 Jan 2020, at 06:01, Thomas Dräbing <thomas....@gmail.com> wrote:

Hi Luca, hi Fabio,

if you could share the dashboard, that would be really awesome!

I believe the best would be to have a docker-compose.yaml that already contains the components we need and pre-configured:
1. Prometheus
2. Grafana

I am currently working on a Kubernetes based setup that is very opinionated, so mostly configured. By mostly configured I mean that some configuration can not sensibly be preconfigured (e.g. credentials), but others will most of the time stay the same.

In my approach, I use the helm charts provided for prometheus [1] and grafana [2], configure them as much as possible with values, I think are sensible (and have been tested to work so far). For options like credentials, which have to be configured for each deployment, I created a leaner configuration-file to set them, that will be used to create the final configuration files for helm and additional resources. Thereby with only a little configuration and a few commands one can set up the logging stack, also without having to spend a lot of time to learn how to configure Prometheus or Grafana to get a basic monitoring setup for Gerrit.

I am planning to provide this to open source as soon as I have some dashboards, a sensible base configuration to work with and have tested it for our setup.

With regards to the Grafana dashboard, it should be shared on http://snapshot.raintank.io/info/ correct?

Yes, that would be one option, as well as on the grafana homepage (https://grafana.com/grafana/dashboards). Or we just version the json-files in a git repository. It will require a bit more work to deploy them though, since we can't just import them by an id or URL, but have to load the json. On the other hand it would provide us with code review and a bit more control :-). Maybe we could do both?

[1] https://github.com/helm/charts/tree/master/stable/prometheus

[2] https://github.com/helm/charts/tree/master/stable/grafana

Paladox none

unread,

Jan 14, 2020, 1:06:42 PM1/14/20

to Repo and Gerrit Discussion

Wikimedia use the javamelody dashboard [1]

[1] https://grafana.wikimedia.org/d/Bw2mQ3iWz/gerrit-javamelody?orgId=1

Mihály Petrényi

unread,

Jan 15, 2020, 2:30:39 AM1/15/20

to Repo and Gerrit Discussion

Hi,

I am from Ericsson. We are hosting a huge multi-site Gerrit instance using Grafana with Prometheus for monitoring.
We are using some standard Prometheus exporters: node, mtail, jmx, gerrit, postgres and haproxy.
Additionally, we are generating custom metrics, mainly based on node exporter's textfilecollector functionality.
That is a great, easy to use feature, I highly recommend it.
Files containing the metrics can be placed in a directory and node exporter will serve those to Prometheus.
We are also planning to use Grafana Loki for log files, instead of the current solution with mtail.
These exporters provide us ~15k metrics / node.

We split the metrics into several dashboards. At the moment we have the following main dashboards: Overview, Datacenters, Backend, Database, Frontend, Garbage Collection, Disk usage, Network, RED (based on Google's RED method), Replication, Repositories, Node exporter.

Most important thing is to have the Prometheus targets properly and consistently labeled. We are using the following common labels for our targets:
environment (dev, staging, production), configuration (master, slave), datacenter (for multi-site), job (exporter name), role (backend, frontend, gc, db)

Sample screenshot from our Overview dashboard: https://imgur.com/BLlGDt5

It is probably a good idea to have a shared, publicly available Grafana template for Gerrit that can be easily tailored to the given environments.
We are happy to contribute with our experience.

Regards,
Mihaly

2020. január 14., kedd 15:02:14 UTC+1 időpontban Thomas Dräbing a következőt írta:

Hi Luca, hi Fabio,

if you could share the dashboard, that would be really awesome!

Maybe we can version the json-files describing the dashboards somewhere? Then it would be easy to adapt to new metrics etc.
I will of course also happily share what we did for our Prometheus/Grafana setup, as soon as it is ready.

Best,
Thomas

On Tue, 14 Jan 2020 at 14:52, Luca Milanesio <luca.m...@gmail.com> wrote:

On 14 Jan 2020, at 05:39, Fabio Ponciroli <pon...@gmail.com> wrote:

...@Luca Milanesio we could extract the multi-site part and just publish the core metrics. WDYT?

Sure, that would be a start.

Luca.

Il giorno mar 14 gen 2020 alle ore 14:37 Luca Milanesio <luca.m...@gmail.com> ha scritto:

On 14 Jan 2020, at 01:27, Thomas Dräbing <thomas....@gmail.com> wrote:

Dear all,

we plan to move some of our monitoring to the Prometheus/Grafana-stack. Among the dashboard collection published on the Grafana homepage, I couldn't find any existing dashboards for Gerrit metrics [1]. Before I start to create new dashboards from scratch, I wanted to ask whether somebody has Grafana dashboards for Gerrit metrics and is willing to share them with the community. Having a solid base to start from would be of great help (not only for me, I guess). Thus, help would be greatly appreciated!

We have one for Gerrit multi-site, which includes also replication and split-brain metrics.
See some screenshots at [2]

Luca.

[2] https://imgur.com/a/JMoFOg6

Thanks and best regards,
Thomas

[1] https://grafana.com/grafana/dashboards?search=gerrit

--
--

To unsubscribe, email rep...@googlegroups.com

More info at http://groups.google.com/group/repo-discuss?hl=en

---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.

To unsubscribe from this group and stop receiving emails from it, send an email to repo-d...@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/dc9d5de4-1e9b-4b98-87b5-28fe61985a57%40googlegroups.com.

--
--
To unsubscribe, email repo-d...@googlegroups.com

More info at http://groups.google.com/group/repo-discuss?hl=en

---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.

To unsubscribe from this group and stop receiving emails from it, send an email to repo-d...@googlegroups.com.

Luca Milanesio

unread,

Jan 15, 2020, 10:31:46 AM1/15/20

to Mihály Petrényi, Luca Milanesio, Repo and Gerrit Discussion

On 14 Jan 2020, at 07:20, Mihály Petrényi <e.mihaly...@gmail.com> wrote:

Hi,

I am from Ericsson. We are hosting a huge multi-site Gerrit instance using Grafana with Prometheus for monitoring.

Hi Mihaly, thanks for sharing your experience.

Out of topic: which Gerrit multi-site implementation are you using? Gerrit + multi-site plugin? WANdisco? In-house implementation?

Is it a Gerrit multi-master/multi-site or a simple master-slave replication?

We are using some standard Prometheus exporters: node, mtail, jmx, gerrit, postgres and haproxy.

How do you synchronise the Postgres multi-site?

Additionally, we are generating custom metrics, mainly based on node exporter's textfilecollector functionality.
That is a great, easy to use feature, I highly recommend it.
Files containing the metrics can be placed in a directory and node exporter will serve those to Prometheus.

That’s a very good hint, thanks a lot for that.

We are also planning to use Grafana Loki for log files, instead of the current solution with mtail.
These exporters provide us ~15k metrics / node.

Really interesting also.

We split the metrics into several dashboards. At the moment we have the following main dashboards: Overview, Datacenters, Backend, Database, Frontend, Garbage Collection, Disk usage, Network, RED (based on Google's RED method), Replication, Repositories, Node exporter.

Most important thing is to have the Prometheus targets properly and consistently labeled. We are using the following common labels for our targets:
environment (dev, staging, production), configuration (master, slave), datacenter (for multi-site), job (exporter name), role (backend, frontend, gc, db)

Sample screenshot from our Overview dashboard: https://imgur.com/BLlGDt5

It is probably a good idea to have a shared, publicly available Grafana template for Gerrit that can be easily tailored to the given environments.
We are happy to contribute with our experience.

Have you thought about coming to a Gerrit User Summit and present your experience?

Luca.

To unsubscribe, email repo-discuss...@googlegroups.com

More info at http://groups.google.com/group/repo-discuss?hl=en

---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.

To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/904df83a-e7f3-448a-a189-b213c70bd5de%40googlegroups.com.

Message has been deleted

Mihály Petrényi

unread,

Jan 15, 2020, 6:43:48 PM1/15/20

to Repo and Gerrit Discussion

Out of topic: which Gerrit multi-site implementation are you using? Gerrit + multi-site plugin? WANdisco? In-house implementation?
Is it a Gerrit multi-master/multi-site or a simple master-slave replication?

We are using an in-house implementation as we are stuck with an older Gerrit version with postgresql and Lucene. At the moment, it is a "simple" master-slave setup. Master is using the high-availability plugin to achieve an active-passive configuration. This is a bottleneck in our implementation and we are planning to improve on this in the close future by introducing Elasticsearch and upgrading to NoteDb in order to bring us closer to the multi-master setup. For replication, we use the replication plugin with push replications.

How do you synchronise the Postgres multi-site?

We use the Postgres streaming replication feature and Pgpool for high availability.

Have you thought about coming to a Gerrit User Summit and present your experience?

I will bring this up within our team. Thanks for mentioning.

Mihaly