I was browsing the new[ish] Grafana.net dashboard repository at
https://grafana.net/dashboards.
Sad to say, there are no OpenTSDB supported dashboards listed.
I am going to work on cleaning up some of the dashboards I use with OpenTSDB and will submit a small handful.
I figure some of you may have some Grafana dashboards you might be willing to contribute.
Here's a list of ideas I intend on getting started on [mostly driven by what I use]:
- Host/Multi-Host Status driven by tcollector/scollector
- Drill down into CPU and processes and memory
- Drill down into disks and file systems
- Drill down into Network and interface stats
- TSDB/Multi-TSDB Status driven by OpenTSDB's internal metrics (drilldowns ?)
- Basic HBase Status driven by OpenTSDB's internal metrics (drilldowns ?)
- Advanced HBase Status driven by JMX and/or HBase's HTTP published JSON JMX metrics (drilldowns ?)
- ZooKeeper Status driven by Quorum's published JMX stats (drilldowns ?)
Other ones in early development:
- Oracle Instance and RAC / Postgresql Instance Status (currently driven by JDBC accessible stats from a gcollector [like tcollector, but groovy driven] script)
- Kafka instance and cluster Status driven by JMX stats
- WebSphere MQ instance Status driven by gcollector and PCF API
- DropWizard/JMX stats (custom collectors) Nowhere near a standard since this could go anywhere, but there's some work going into more dynamic dashboards which might support some sort of useful "meta" dashboard packaging.
One of the challenges in creating standard re-usable dashboards is that the data collectors must user standard metric-names and tags, so tcollector and scollector stats are good drivers, since the use opinionated and consistent naming conventions. For dashboards relying on non-standard data-collectors, the dashboard would need to accompanied by a minimal guide on how to implement the collection.
Lastly, the
Grafana auto-loading template variables (a feature that takes dashboards from interesting up to super-useful) requires that
tsd.core.meta.enable_realtime_ts be enabled, which I believe still has some performance implications. Interested to know, do you enable realtime_ts, or deliberately disable it ?
Any one else interested ?
//Nicholas