On general graphing I've got a munin mysql plugin update
https://github.com/munin-monitoring/munin/pull/164 in progress.
It certainly does primary/non-primary and cluster size alerts/graphs, flow control, though currently needs
https://github.com/codership/galera/pull/57 and
https://github.com/codership/galera/pull/50 to be merged to get more meaningful graphs (rather than gauges that average over the uptime) and the max/min values add measurement of brief volatile bulk changes that occur between status probes.
Improvements welcome.
----- Original Message -----
> Hello!
> Could you share the experience of monitoring Galera Cluster? Is it
> maybe some existing programs or plugins? I need to monitor:
> 1) List all queries, currently running.
Given binary row replication a query isn't generally available in galera. In fact I've been having trouble seeing anything meaningful (
https://mariadb.atlassian.net/browse/MDEV-6327).
> 2) List all replication processes currently running (SST and IST).
Watching Galera node state will give you an idea as to what SST are in place and when. I haven't seen how to monitor IST.
> 3) Monitor resource allocation among queries and replication
> processes.
Index size, wsrep_cert_deps_distance and flow control graphs will give you an idea of how the each node is keeping up. wsrep_flow_paused/wsrep_flow_paused_ns graphs show when pausing occurs.
On resource allocation among queries this happens mainly in the innodb engine. Munin has graphs for general innodb status. See my previous post to the list about galera memory usage and Alexey's explanation.
> 4) Monitor network connections and relationship to running queries.
wsrep_replicated_bytes/wsrep_received_bytes show network traffic within cluster. Other munin plugins can monitor general or specific network traffic. I've no idea how to map this to running queries or how it would be useful.
--
Daniel Black, Engineer @ Open Query (
http://openquery.com.au)
Remote expertise & maintenance for MySQL/MariaDB server environments.