Oh, interesting!
I was always thinking of building something along those lines, but purely live-linting rules loaded into a Prometheus server against the actual data that server (which you are also partially doing already).
It was going to output warnings:
- ...for any referenced metric name that isn't currently known to the Prometheus server
- ...for any label name on a metric name that isn't known
- ...for any common query mistakes, like rate() on a gauge, deriv() on counters, aggregating away the "le" label, etc.
...and potentially give an idea about which rules load how many time series in their current state.
Any of those could generate false positives, so it could output warnings at max, but could still be very helpful.
It seems like your tool already does most of that and more, but the common query gotchas one might be useful at some point too :)