Prometheus Pushgateway TTL

Braden Schaeffer

Jun 15, 2023, 12:20:24 AM
to Prometheus Developers
I wanted to reopen this discussion because I am having a very difficult time understanding how the Pushgateway can be the suggested solution for batch job metric collection, while at the same time batch jobs are dismissed as not being a good example of why metric TTLs are needed in the Pushgateway.

The most basic example: two batch jobs that produce the same metrics (gRPC or HTTP metrics). This is not just `last_completed_at` or something similar, as I have seen suggested before, where it's the same metric being updated over and over again. You have to include a label that identifies these jobs as distinct so that metrics like gRPC request rates can be calculated correctly. In the Kubernetes world this usually means the pod ID. Simple enough, until you have 1000s of these pod IDs compounded by other labels.
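For concreteness, here's roughly what each of these jobs does before it exits (just a sketch using the Go client; the metric, label, and job names here are placeholders, not our real code):

```go
package main

import (
	"log"
	"os"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/push"
)

func main() {
	// Counter incremented over the life of the batch job, partitioned by
	// gRPC method and status code (placeholder label names).
	grpcRequests := prometheus.NewCounterVec(
		prometheus.CounterOpts{
			Name: "grpc_request_count",
			Help: "gRPC requests made by this batch job.",
		},
		[]string{"grpc_method", "grpc_code"},
	)
	grpcRequests.WithLabelValues("GetUser", "OK").Inc()

	// Each job instance pushes under its own grouping key (the pod name),
	// so concurrent jobs don't overwrite each other's values. This is
	// exactly what makes the groups pile up in the Pushgateway.
	if err := push.New("http://pushgateway:9091", "my_batch_job").
		Collector(grpcRequests).
		Grouping("pod", os.Getenv("HOSTNAME")).
		Push(); err != nil {
		log.Fatal(err)
	}
}
```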

By now we all know those metrics are going to stay around forever, but I don't understand why the answer to this problem is "this is not a good use case". For the Pushgateway? For a TTL? What am I doing wrong?

I've got a pipeline and library code streamlined for Prometheus metric collection, and the only solution I have seen offered at all is "use statsd". No. That's silly. I would need new clients and two ways of defining metrics in code to account for each potential storage solution. Two APIs, etc.

Can someone please help me understand why the Pushgateway's existence is not reason enough to implement a TTL?

Chris Sinjakli

Jun 15, 2023, 9:36:23 AM
to Prometheus Developers
So, I want to be really clear that I'm saying this with a "using Prometheus at my job" hat on rather than a Prometheus team one.

I recently came to a similar conclusion that the Pushgateway isn't practical to use for anything other than pushing a last completion time for a given job (or instance of a job). I have a handful of cron jobs that run in the background in kube clusters, perform some work, and push some metrics to say they've done that. Right now they only push a gauge to say that they successfully ran at the current time. I would love for them to push some counters about how many items they processed.
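Concretely, the push those jobs do today is essentially just this (a sketch with the Go client; the metric and job names are made up):

```go
package main

import (
	"log"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/push"
)

func main() {
	// The only thing the cron job pushes today: a gauge set to "now" when
	// the run succeeds.
	lastSuccess := prometheus.NewGauge(prometheus.GaugeOpts{
		Name: "cronjob_last_success_timestamp_seconds",
		Help: "Unix timestamp of the last successful run.",
	})
	lastSuccess.SetToCurrentTime()

	// What I'd like to add, but can't do sensibly with the approaches
	// listed below: counters for how many items this run processed.

	if err := push.New("http://pushgateway:9091", "my_cron_job").
		Collector(lastSuccess).
		Push(); err != nil {
		log.Fatal(err)
	}
}
```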

The series of unworkable approaches I ran through were:
  - Use Pushgateway without a pod label: counter values get replaced, and if you typically process tiny numbers of events (like low single digits), it can confuse restart detection in rate functions
  - Use Pushgateway with a pod label: the unbounded memory growth problem you discussed, and unwanted timeseries churn in Prometheus itself
  - Use prom-aggregation-gateway: It sums gauges as well as counters, which makes no sense when you're pushing a timestamp gauge

I may well have missed something while trying to find a solution, and I'd be really happy to hear that. For the sake of doing something relatively quick at work, I'm going to be contributing a Prometheus push source to Vector soon, which will let me sum counters and replace gauges. This approach will still be subject to the usual caveats of cramming push metrics into Prometheus' pull model, but for this style of job it honestly feels like the right compromise.

With the team hat back on, I'd be curious if there's any appetite for a first-party solution that looks something like this. The Vector solution will get the job done, but I do feel a bit silly bringing in an any-to-any logs and metrics processor to only use features from it that live entirely within the Prometheus ecosystem.

Bjoern Rabenstein

Jun 29, 2023, 7:03:09 AM
to Braden Schaeffer, Prometheus Developers
On 14.06.23 13:10, 'Braden Schaeffer' via Prometheus Developers wrote:
>
> The most basic example: two batch jobs that produce the same metrics (gRPC
> or HTTP metrics). This is not just `last_completed_at` or something
> similar, as I have seen suggested before, where it's the same metric being
> updated over and over again. You have to include a label that identifies
> these jobs as distinct so that metrics like gRPC request rates can be
> calculated correctly. In the Kubernetes world this usually means the pod
> ID. Simple enough, until you have 1000s of these pod IDs compounded by
> other labels.

I don't fully understand what you are trying to do. Could you explain
what metrics you are pushing exactly, and what PromQL expressions you
are using to "correctly calculate a gRPC request rate"?

--
Björn Rabenstein
[PGP-ID] 0x851C3DA17D748D03
[email] bjo...@rabenste.in

Braden Schaeffer

Jun 29, 2023, 8:47:56 AM
to Bjoern Rabenstein, Prometheus Developers
It's the same as calculating the total incoming request rate of N pods in a deployment: `sum(rate(grpc_request_count{service="foo"}[5m]))`

For our use case, we have N Kubernetes jobs being spun up that push many metrics, like grpc_request_count, to the Pushgateway. If I want to know the rate of outgoing requests, I have to have unique labels on the metric for Prometheus to accurately calculate the request rate of the service as a whole; without them the pushes just overwrite each other. For the Pushgateway, though, those unique IDs are never GC'd the way they are in Prometheus.

So, a service creating 2 pods every 5 minutes, each writing just a single grpc_request_count metric with 20 label combinations to the Pushgateway, produces 2 pod IDs × 288 runs a day × 20 grpc_request_count label combinations = 11,520 series. That's a cardinality of over 11k series per day, for that one metric from that one service, that will not be GC'd by the Pushgateway until I restart it.

Bjoern Rabenstein

Jul 1, 2023, 5:32:41 PM
to Braden Schaeffer, Prometheus Developers
On 29.06.23 08:47, 'Braden Schaeffer' via Prometheus Developers wrote:
> It's the same as calculating the total incoming request rate of N pods in a
> deployment: `sum(rate(grpc_request_count{service="foo"}[5m]))`

🤔 I'm surprised that you seem to push a counter metric to the
Pushgateway.

I would say the intended use case for the Pushgateway is that a
batch job pushes its metrics upon completion. That means you only ever
have one value of those metrics, so a `rate` on those would always
result in zero.

Are you perhaps pushing multiple times during the runtime of your
batch jobs? That would be weird indeed for a PGW use case. Why don't
you just scrape your jobs normally then?

Braden Schaeffer

Jul 11, 2023, 3:23:48 PM
to Bjoern Rabenstein, Prometheus Developers
They could live for 5s or 1 hour. Does it really matter what you send to the Pushgateway? It supports counters, so why not push them?

A TTL is all we need here.

E

Jul 12, 2023, 5:34:34 AM
to prometheus...@googlegroups.com, bra...@fullstory.com
I think an optional TTL per time series is a good idea. It might have several use cases, it doesn't break anything, and it shouldn't be too hard to make. So why not?
I might have used this feature to trigger short-lived alerts with arbitrary text in a label, something I wouldn't do without a TTL because it would require cleanup.

--
Best regards,
Evgeniy Yunkin

Bjoern Rabenstein

Jul 12, 2023, 7:45:28 AM
to Braden Schaeffer, Prometheus Developers
On 11.07.23 15:23, 'Braden Schaeffer' via Prometheus Developers wrote:
> They could live for 5s or 1 hour.

The whole idea of a Prometheus counter doesn't really make sense for a
job that lives for just 5s, if you are scraping every 15s or every
minute or so.

And a job that lives for 1 hour should be scraped directly.

So in the first case, using a counter doesn't make sense, and in the
second case using the Pushgateway doesn't make sense.

> Does it really matter what you send to the Pushgateway? It supports
> counters, so why not push them?

We could be stricter and just reject counters being pushed to the
Pushgateway, but that would be a breaking change. Historically, the
metric type information in Prometheus was (and to a large extent still
is) some kind of "weak typing", so no hard restrictions were imposed
(you can apply `rate` to a gauge or `delta` to a counter without
Prometheus complaining about it).

Also, it feels natural to count "records backed up by the daily
database backup job" in a counter and push it to the
Pushgateway. However, when it arrives on your Prometheus server, it
doesn't really behave as a counter. Summing those values up across
instances is really painful with PromQL, and the reason for that is
that we are essentially handling events here, for which Prometheus as
a whole wasn't really designed.

If you really have to use Prometheus for that case, the "least bad"
solutions I know of are statsd with the statsd-exporter (
https://github.com/prometheus/statsd_exporter ) or the
prom-aggregation-gateway
( https://github.com/zapier/prom-aggregation-gateway ).
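For that backup example, pushing to the statsd_exporter is just a line of the statsd text protocol over UDP, roughly like this (a rough sketch; I'm assuming the exporter's default UDP port 9125 and a made-up metric name):

```go
package main

import (
	"fmt"
	"log"
	"net"
)

func main() {
	// The statsd_exporter speaks the plain statsd line protocol;
	// 9125/udp is its default listen address, adjust to your deployment.
	conn, err := net.Dial("udp", "statsd-exporter:9125")
	if err != nil {
		log.Fatal(err)
	}
	defer conn.Close()

	// "<name>:<value>|c" is a statsd counter increment. The exporter
	// accumulates increments across pushes and exposes one Prometheus
	// counter, so rates and sums behave normally on the Prometheus side.
	if _, err := fmt.Fprint(conn, "records_backed_up:42|c"); err != nil {
		log.Fatal(err)
	}
}
```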

A TTL doesn't really address the fundamental problem. It might enable
a very brittle solution that is worse than the solutions that are
already available.

Bjoern Rabenstein

Jul 12, 2023, 7:49:19 AM
to E, prometheus...@googlegroups.com, bra...@fullstory.com
On 12.07.23 10:10, E wrote:
> I think an optional TTL per time series is a good idea. It might have several
> use cases, it doesn't break anything, and it shouldn't be too hard to make.
> So why not?

Because all the use cases discussed so far have turned out to be
anti-patterns we don't want to support. This topic was brought up
multiple times at dev-summits etc., and the outcome was always the
same.

> I might have used this feature to trigger short-lived alerts with arbitrary
> text in a label, something I wouldn't do without a TTL because it would
> require cleanup.

I don't quite understand that use case, but feel free to flesh it out
a bit more and propose it as a topic for the dev-summit by adding it
to the agenda:
https://docs.google.com/document/d/11LC3wJcVk00l8w5P3oLQ-m3Y37iom6INAMEu2ZAGIIE/edit?pli=1