I want to call knative service as a subscription using the broker-filter model. However, when a burst event occurs, the broker sends too many events to the knative service at the same time, causing too many failures.
For example, let's assume that the knative service is set to max-scale: 10, concurrency: 10, and each pod runs for 1 minute.
If I generate 10,000 events at once, in knative serving, up to 100 requests are running at the same time.
However, since many requests are simultaneously sent through the broker to the knative serving, most of the remaining requests fail.
Even though I configure delivery, many failures occur continuously and it takes too long time to process all the events.
To solve this problem, I want to set the rate-limit for calling knative service in the subscriber of the event.
Is there any way to set something like this?