Hello!
I have a situation where the server intentionally sets the H/2 max concurrent streams limit low (e.g. 16 or 32), for good reasons that are somewhat orthogonal to gRPC, so I will skip the details.
The behavior I seem to be observing is that the Java gRPC client stalls when it hits that limit and waits for streams to free up before issuing more requests.
Is this the expected behavior?
Does the Java gRPC stack have an equivalent?
It would be pretty useful for seamlessly handling service-to-service communication where the server can support a higher gRPC req/sec, but only if the client dials more H/2 connections.
I say "seamlessly" because we can of course write code to dial more subchannels to the server ourselves, but we want the client to do this automatically, without the programmer picking a hardcoded connection count.
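To make the manual workaround concrete, here is a minimal sketch of the kind of code I would like to avoid: a fixed-size pool that hands out connections round-robin. The class name `RoundRobinPool` and the factory parameter are hypothetical and for illustration only; the pool is generic so the sketch runs without any gRPC dependency.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.IntFunction;

/** Hypothetical helper: a fixed-size pool that hands out connections round-robin. */
final class RoundRobinPool<T> {
    private final List<T> connections;
    private final AtomicInteger cursor = new AtomicInteger();

    /** Creates `size` connections up front; `size` is the hardcoded count we want to avoid. */
    RoundRobinPool(int size, IntFunction<T> factory) {
        if (size <= 0) {
            throw new IllegalArgumentException("size must be positive");
        }
        List<T> created = new ArrayList<>(size);
        for (int i = 0; i < size; i++) {
            created.add(factory.apply(i));
        }
        this.connections = Collections.unmodifiableList(created);
    }

    /** Returns the next connection in rotation; floorMod keeps the index valid after overflow. */
    T next() {
        int index = Math.floorMod(cursor.getAndIncrement(), connections.size());
        return connections.get(index);
    }

    int size() {
        return connections.size();
    }
}
```

With grpc-java, `T` would presumably be `ManagedChannel`, the factory would call something like `ManagedChannelBuilder.forAddress(host, port).build()`, and each RPC would create its stub from `pool.next()`. This assumes that every separately built channel ends up with its own H/2 connection to the server. The pool size still has to be chosen by hand, which is exactly the part we would like the client to handle on its own.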
Thanks in advance!
Gaurav