If gRPC connections are persistent, a single client will always talk to a single backend in that configuration. That is fine if you have lots of clients, but how is load balancing done?
How is the server able to handle so many open connections? Won't it hit the open file descriptor limit? Can someone please explain how this is implemented at the socket level to handle that many connections?
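For context, the limit I mean is the per-process cap on open file descriptors, since each open socket consumes one descriptor. A minimal sketch of how that limit can be read on a Unix-like system, assuming Python's standard `resource` module (the helper name `fd_limits` is just illustrative):

```python
import resource


def fd_limits():
    """Return the (soft, hard) per-process limit on open file descriptors."""
    # RLIMIT_NOFILE bounds how many descriptors (files, sockets, pipes)
    # a single process may hold open at once.
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft, hard


if __name__ == "__main__":
    soft, hard = fd_limits()
    print(f"soft limit: {soft}, hard limit: {hard}")
```

The soft limit is the one a process actually runs into; it can be raised up to the hard limit, so the question is really whether a server with many persistent connections needs that raised.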