The only official way to enable gRPCLB is, as you found out, the one described in gRFC A5. The client could be hacked up locally to force gRPCLB on,
but wouldn't it be simpler to implement everything at the application level? That is, do all the service discovery against a regular gRPC server, process its responses, and then proceed. Repurposing gRPCLB for this amounts to the same thing, except that you also have to jump through extra hoops to enable it, implement an API (load_balancer.proto) that isn't really designed for your use case, and so on.
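
To make the application-level idea concrete, here is a rough Python sketch, not a definitive implementation. It assumes your discovery service hands back a list of "host:port" strings. `fake_discover` and `backend_cycle` are hypothetical names of mine: the first stands in for a unary RPC to your own discovery server, and plain round-robin is just one possible way to "process its responses".

```python
import itertools
from typing import Callable, Iterator, List


def backend_cycle(discover: Callable[[], List[str]]) -> Iterator[str]:
    """Query the discovery service once and round-robin over its answer."""
    addresses = discover()
    if not addresses:
        raise RuntimeError("discovery returned no backends")
    return itertools.cycle(addresses)


def fake_discover() -> List[str]:
    # Hypothetical stand-in for a unary RPC to your own discovery server.
    return ["10.0.0.1:50051", "10.0.0.2:50051"]


backends = backend_cycle(fake_discover)
first, second, third = next(backends), next(backends), next(backends)
# A real client would now open a channel to the chosen address,
# e.g. grpc.insecure_channel(first), and issue its RPCs on it.
```

With two backends the third pick wraps around to the first one. Re-running discovery periodically, or when a backend starts failing, would be up to the application.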