I fork grpc-go uses as gateway just enioy h2c benifit (also remove pb IDL feature),which I implement 0-RTT TLS( cgo invoke libsodium) repalce the standard TLS and handle request just do http request to upstream; In benchmark of bidirectional streaming rpc ,high cpu usage under not much heavy load (maxConcurrencyStream = 100 or 1000 ,the same), according to "go tool pprof ", I find syscall.wirte consume much cpu and RT ( maybe cgo performance?). At least 3 time call system.wrtie (flush) will cause this problem (header + data + status)?Is orignal grpc have this issue?how to resolve or reduce invoke syscall.write?or waiting go add syscall.writev?