Hello XLA Devs,
I have an issue where the CompileRequest failed because of the xla message > 2GB. Upon further inspection, I found that the HLOModule Proto is the one that is large due to the tensors that are used as constants being sent across.
I dumped the hlo proto and zipped the file and it is comparatively very small.
I am guessing someone here must have encountered this else we are doing something wrong by adding the constant tensors in the HLO proto.
If you encountered a similar situation, how did you handle this?
Additionally,
1) Is there a compile pass that I can run to remove constants ?
2) or send tensors in a binary format?
Thanks, Naveen