[RFC] Quantization Support for CPU in OpenXLA


Mahmoud Abuzaina

Jul 12, 2024, 6:49:14 PM
to OpenXLA Discuss
Hello everyone!

We are glad to share a new RFC for adding CPU quantization support to OpenXLA. The RFC proposes using the Intel® Neural Compressor (INC) tool to generate a quantized TensorFlow model and illustrates how the XLA backend would execute it.

We plan to discuss this RFC at the upcoming OpenXLA community meeting on Tuesday, July 16th. We look forward to the community's valuable feedback on this proposal.

Thanks,
Mahmoud