[RFC] Quantization Support for CPU in OpenXLA

186 views

Skip to first unread message

Mahmoud Abuzaina

unread,

Jul 12, 2024, 6:49:14 PM7/12/24

to OpenXLA Discuss

Hello everyone!

We are glad to share a new RFC for adding quantization support for CPU in OpenXLA. The RFC proposes the leverage of Intel® Neural Compressor (INC) tool to generate Tensorflow quantized model and illustrates how it would be executed by the XLA backend.

We are planning to discuss this RFC in the upcoming OpenXLA community meeting on Tuesday, July 16th. We look forward for the valuable feedback from the community on this proposal.

Thanks,

Mahmoud

Reply all

Reply to author

Forward

0 new messages