Kahho Phong
unread,Jun 2, 2024, 10:48:56 PM6/2/24Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to OpenXLA Discuss, Sandeep Dasgupta, OpenXLA Discuss, Kahho Phong
Thanks Sandeep for your reply.
I guess dot_general is the only operator in the opset to have this constraint?
We are trying to pivot to use StableHLO v1.0.0 as IR for our compiler stack, but we are not stuck in the lowering because our inference engine supports asymmetrical quantization in the kernels (e.g. fullyconnected/linear, which we are mapping to dot_general in the lowering).
I guess it make sense to remove this constraint on dot_general to stay consistent with the rest of the opset, but probably will necessitates changes in the StableHLO interpreter (and legalization, etc).
Any advice that you can share, the best way for us to experiment locally, i.e. adapt dot_general to take in asymmetrical quantization?
Regards.