Thank you both for providing interesting input.
I guess I owe an explanation for my question, and this will allow me to ask a question in return.
We are trying to sell mlir and iree in an embedded context.
The combination mlir/iree has some very good qualities not found elsewhere : transparency (of the execution model, of all the code that is executed), control (of the compilation process and the generated code), traceability and even fine-grain performance tracing, low footprint, ability to compile more models (less errors, more systematic). But all of this comes at the cost of performance, with significant factors. For instance, a 1.5x/2x performance loss w.r.t. TFlite+delegates on a platform that is well-integrated, such as the Google Pixel phones. On a not-so-well-supported platform such as some Qualcomm SoC the difference w.r.t. the native toolchain (SNPE and now QNN for Qualcomm) is far bigger.
It may be that the traceability and perfomance tracing arguments are interesting enough for the iree solution we propose to be evaluated.
But it's not clear.
My question would be : how is iree justified in production today ?
Best regards,
Dumitru