I have been maintaining a nightly run of the IREE nightly pip package against the Torch-MLIR E2E tests in the iree-torch repo (https://github.com/google/iree-torch). In the last 3 weeks it has caught 2 legitimate regressions:
I would like to move these tests closer to IREE core so that regressions are caught before they land. Filing these bugs, XFAILing the tests, and then un-XFAILing them once the fix lands has become a maintenance burden. Almost all the tests boil down to very small MLIR snippets (the IR in the bugs above is unreduced).
I don't think it is advisable to actually depend on pulling down Torch-MLIR itself for the testing (especially as new tests get added to the test suite). I propose the following:
1. We teach iree-torch how to spit out a snapshot of standalone iree-compile/iree-run-mlir invocations from the test suite. (this removes the dependence on Torch-MLIR)
2. We add an IREE CI job (ideally a presubmit) that bulk-runs those command-line invocations (this should be pretty quick).
3. We add a weekly task to the build cop rotation: regenerate the test snapshot, update the XFAILs, and file bugs for any failures identified by new tests.
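
To make the CI job in step 2 concrete, here is a rough sketch of what the bulk runner could look like. Everything in it is an assumption for discussion: the snapshot is modeled as a plain mapping from test name to argv list, the XFAIL list as a set of test names, and `run_snapshot` / `needs_attention` are hypothetical helper names. The actual iree-compile/iree-run-mlir command lines would come from whatever iree-torch emits; none are spelled out here.

```python
import subprocess
import sys
from typing import Dict, List, Set


def run_snapshot(invocations: Dict[str, List[str]],
                 xfails: Set[str],
                 timeout_s: float = 60.0) -> Dict[str, str]:
    """Run each command and classify it as PASS, FAIL, XFAIL, or XPASS.

    `invocations` maps a test name to its argv list (hypothetical snapshot
    format); `xfails` holds the names of tests that are expected to fail.
    """
    results = {}
    for name, argv in invocations.items():
        try:
            proc = subprocess.run(argv, capture_output=True, timeout=timeout_s)
            ok = proc.returncode == 0
        except (subprocess.TimeoutExpired, OSError):
            # A hang or a missing binary counts as a failure.
            ok = False
        if name in xfails:
            results[name] = "XPASS" if ok else "XFAIL"
        else:
            results[name] = "PASS" if ok else "FAIL"
    return results


def needs_attention(results: Dict[str, str]) -> List[str]:
    """Names that should fail the CI job: new failures and stale XFAILs."""
    return sorted(n for n, s in results.items() if s in ("FAIL", "XPASS"))


if __name__ == "__main__":
    # Stand-in commands so the sketch runs without IREE installed.
    demo = {
        "passes": [sys.executable, "-c", "pass"],
        "regressed": [sys.executable, "-c", "raise SystemExit(1)"],
    }
    outcome = run_snapshot(demo, xfails=set())
    for test_name, status in sorted(outcome.items()):
        print(f"{status:6} {test_name}")
    sys.exit(1 if needs_attention(outcome) else 0)
```

Treating XPASS as needing attention means a fix that lands shows up right away as a stale XFAIL entry, which should keep the weekly build cop task in step 3 mostly mechanical.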
What do folks think? I'm happy to do this once we agree on the end state we want.