Hi everyone,
For the past few months we have been working on an ML framework leveraging StableHLO and XLA.
Recently, we started digging the custom calls topic and found out about the upcoming typed FFI API. It seems that progress is being made but is there an ETA/roadmap for that feature ?
In parallel, we've been patching StableHLO and XLA internally to run some tests and see what was possible.
Without too much work, we ended up having the custom calls work on CPU and GPU with the new typed FFI but some features are not working yet. Namely, accessing user data and allocating/freeing device memory is triggering a segfault. This was to be expected given that we had to patch stuff but we just wanted to be sure we are doing things right and understand how it will work eventually.
Do we agree that the expected way is to use the ExecuteContext through PJRT and then retrieve the user data in the custom call through the XLA_FFI_ExecutionContext_Get callback ?
Thanks a lot and keep up the good work !