Hi Joris,
+Mike Tyka who created those TFLite demos. Glad to hear those models have been useful for you!
What I'd recommend is training new versions of the models tailored for your use case. So, definitely remove the velocity prediction head, and play around with removing the LSTM layer or onset head. Then convert the trained model to TFLite, possibly using the quantization techniques others mentioned. Unfortunately, we don't have the time to support this effort, but everything you need to get started should be in the open source code.
Best of luck with your project!
-Fjord