We would also like to inform everyone about our recent work on reducing the memory footprint of the CROSS signature scheme [1].
The large memory footprint previously made schemes like CROSS impractical to implement on small microcontrollers such as the Cortex-M4.
We propose techniques such as on-the-fly hashing, optimised GGM and Merkle tree algorithms, and optimised variable usage. With these optimisations, all versions of the CROSS signature scheme now run on the Cortex-M4.
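
For readers unfamiliar with this kind of memory optimisation, here is a small generic sketch of one well-known idea in this family: computing a Merkle root while holding only a stack of O(log n) digests instead of the whole tree. This is not the code from our implementation and not the exact tree construction used in CROSS. The function names, the domain-separation bytes, the 32-byte SHAKE-256 digests and the power-of-two leaf count are all assumptions made purely for illustration.

```python
import hashlib
from typing import Iterable, List, Tuple

DIGEST_LEN = 32  # assumed digest length, for illustration only


def leaf_hash(data: bytes) -> bytes:
    # Hypothetical leaf hash with a one-byte domain separator.
    return hashlib.shake_256(b"\x00" + data).digest(DIGEST_LEN)


def node_hash(left: bytes, right: bytes) -> bytes:
    # Hypothetical inner-node hash with a different domain separator.
    return hashlib.shake_256(b"\x01" + left + right).digest(DIGEST_LEN)


def merkle_root_full(leaves: List[bytes]) -> bytes:
    """Baseline: keeps a whole tree level in memory at once."""
    level = [leaf_hash(x) for x in leaves]
    while len(level) > 1:
        level = [node_hash(level[i], level[i + 1])
                 for i in range(0, len(level), 2)]
    return level[0]


def merkle_root_streaming(leaves: Iterable[bytes]) -> bytes:
    """Consumes leaves one at a time, keeping at most one pending
    digest per tree height on the stack."""
    stack: List[Tuple[int, bytes]] = []  # (height, digest)
    for data in leaves:
        height, digest = 0, leaf_hash(data)
        # Merge with the top of the stack while both subtrees
        # have the same height.
        while stack and stack[-1][0] == height:
            _, left = stack.pop()
            digest = node_hash(left, digest)
            height += 1
        stack.append((height, digest))
    if len(stack) != 1:
        raise ValueError("number of leaves must be a power of two")
    return stack[0][1]
```

Both functions return the same root, but the streaming variant can take its leaves from a generator, so each leaf can be produced, hashed and discarded in turn rather than stored.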
We propose several levels of optimisation. Our smallest implementation needs 15 KB to 111 KB of memory for Sign, a 92% reduction relative to the reference implementation, albeit with some trade-off in run time. The balanced implementation reduces the memory footprint of Keygen/Sign/Verify by 95%/61%/85%, while maintaining or even improving speed, with gains of 0.2% to 33%.
We will submit our code as a pull request to the pqm4 library soon.
[1] https://eprint.iacr.org/2024/1929