Topic: ed448 space/performance
Working with bootloaders for an embedded system.
We wish to use ED448 for signature verification of appplications since we won't be able to replace bootloaders in the field and the product has a long time to end-of-life.
Without ED448_SMALL I managed to squeeze the bootloader into 115 KB on a STM32H7@480 MHz. Signature verification took 12 ms which is excellent.
Memory footprint will get us in trouble though when we implement bootladers on a STM32G4 and a STM32L0.
With ED448_SMALL bootloader size is reduced to 43 KB. Difference in size is 72 KB due to this flag. But performance is reduced to 609 ms, i.e. 50 times slower.
Startup requirements will cause trouble in the less performant microcontrollers.
Have you considered some intermediate optimization? There are papers describing implementations using the Karatsuba fast multiplication, e.g.