Dear FFTW users,

Version 3.3.3 of FFTW is now available from the FFTW web page
(www.fftw.org). This version improves performance on x86+AVX and on
ARM+NEON. In addition, we fixed a deadlock in the MPI implementation.

Regards,
Matteo Frigo

Changes since 3.3.2:

* Fix deadlock bug in MPI transforms (thanks to Michael Pippig for the
bug report and patch, and to Graham Dennis for the bug report).
* Use 128-bit ARM NEON instructions instead of 64-bit ones. This change
appears to speed up even ARM processors with a 64-bit NEON pipe.
* Speed improvements for single-precision AVX.
* Speed up planner on machines without "official" cycle counters, such
as ARM.