FFTW 3.3.3 available; faster ARM NEON and x86 AVX; MPI bugfix.

Matteo Frigo

Nov 25, 2012, 4:12:34 PM
to fftw-announce
Dear FFTW users,

Version 3.3.3 of FFTW is now available from the FFTW web page
(www.fftw.org). This version improves performance on x86+AVX and on
ARM+NEON. In addition, we fixed a deadlock in the MPI implementation.

Regards,
Matteo Frigo

Changes since 3.3.2:

* Fix deadlock bug in MPI transforms (thanks to Michael Pippig for the
bug report and patch, and to Graham Dennis for the bug report).

* Use 128-bit ARM NEON instructions instead of 64-bit ones. This change
appears to speed up even ARM processors with a 64-bit NEON pipe.

* Speed improvements for single-precision AVX.

* Speed up planner on machines without "official" cycle counters, such
as ARM.