Hi Dahua,
On AMD machines Intel libraries do not perform well, so Yeppp! is always faster (and it is also faster than AMD LibM on those machines).
Relative performance of vector elementary (mathematical) functions depends on the microarchitecture: the algorithms employed in Yeppp! are very friendly to wide SIMD units. On Haswell (256-bit units + FMA) Yeppp! is faster than VML, on Sandy Bridge (256-bit units, no FMA) they are about on par, on Nehalem (128-bit units, no FMA) Yeppp! is between 4-ULP and 1-ULP versions of VML. IPP is slower than MKL and Yeppp! on Intel platforms.
Attached is raw performance data from mid-November 2013 with the latest versions of libraries available at the time (AMD LibM 3.1, SLEEF 2.80, MKL 11.1.0, IPP 8.0.1, FDLibM 5.3 (OpenLibM is based on this library), Cephes 2.7, CRLibM from CVS, last commit on April 11, 2011). SLEEF, FDLibM, Cephes, CRLibM were recompiled for each processor with gcc-4.8.1 -O3 -march=native.
If you want to benchmark the libraries on your own machine, checkout the
Hysteria utility from BitBucket.
Regards,
Marat