--
You received this message because you are subscribed to the Google Groups "OpenBLAS-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openblas-user...@googlegroups.com.
To post to this group, send email to openbla...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
please set the environment before your test.export OPENBLAS_NUM_THREADS=1
Hi Xianyi,Thanks for your reply.After limiting the number of threads to 1, both versions perform the same.When setting the number of threads to 4, the debian packaged version gets improved, however the locally compiled version performs the same as before.
the output of 'make' is:OpenBLAS build complete.OS ... LinuxArchitecture ... x86_64BINARY ... 64bitC compiler ... GCC (command line : gcc)Fortran compiler ... GFORTRAN (command line : gfortran)Library Name ... libopenblas_sandybridgep-r0.2.8.a (Multi threaded; Max num-threads is 4)
When the number of threads is changed from 1 to 4, the performance should be better in a i7 CPU, right?It wasn't in my test, but the precompiled debian version gets better as expected.Now my question is, does building with just 'make' command (without any optional flag) result optimized binary?