Dear Wayne,
To add a remark on the speedup you observed and your expectation, you should keep in mind that vectorization only helps in case there are no other bottlenecks in the code, like bandwidth from main memory or from a particular cache level, how large the proportion of time is when accessing the solution vectors via indirect addressing (VectorizedArray::gather), and similar bottlenecks. Finally, not all CPUs have the same number of AVX-512 execution units than scalar/SSE2 execution units. You need to look up these specifications, too.
Your approach to check more of the spectrum in vectorization is a
step in the direction of building a performance model that helps
you understand performance.
Best,
Martin
--
The deal.II project is located at http://www.dealii.org/
For mailing list/forum options, see https://groups.google.com/d/forum/dealii?hl=en
---
You received this message because you are subscribed to the Google Groups "deal.II User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dealii+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dealii/767b397f-7847-4e5b-ac63-95b2aad49da2n%40googlegroups.com.