Am I reading agner.org correctly that Nehalem, Silvermont and
Bulldozer made movdqa obsolete, in the sense that movdqu is no worse
than movdqa on those and later microarchitectures?
Do we have numbers on how our installed base is split between
pre-Nehalem/Silvermont/Bulldozer microarchitectures and
Nehalem/Silvermont/Bulldozer or later?
How much do we care about performance on microarchitectures earlier
than Nehalem/Silvermont/Bulldozer? Would we accept a perf regression
on pre-Nehalem/Silvermont/Bulldozer microarchitectures in exchange for
code that is more readable, smaller and, on
Nehalem/Silvermont/Bulldozer or later, potentially a bit faster?
(Notably, Core 2 Duo / Penryn at least used to be pretty popular and
predates Nehalem.)
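To make the "more readable, smaller" side of the trade-off concrete,
here is a minimal sketch of a loop that uses only unaligned loads
(_mm_loadu_si128, which compiles to movdqu). The function name
is_ascii_unaligned and the ASCII-check task are hypothetical, chosen
only for illustration. The scalar fallback is there so the sketch
still builds where SSE2 is unavailable.

```c
#include <stddef.h>
#include <stdint.h>
#if defined(__SSE2__)
#include <emmintrin.h>
#endif

/* Hypothetical helper for illustration only: returns 1 if every byte
   in buf[0..len) is below 0x80, otherwise 0. */
int is_ascii_unaligned(const uint8_t *buf, size_t len) {
    size_t i = 0;
#if defined(__SSE2__)
    /* No scalar prologue to reach a 16-byte boundary:
       _mm_loadu_si128 (movdqu) accepts any address. */
    for (; i + 16 <= len; i += 16) {
        __m128i chunk = _mm_loadu_si128((const __m128i *)(buf + i));
        /* movemask gathers the high bit of each of the 16 bytes. */
        if (_mm_movemask_epi8(chunk) != 0) {
            return 0;
        }
    }
#endif
    /* Scalar tail (or the whole buffer when SSE2 is unavailable). */
    for (; i < len; i++) {
        if (buf[i] & 0x80) {
            return 0;
        }
    }
    return 1;
}
```

A version built on _mm_load_si128 (movdqa) would need an extra scalar
loop before the vector loop to advance to a 16-byte-aligned address,
because movdqa faults on a misaligned operand. That prologue is the
code the unaligned variant gets to drop.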
--
Henri Sivonen
hsiv...@hsivonen.fi
https://hsivonen.fi/