> For your comparison to be valid, it must have a single point of
> difference. You have at least five points of difference:
> a different operating system, a different compiler, a different
> processor, a different I/O controller and a different storage
> device.
The compiler doesn't matter. The code's performance is determined
by the efficiency of the operating system. The I/O controller
doesn't matter since I asked for repeated results from the cache.
But comparing different CPUs under different operating systems
is still meaningful. I compared a Phenom X4 945 4-core with a
Ryzen Threadripper 3990X 64-core and the first one is faster
up to a block size of 8kB. This is mostly rooted in the
efficiency of the operating system, since the Phenom is faster
over part of the range.
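
For anyone who wants to repeat the measurement, here is a minimal
sketch of the kind of test I mean, not the exact code I ran. It is
in Python, so interpreter overhead is included in every number. The
names time_cached_reads and run_benchmark, the 8 MiB file size and
the block sizes are all assumptions chosen for illustration. The
file is written once and read back several times, so the reads
should normally be served from the cache and not from the storage
device.

```python
import os
import tempfile
import time


def time_cached_reads(path, block_size, repeats=3):
    """Return the best wall-clock time in seconds to read `path`
    in chunks of `block_size` bytes."""
    best = float("inf")
    for _ in range(repeats):
        # buffering=0 gives a raw file object, so each read() call
        # is passed to the operating system as one kernel call.
        with open(path, "rb", buffering=0) as f:
            start = time.perf_counter()
            while f.read(block_size):
                pass
            best = min(best, time.perf_counter() - start)
    return best


def run_benchmark(file_size=8 * 1024 * 1024,
                  block_sizes=(512, 4096, 8192, 65536)):
    """Time reads of a scratch file for each block size.

    Returns a dict mapping block size to best time in seconds.
    The file size and block sizes are illustrative assumptions.
    """
    fd, path = tempfile.mkstemp()  # hypothetical scratch file
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(os.urandom(file_size))
        # Warm-up pass so the timed passes should hit the cache.
        time_cached_reads(path, 65536, repeats=1)
        return {bs: time_cached_reads(path, bs) for bs in block_sizes}
    finally:
        os.remove(path)


if __name__ == "__main__":
    for bs, secs in run_benchmark().items():
        print(f"{bs:>6} B blocks: {secs * 1e3:8.2f} ms")
```

With small blocks the run time is dominated by the number of
kernel calls, which is why the small block sizes are the
interesting ones when comparing operating systems.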
> It's likely that the prime difference is that Linux has a much
> better I/O subsystem than Windows.
Of course - and kernel-calls are supposed to be more efficient.