I wish you luck with all this. It seems as a lot of work and headache. Maybe, with Apple CPUs, possibly ARM CPUs on Linux servers and AMD CPUs, it could be interesting to also switch from Intel MKL to OpenBLAS (for AMD, the best option would be LibM in AOCL, but it is surely not desirable to keep and develop several solvers on several math libraries). Intel has been ordered by court several times to switch off the performance locks like e.g. AVX2 for only Intel CPUs, but we know they have not done it. (not saying AVX2 is any huge advantage in FDS)
With the GPU utilisation, I am very curious about the results. From some scientific papers, it seems very pomising, but I wonder how effective it will be with FDS. From my experience from ANSYS, it is beneficial particularly on smaller tasks, otherwise I would need a real high-end with a lot of VRAM.
I could help you with some testing if I am of any use, but I am no proper IT guy. If you need some testing, I have access to:
Old MacBook – Intel (Haswell, I think. 2 cores)
Linux – AMD Renoir / NVidia RTX (6GB VRAM)
Windows – AMD Renoir / NVidia RTX (6GB VRAM)
Windows – Intel 9th Gen (Coffee Lake Refresh, I think) / NVidia Quadro p620
Windows Server VR – Intel Xeon Gold 6148 (or 6152 I am not sure now)
Unfortunately, I don't have access to the new M1 from Apple as of now
...
Dne středa 12. ledna 2022 v 3:58:52 UTC+13 uživatel Kevin napsal: