Hi, I have seen the document "Implementing Singular Value
and Symmetric/Hermitian Eigenvalue
Solvers" and find there is only a single node compare for scalapck. And the performace has a slight improve. What is the situation when scale to a large nodes like 128, 256 or 512? Does slate still obtain performance increase and how much is the performance?
We are currently wish to port our code to solve eigenvalue to GPU and wish to know whether now we can expect a large performance increase from slate? Or maybe we just stay to the scalapack now?
Thank you!
Runfeng Jin