Performance of the eigenvalue solver


Runfeng Jin

Feb 6, 2024, 11:40:00 AM
to SLATE User
Hi, I have read the document "Implementing Singular Value and Symmetric/Hermitian Eigenvalue Solvers" and found that it only compares against ScaLAPACK on a single node, where SLATE shows a slight performance improvement. What happens when scaling to a large number of nodes, such as 128, 256, or 512? Does SLATE still deliver a performance gain, and if so, how large is it?

We currently want to port our eigenvalue-solver code to GPUs and would like to know whether we can expect a large performance increase from SLATE at this point, or whether we should stay with ScaLAPACK for now.

Thank you!

Runfeng Jin

Mark Gates

Feb 6, 2024, 3:00:07 PM
to Runfeng Jin, SLATE User
Unfortunately, the eigensolver & SVD codes in SLATE do not yet scale to multiple nodes. The main bottleneck is the 2nd stage band-to-(bi or tri)-diagonal, which is currently multi-threaded but not parallelized with MPI.
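To illustrate why a stage that runs on a single node limits multi-node scaling, here is a minimal sketch based on Amdahl's law. The function name `amdahl_speedup` and the 20% share assumed for the second stage are hypothetical and chosen only for illustration; they are not SLATE measurements.

```python
def amdahl_speedup(nodes: int, fixed_fraction: float) -> float:
    """Speedup over one node when `fixed_fraction` of the single-node
    runtime does not get faster as more nodes are added (Amdahl's law)."""
    if nodes < 1:
        raise ValueError("nodes must be >= 1")
    if not 0.0 <= fixed_fraction <= 1.0:
        raise ValueError("fixed_fraction must be in [0, 1]")
    return 1.0 / (fixed_fraction + (1.0 - fixed_fraction) / nodes)


# Hypothetical assumption: the band-to-tridiagonal stage takes 20% of the
# single-node runtime and stays on one node; everything else scales ideally.
STAGE2_FRACTION = 0.20

if __name__ == "__main__":
    for nodes in (1, 128, 256, 512):
        speedup = amdahl_speedup(nodes, STAGE2_FRACTION)
        print(f"{nodes:4d} nodes: {speedup:.2f}x")
```

Under this assumption the speedup can never exceed 1 / 0.20 = 5x, regardless of the node count, which is why a stage that is not parallelized with MPI dominates at scale.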

Mark

Interim Director, Innovative Computing Laboratory (ICL)
Research Assistant Professor, University of Tennessee, Knoxville

Runfeng Jin

Feb 6, 2024, 8:09:34 PM
to SLATE User, mga...@icl.utk.edu, SLATE User, Runfeng Jin
Hi, thank you for your answer! Can you suggest an eigensolver for dense matrices that computes all eigenvalues and supports both multiple nodes and AMD GPUs? By the way, I have found many solvers that support GPUs, but all of them are either single-node or for sparse matrices. Is diagonalizing a dense matrix for all eigenvalues on multi-node GPU systems still an open task?

Chris Hillenbrand

Mar 11, 2024, 9:11:31 PM
to SLATE User, jsfa...@gmail.com, mga...@icl.utk.edu, SLATE User
ELPA has an experimental option to compile for AMD GPUs: https://github.com/marekandreas/elpa/blob/master/documentation/INSTALL.md

Does anyone know whether the DPLASMA eigensolver works on multiple nodes with GPUs? (It seems to be NVIDIA only.)

Mark Gates

Mar 20, 2024, 12:49:06 PM
to Chris Hillenbrand, SLATE User, jsfa...@gmail.com
The DPLASMA eigensolver is not GPU accelerated. I inquired with the DPLASMA developers, who said GPU acceleration covers potrf, trsm, trmm, gemm, LU-nopiv, and QR. GPU support is currently only CUDA.

Mark