Is solve_dense() parallelizable?

Rainbow Star

unread,

Jun 9, 2026, 6:33:47 AMJun 9

to Dedalus Users

Hello everyone,

When trying to solve a problem. I have noticed a large difference in performance between solve_sparse() and solve_dense() when running on a cluster.

The sparse solver appears to benefit from MPI parallelization and scales reasonably well with increasing core count. However, when using solve_dense(), increasing the number of MPI ranks does not seem to reduce the runtime significantly. From the behavior, it appears that the dense eigensolve may effectively be running on a single process after the matrices are assembled.

Is this the expected behavior of solve_dense()? Does the dense solver currently use a distributed eigensolver, or is the dense matrix gathered onto a single rank before diagonalization?

If the latter, is there any recommended way to parallelize dense eigenvalue solves in Dedalus, or any support for distributed dense eigensolver backends?

Thank you for any clarification.

Calum Skene

unread,

Jun 9, 2026, 7:09:32 AMJun 9

to Dedalus Users

Hi

Both dense and sparse solvers solve eigenvalue problems for a specific subproblem on a single core. The matrices are not distributed, and neither is the eigensolver.

https://github.com/DedalusProject/dedalus/blob/2a2ad15108f2cd2d24d0b103f8167962562b5cbc/dedalus/core/solvers.py#L180

I am not sure why it would scale with MPI ranks in either case, or how it would even work unless you are looping over all subproblems a core owns (which is distributed).

Am I missing something?

Best,

Calum

Keaton Burns

unread,

Jun 9, 2026, 8:12:45 AMJun 9

to dedalu...@googlegroups.com

That’s right, the solves themselves are not parallelized except if you’re manually looping over all subproblems. Another possibility is that transform construction is being triggered somewhere, and building the local (rather than global) transform matrices will be faster under MPI. What size problem are you looking at?

--
You received this message because you are subscribed to the Google Groups "Dedalus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dedalus-user...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dedalus-users/a457d054-1d44-4111-b83d-2c1f71f5a34an%40googlegroups.com.

Message has been deleted

Rainbow Star

unread,

Jun 11, 2026, 2:54:35 AMJun 11

to Dedalus Users

Thanks Calum and Keaton for your replies. I had misattributed the scaling.

For context on size: it's a fluid dynamics eigenvalue problem in a spherical shell, solved by subproblems in , at N_theta = N_r = 64.

With solve_sparse I don't run into this issue, but with solve_dense the single-subproblem solve becomes the wall-clock bottleneck and doesn't improve with more cores — it appears to run on a single core even when I assign ~100 cores to one . Is there any recommended way to speed up solve_dense for a subproblem of this size?

Thanks again.

Keaton Burns

unread,

Jun 11, 2026, 10:17:30 AMJun 11

to dedalu...@googlegroups.com

Hi Arnab,

The underlying scipy/arpack eigenvalue routines are not MPI parallelized, so adding more cores doesn’t change anything there. You may see a slight improvement by using more threads and a single core, depending on your LAPACK/BLAS implementation. You can look at the solve_dense routine, it’s quite simple, and you should be able to extract the matrices being solved if you want to try an alternative eigenvalue routine from another library.

Best,

-Keaton

On Jun 11, 2026 at 12:53:38 AM, Arnab Pradhan <arnabpra...@gmail.com> wrote:

Thanks Calum and Keaton for your replies. I had misattributed the scaling.

For context on size: it's a fluid dynamics eigenvalue problem in a spherical shell, solved by subproblems in , at N_theta = N_r = 64.

With solve_sparse I don't run into this issue, but with solve_dense the single-subproblem solve becomes the wall-clock bottleneck and doesn't improve with more cores — it appears to run on a single core even when I assign ~100 cores to one . Is there any recommended way to speed up solve_dense for a subproblem of this size?

Thanks again.

On Tuesday, 9 June 2026 at 17:42:45 UTC+5:30 keaton...@gmail.com wrote:

To view this discussion visit https://groups.google.com/d/msgid/dedalus-users/ffd2c443-6696-44a0-8c6f-affaee46e495n%40googlegroups.com.

Reply all

Reply to author

Forward