Since I don't have an NVIDIA GPU, I'm unable to take advantage the parallel computations with CUDA.
I remember reading in the post at [1] support for CUDA 10.
For your development, it might be helpful for others if you could support the newer CUDA Toolkits [2] for 11.x and 12.x .
In the CUDA branch makefile at [3,] or the past post at [4], you should see that the makefile currently only supports up to Compute Capability 7.5.
So for your development, it might be helpful to others if you could also support NVIDIA GPU with higher Compute Capability (8.0 - 12.0 seen at [5]).
Sincerely,
Gavin
VAMPIRE user