Dear Guido and Wolfgang,
thanks a lot for your answers.
Writing some wrapper functions around the cuSPARSE and cuBLAS libraries from
CUDA should not be a problem, and the same goes for using the CUDA
functionality in PETSc and Trilinos; the real question, of course, is
performance. Just as a side note, PETSc does not seem to work with CUDA 5.0
and gcc 4.6.2 (maybe it does with CUDA 4.1); a lot is going on there as well.
Apart from an "efficient" implementation, the problem I see is the size of
the matrices we are dealing with. As long as the GPU can work from local
memory it will be very fast, but if the objects are too large and global
memory has to be used as well, I am not sure there is much of a performance
gain over a usual implementation. That's why my first idea was to use GPUs
in a matrix-free solver, but the more I think about the details, the more
difficult it gets, especially on a distributed triangulation ...
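To make clear what I mean by "matrix-free": the solver only ever needs the
action of the operator on a vector, so the matrix itself never has to be
stored or moved around. Below is a minimal CPU-only C++ sketch of that idea.
It is not deal.II code and not a GPU implementation; the 1D Laplacian stencil
and the names apply_laplacian, dot and cg_solve are just assumptions I made up
for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical example operator: 1D finite-difference Laplacian with the
// stencil (-1, 2, -1) and homogeneous Dirichlet boundaries. The matrix is
// never assembled; only its action y = A*x is computed.
void apply_laplacian(const std::vector<double> &x, std::vector<double> &y)
{
  assert(x.size() == y.size());
  const std::size_t n = x.size();
  for (std::size_t i = 0; i < n; ++i)
    {
      const double left  = (i > 0) ? x[i - 1] : 0.0;
      const double right = (i + 1 < n) ? x[i + 1] : 0.0;
      y[i] = 2.0 * x[i] - left - right;
    }
}

// Plain dot product of two vectors of equal length.
double dot(const std::vector<double> &a, const std::vector<double> &b)
{
  assert(a.size() == b.size());
  double sum = 0.0;
  for (std::size_t i = 0; i < a.size(); ++i)
    sum += a[i] * b[i];
  return sum;
}

// Unpreconditioned conjugate gradient method starting from x = 0.
// The operator enters only through apply_laplacian(), i.e. matrix-free.
// Returns the number of iterations that were performed.
unsigned int cg_solve(const std::vector<double> &b,
                      std::vector<double> &      x,
                      const double               tol,
                      const unsigned int         max_iter)
{
  const std::size_t n = b.size();
  x.assign(n, 0.0);

  std::vector<double> r(b);  // residual r = b - A*x, with x = 0
  std::vector<double> p(b);  // search direction
  std::vector<double> Ap(n); // operator applied to p

  double       rr = dot(r, r);
  unsigned int it = 0;
  while (it < max_iter && std::sqrt(rr) > tol)
    {
      apply_laplacian(p, Ap);
      const double alpha = rr / dot(p, Ap);
      for (std::size_t i = 0; i < n; ++i)
        {
          x[i] += alpha * p[i];
          r[i] -= alpha * Ap[i];
        }
      const double rr_new = dot(r, r);
      const double beta   = rr_new / rr;
      for (std::size_t i = 0; i < n; ++i)
        p[i] = r[i] + beta * p[i];
      rr = rr_new;
      ++it;
    }
  return it;
}
```

Calling cg_solve(b, x, 1e-10, 1000) with a right-hand side b fills x with the
approximate solution. The only data touched per iteration are a handful of
vectors, which is what made this approach look attractive to me for the GPU;
how to do the same on a distributed triangulation is exactly the part I have
not worked out.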
Guido, if I can help with something, implementing some wrapper functions or
whatever, let me know.
Best,
Martin
-----Original Message-----
From: Wolfgang Bangerth
Sent: Thursday, May 16, 2013 8:19 PM
To: dea...@googlegroups.com
Subject: Re: [deal.II] Porting deal.II to GPUs