Complex bicgstab on gpu

Elliot

unread,

Nov 8, 2010, 8:55:01 PM11/8/10

to cusp-users

Hi,

I'm new to gpu programming and am trying to use cusp to speed up some
large complex matrix solves for matrices in csr format. As a first
step I had encouraging results in double precision for the poisson
equation example. For a 1000 x 1000 grid I was seeing a factor of 8
speedup on the gpu. Now I'm trying to get things working on a simple
complex example. I'm not 100% sure what has been implemented for
complex.

For my simple example I'm trying to solve the
coordinate_complex_general matrix (\cusp-library\testing\data\test)
example using a right hand side of 1+i. When I try running it on the
host I get a result which agrees with Matlab. When I try running it on
the gpu the program crashes with both the diagonal and identity
preconditioners. I'm compiling it with -arch sm_13.

Any suggestions would be greatly appreciated

Thanks

Elliot

#include <cusp/precond/diagonal.h>
#include <cusp/dia_matrix.h>
#include <cusp/csr_matrix.h>
#include <cusp/io/matrix_market.h>
#include <cusp/krylov/bicgstab.h>
#include <iostream>
#include <cusp/complex.h>
#include <cusp/blas.h>

typedef cusp::device_memory DM; // define Device Memory
typedef cusp::host_memory HM; // define Host Memory
typedef cusp::complex<double> Complex;

int main(void)
{
cusp::csr_matrix<int, Complex, HM> mycsr; // allocate memory
cusp::io::read_matrix_market_file(mycsr, "A.mtx"); //get a
csr_matrix in host memory

cusp::dia_matrix<int,Complex,HM>A(mycsr); //switch to dia_matrix
format
cusp::array1d<Complex, HM> x_host(A.num_rows, Complex(0,0));
cusp::array1d<Complex, HM> b_host(A.num_rows, Complex(1,1));
cusp::verbose_monitor<Complex> monitor_host(b_host, 100, 1e-5);
cusp::precond::diagonal<Complex, HM> M_host(A);
cusp::krylov::bicgstab(A, x_host, b_host, monitor_host, M_host);
cusp::print_matrix(x_host);

cusp::dia_matrix<int,Complex,DM>D(mycsr); //have dia_matrix in
device memory
cusp::array1d<Complex, DM> x(D.num_rows, Complex(0,0));
cusp::array1d<Complex, DM> b(D.num_rows, Complex(1,1));
cusp::verbose_monitor<Complex> monitor(b, 100, 1e-5);
cusp::precond::diagonal<Complex, DM> M(D);
cusp::krylov::bicgstab(D, x, b, monitor, M);

return 0;
}

Filipe Maia

unread,

Nov 8, 2010, 10:59:08 PM11/8/10

to cusp-...@googlegroups.com

Hi,

Here the device part of the code doesn't even compile.

I get:

./cusp/detail/device/spmv/dia.h(87): error: ambiguous "?" operation: second operand of type "int" can be converted to third operand type "Complex", and vice versa

What CUDA/nvcc version do you have?

Cheers,

Filipe

--
You received this message because you are subscribed to the Google Groups "cusp-users" group.
To post to this group, send email to cusp-...@googlegroups.com.
To unsubscribe from this group, send email to cusp-users+...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cusp-users?hl=en.

Filipe Maia

unread,

Nov 8, 2010, 11:04:42 PM11/8/10

to cusp-...@googlegroups.com

I opened an issue about it [1].

Try the patch I uploaded there.

[1] - http://code.google.com/p/cusp-library/issues/detail?id=44

Luca

unread,

Nov 9, 2010, 9:24:11 AM11/9/10

to cusp-users

I am also testing the complex version ... I really need it.

I can compile with matrices of type cusp::ell_matrix<int,
cusp::complex<float>, cusp::device_memory> and use the multiply
algorithm but the result is wrong. It looks like the imaginary part is
set to zero in the matrix. Is this related to the ongoing effort to
implement complex numbers?

Cheers,
Luca

ps: I use CUDA 3.2 with arch=sm_21

On Nov 9, 4:59 am, Filipe Maia <filipe.c.m...@gmail.com> wrote:
> Hi,
>
> Here the device part of the code doesn't even compile.
> I get:
>
> ./cusp/detail/device/spmv/dia.h(87): error: ambiguous "?" operation: second
> operand of type "int" can be converted to third operand type "Complex", and
> vice versa
>
> What CUDA/nvcc version do you have?
>
> Cheers,
> Filipe
>

> > cusp-users+...@googlegroups.com<cusp-users%2Bunsu...@googlegroups.com>
> > .

Luca

unread,

Nov 9, 2010, 11:30:23 AM11/9/10

to cusp-users

Sorry I forgot to mention that I first define a matrix
cusp::coo_matrix <int, cusp::complex<float>, cusp::host_memory> which
I then convert to ell_matrix on the device.

If I output the coo_matrix with cusp::io::write_matrix_market_file,
this is where the imaginary part is already lost.

Cheers,
Luca

Filipe Maia

unread,

Nov 9, 2010, 11:46:10 AM11/9/10

to cusp-...@googlegroups.com

On Tue, Nov 9, 2010 at 08:30, Luca <cov...@gmail.com> wrote:

Sorry I forgot to mention that I first define a matrix
cusp::coo_matrix <int, cusp::complex<float>, cusp::host_memory> which
I then convert to ell_matrix on the device.

If I output the coo_matrix with cusp::io::write_matrix_market_file,
this is where the imaginary part is already lost.

It's entirely possible that things don't work with the complex numbers as all the code was written assuming reals and the complex numbers were added later.

We need more people like you that try it out and let the others know what things don't work so that they can be fixed.

To unsubscribe from this group, send email to cusp-users+...@googlegroups.com.

Filipe Maia

unread,

Nov 9, 2010, 2:51:13 PM11/9/10

to cusp-...@googlegroups.com

On Tue, Nov 9, 2010 at 08:30, Luca <cov...@gmail.com> wrote:

Sorry I forgot to mention that I first define a matrix
cusp::coo_matrix <int, cusp::complex<float>, cusp::host_memory> which
I then convert to ell_matrix on the device.

If I output the coo_matrix with cusp::io::write_matrix_market_file,
this is where the imaginary part is already lost.

Could you please post the code you are using?

To unsubscribe from this group, send email to cusp-users+...@googlegroups.com.

Elliot

unread,

Nov 9, 2010, 2:52:28 PM11/9/10

to cusp-users

Thanks for the quick reply and the patch. It seems like there is quite
a bit of interest in the complex implementation.

I downloaded your patch and tried re-running the code but it still
won't work.

The code will compile with some warnings. Two of them have to do with
dia.h

1>C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../
include\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what
pointer points to, assuming global memory space
1>C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../
include\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what
pointer points to, assuming global memory space

I'm trying to run this on a gts450 with I believe the newest version
of cuda

CUDA Driver Version: 3020
Device Number: 0
Device Name: GeForce GTS 450
Device Revision Number: 2.1
Global Memory Size: 1041694720

nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2010 NVIDIA Corporation
Built on Tue_Oct_19_02:27:10_PDT_2010
Cuda compilation tools, release 3.2, V0.2.1221

Cheers,

Elliot

Filipe Maia

unread,

Nov 9, 2010, 3:06:38 PM11/9/10

to cusp-...@googlegroups.com

On Tue, Nov 9, 2010 at 11:52, Elliot <elliot...@gmail.com> wrote:

Thanks for the quick reply and the patch. It seems like there is quite
a bit of interest in the complex implementation.

I downloaded your patch and tried re-running the code but it still
won't work.

The code will compile with some warnings. Two of them have to do with
dia.h

1>C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../
include\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what
pointer points to, assuming global memory space
1>C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../
include\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what
pointer points to, assuming global memory space

I can't reproduce these warnings on my machine. What compiler options did you use?

Also the warning seems related to the y[row] which was already there.

Elliot

unread,

Nov 9, 2010, 5:08:18 PM11/9/10

to cusp-users

So with your new patch the complex bicgstab example works for you now?

I do get a few warnings when I compile the code but nothing that
prevented linking. Things generally seemed to be working, and I didn't
understand the warnings, so I ignored the warnings ...
Here is the build and some of the warnings I get.

Cheers,

Elliot

1>Compiling with CUDA Build Rule...
1>"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\\bin
\nvcc.exe" -arch sm_13 -ccbin "C:\Program Files (x86)\Microsoft
Visual Studio 8\VC\bin" -Xcompiler "/EHsc /W3 /nologo /O2 /Zi /
MT " -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\
\include" -I"C:\ProgramData\NVIDIA Corporation\NVIDIA GPU Computing
SDK 3.2\C\common\inc" -maxrregcount=32 --compile -o "x64\Release
\sample.cu.obj" "c:\Users\Elliot\Documents\Visual Studio 2005\Projects
\CUDAWinApp2\CUDAWinApp2\sample.cu"
1>sample.cu

Some of the warnings I get

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../include
\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what pointer
points to, assuming global memory space

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../include
\cusp/detail/device/spmv/dia.h(87): Warning: Cannot tell what pointer
points to, assuming global memory space

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin/../include

\thrust/functional.h(409) : warning C4995: 'absolute_value': name was
marked as #pragma deprecated