I am having problems trying to get the LogRegCompiler example to work with CUDA. There seems to be a problem compiling the cuda code at runtime. I'm not sure what. CUDA appears to be installed properly. I am able to compile and run all the examples from NVidia. Any help is appreciated. Thanks.
$ delite LogRegCompiler input.txt test.txt -t 8 --cuda 1
== executing application: LogRegCompiler.deg input.txt test.txt
Delite Runtime executing with the following arguments:
LogRegCompiler.deg,input.txt,test.txt
Delite Runtime executing with: 8 Scala thread(s), 0 Cpp thread(s), 1 Cuda(s), 0 OpenCL(s)
/usr/local/cuda/bin/nvcc -I/home/wrightr/optiml/generatedCache/cuda/src/datastructures -I/home/wrightr/optiml/generatedCache/cpp/src/datastructures -I/home/wrightr/optiml/generatedCache/cuda/src/kernels -I/home/wrightr/optiml/generatedCache/cpp/src/kernels -I/home/wrightr/optiml/generatedCache/cuda/src/runtime -I/home/wrightr/optiml/generatedCache/cpp/src/runtime -I/usr/lib/jvm/java-7-oracle/include -I/usr/lib/jvm/java-7-oracle/include/linux -I/home/wrightr/optiml/runtime/cuda -m64 -w -O3 -lcublas -arch compute_11 -code sm_11 -shared -Xcompiler '-fPIC' -o /home/wrightr/optiml/generatedCache/cuda/bin/runtime/cudaHost.so /home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1483.cu /home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1827.cu /home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1879.cu /home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1930x2154x2119.cu /home/wrightr/optiml/generatedCache/cuda/src/runtime/While_x2040_8.cu /home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu /home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppList.cpp /home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppRef.cpp /home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppDeliteArray.cpp /home/wrightr/optiml/generatedCache/cuda/src/kernels/cudahelperFuncs.cu /home/wrightr/optiml/runtime/cuda/DeliteCuda.cu
--cuda compile args: /usr/local/cuda/bin/nvcc,-I/home/wrightr/optiml/generatedCache/cuda/src/datastructures,-I/home/wrightr/optiml/generatedCache/cpp/src/datastructures,-I/home/wrightr/optiml/generatedCache/cuda/src/kernels,-I/home/wrightr/optiml/generatedCache/cpp/src/kernels,-I/home/wrightr/optiml/generatedCache/cuda/src/runtime,-I/home/wrightr/optiml/generatedCache/cpp/src/runtime,-I/usr/lib/jvm/java-7-oracle/include,-I/usr/lib/jvm/java-7-oracle/include/linux,-I/home/wrightr/optiml/runtime/cuda,-m64,-w,-O3,-lcublas,-arch,compute_11,-code,sm_11,-shared,-Xcompiler,'-fPIC',-o,/home/wrightr/optiml/generatedCache/cuda/bin/runtime/cudaHost.so,/home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1483.cu,/home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1827.cu,/home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1879.cu,/home/wrightr/optiml/generatedCache/cuda/src/runtime/MultiLoop_GPU_Array_x1930x2154x2119.cu,/home/wrightr/optiml/generatedCache/cuda/src/runtime/While_x2040_8.cu,/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu,/home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppList.cpp,/home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppRef.cpp,/home/wrightr/optiml/generatedCache/cpp/src/datastructures/cppDeliteArray.cpp,/home/wrightr/optiml/generatedCache/cuda/src/kernels/cudahelperFuncs.cu,/home/wrightr/optiml/runtime/cuda/DeliteCuda.cu/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(61): error: identifier "recvViewCPPfromJVM_cppRef_double_" is undefined
/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(62): error: identifier "sendCuda_cudaRef_double_" is undefined
/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(84): error: identifier "recvViewCPPfromJVM_cppRef_int_" is undefined
/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(85): error: identifier "sendCuda_cudaRef_int_" is undefined
/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(87): error: identifier "recvViewCPPfromJVM_cppRef_cppDenseVectorDouble__" is undefined
/home/wrightr/optiml/generatedCache/cuda/src/runtime/Executable8.cu(88): error: identifier "sendCuda_cudaRef_cudaDenseVectorDouble__" is undefined
6 errors detected in the compilation of "/tmp/tmpxft_0000469d_00000000-86_Executable8.cpp1.ii".
Exception in thread "main" java.lang.RuntimeException: cuda compilation failed with exit value 2
at scala.sys.package$.error(package.scala:27)
at ppl.delite.runtime.codegen.CCompile$class.checkError(CCompile.scala:129)
at ppl.delite.runtime.codegen.CudaCompile$.checkError(CudaCompile.scala:17)
at ppl.delite.runtime.codegen.CCompile$class.compile(CCompile.scala:87)
at ppl.delite.runtime.codegen.CudaCompile$.compile(CudaCompile.scala:17)
at ppl.delite.runtime.codegen.CCompile$class.compile(CCompile.scala:73)
at ppl.delite.runtime.codegen.CudaCompile$.compile(CudaCompile.scala:17)
at ppl.delite.runtime.codegen.Compilers$.compileSchedule(Compilers.scala:75)
at ppl.delite.runtime.Delite$.embeddedMain(Delite.scala:119)
at ppl.delite.runtime.Delite$.main(Delite.scala:48)
at ppl.delite.runtime.Delite.main(Delite.scala)
error: Delite execution failed