I'm trying to get my Win7 x64 working with Theano + CUDA GPU support. I followed the tutorial for how to setup CUDA on Windows (installed 64bit drivers, but 32bit Toolkit and SDK).I have Python 2.7.3 32-bit witn NumPy and SciPy installed. I also use Visual Studio 2012I can run the 32-bit CUDA samples - for example here is the output of the deviceQuery.exeC:\ProgramData\NVIDIA Corporation\CUDA Samples\v5.0\bin\win32\Release>deviceQuery.exedeviceQuery.exe Starting...CUDA Device Query (Runtime API) version (CUDART static linking)Detected 1 CUDA Capable device(s)Device 0: "GeForce GTX 680"CUDA Driver Version / Runtime Version 5.0 / 5.0CUDA Capability Major/Minor version number: 3.0Total amount of global memory: 2048 MBytes (2147483648 bytes)( 8) Multiprocessors x (192) CUDA Cores/MP: 1536 CUDA CoresGPU Clock rate: 1085 MHz (1.08 GHz)Memory Clock rate: 3004 MhzMemory Bus Width: 256-bitL2 Cache Size: 524288 bytesMax Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536,65536), 3D=(4096,4096,4096)Max Layered Texture Size (dim) x layers 1D=(16384) x 2048, 2D=(16384,16384) x 2048Total amount of constant memory: 65536 bytesTotal amount of shared memory per block: 49152 bytesTotal number of registers available per block: 65536Warp size: 32Maximum number of threads per multiprocessor: 2048Maximum number of threads per block: 1024Maximum sizes of each dimension of a block: 1024 x 1024 x 64Maximum sizes of each dimension of a grid: 2147483647 x 65535 x 65535Maximum memory pitch: 2147483647 bytesTexture alignment: 512 bytesConcurrent copy and kernel execution: Yes with 1 copy engine(s)Run time limit on kernels: YesIntegrated GPU sharing Host Memory: NoSupport host page-locked memory mapping: YesAlignment requirement for Surfaces: YesDevice has ECC support: DisabledCUDA Device Driver Mode (TCC or WDDM): WDDM (Windows Display Driver Model)Device supports Unified Addressing (UVA): NoDevice PCI Bus ID / PCI location ID: 1 / 0Compute Mode:< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 5.0, CUDA Runtime Version = 5.0, NumDevs = 1, Device0 = GeForce GTX 680Testing Theano with GPU per instructions here (http://deeplearning.net/software/theano/tutorial/using_gpu.html#using-gpu) I get:>>>WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded.WARNING (theano.sandbox.cuda): CUDA is installed, but device gpu is not available[Elemwise{exp,no_inplace}(<TensorType(float32, vector)>)]Looping 1000 times took 3.45799994469 secondsResult is [ 1.23178029 1.61879337 1.52278066 ..., 2.20771813 2.299677611.62323284]Used the cpuThe g++ error I understand as I haven't installed MinGW (but it's not listed anywhere as being mandatory - is it?)Here is my .theanorc:[global]floatX = float32device = gpu[nvcc]flags=-LC:\Python27\libscompiler_bindir=C:\Program Files (x86)\Microsoft Visual Studio 11.0\VC\binfastmath = TrueThoughts?--
---
You received this message because you are subscribed to the Google Groups "theano-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to theano-dev+...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.