I used Numba to get some system info (see below). Looks like the cublas library is working fine.
System info:
--------------------------------------------------------------------------------
__Time Stamp__
Report started (local time) : 2020-09-26 00:17:34.207700
UTC start time : 2020-09-26 04:17:34.207709
Running time (s) : 3.237414
__Hardware Information__
Machine : x86_64
CPU Name : haswell
CPU Count : 20
Number of accessible CPUs : 20
List of accessible CPUs cores : 0-19
CFS Restrictions (CPUs worth of runtime) : None
CPU Features : 64bit aes avx avx2 bmi bmi2 cmov
cx16 cx8 f16c fma fsgsbase fxsr
invpcid lzcnt mmx movbe pclmul
popcnt rdrnd sahf sse sse2 sse3
sse4.1 sse4.2 ssse3 xsave xsaveopt
Memory Total (MB) : 58373
Memory Available (MB) : 57310
__OS Information__
Platform Name : Linux-4.15.0-118-generic-x86_64-with-glibc2.10
Platform Release : 4.15.0-118-generic
OS Name : Linux
OS Version : #119-Ubuntu SMP Tue Sep 8 12:30:01 UTC 2020
OS Specific Version : ?
Libc Version : glibc 2.27
__Python Information__
Python Compiler : GCC 7.3.0
Python Implementation : CPython
Python Version : 3.8.5
Python Locale : en_US.UTF-8
__LLVM Information__
LLVM Version : 10.0.1
__CUDA Information__
CUDA Device Initialized : True
CUDA Driver Version : 9010
CUDA Detect Output:
Found 1 CUDA devices
id 0 b'Tesla K40m' [SUPPORTED]
compute capability: 3.5
pci device id: 5
pci bus id: 0
Summary:
1/1 devices are supported
CUDA Librairies Test Output:
Finding cublas from System
named libcublas.so.9.2.88
trying to open library... ok
Finding cusparse from System
named libcusparse.so.9.2.88
trying to open library... ok
Finding cufft from System
named libcufft.so.9.2.88
trying to open library... ok
Finding curand from System
named libcurand.so.9.2.88
trying to open library... ok
Finding nvvm from System
named libnvvm.so.3.2.0
trying to open library... ok
Finding cudart from System
named libcudart.so.9.2.88
trying to open library... ok
Finding libdevice from System
searching for compute_20... ok
searching for compute_30... ok
searching for compute_35... ok
searching for compute_50... ok
__ROC information__
ROC Available : False
ROC Toolchains : None
HSA Agents Count : 0
HSA Agents:
None
HSA Discrete GPUs Count : 0
HSA Discrete GPUs : None
__SVML Information__
SVML State, config.USING_SVML : False
SVML Library Loaded : False
llvmlite Using SVML Patched LLVM : True
SVML Operational : False
__Threading Layer Information__
TBB Threading Layer Available : True
+-->TBB imported successfully.
OpenMP Threading Layer Available : True
+-->Vendor: GNU
Workqueue Threading Layer Available : True
+-->Workqueue imported successfully.
__Numba Environment Variable Information__
None found.