However, it can't allocate memory on CPU host. That's why I wanted to alloc gpu memory and distribute input manually.
I can't understand why it returns that error because I have a 512 G mem in DGX server. Could you tell me why?
./testing_dsyevd --ngpu 8 -n 50000
% MAGMA 2.6.0 32-bit magma_int_t, 64-bit pointer.
Compiled with CUDA support for 6.0
% CUDA runtime 11000, driver 11000. OpenMP threads 80. MKL 2021.0.3, MKL threads 40.
% device 0: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 1: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 2: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 3: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 4: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 5: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 6: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% device 7: Tesla P100-SXM2-16GB, 1480.5 MHz clock, 16280.9 MiB memory, capability 6.0
% Tue Jul 27 15:30:28 2021
% Usage: ./testing_dsyevd [options] [-h|--help]
% jobz = No vectors, uplo = Lower, ngpu = 8
% N CPU Time (sec) GPU Time (sec) |S-S_magma| |A-USU^H| |I-U^H U|
%============================================================================
Error: magma_dmalloc_cpu( &h_A, N*lda )
failed at testing/testing_dsyevd.cpp:151: error -112: cannot allocate memory on CPU host