I don't have that version, but reducing the default internal block/panel size to 32 and recompiling might be the quickest way to resolve the issue.
--
You received this message because you are subscribed to the Google Groups "MAGMA User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+...@icl.utk.edu.
To view this discussion visit https://groups.google.com/a/icl.utk.edu/d/msgid/magma-user/5bb09449-f53f-4421-a308-422e273ee5adn%40icl.utk.edu.
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+unsubscribe@icl.utk.edu.
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+...@icl.utk.edu.
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+unsubscribe@icl.utk.edu.
Hi Natalie,
Thanks again for your help with this issue! I've made progress in investigating the problem -- I had tried to compile Magma some time ago, but it failed midway, but I still had some stuff compiled, and managed to find that function.
I ran the testing_dgeqrf_gpu program which passed all tests, even with 100×33 matrices with the -c flag and --version 1 .
I've tried to make my code match the testing code as closely as possible by
Despite these changes, the issue persists in my code. Perhaps the difference is in how it was compiled? I tried to compile just the test script, but I can't get the magma_opts to work properly.
Given that the issue occurs exactly at the 32-column boundary, it seems like it might be related to a GPU block size or memory alignment issue that's triggered under certain conditions.
Thanks! :)
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+...@icl.utk.edu.
if ( opts.version == 3 ) {
// copy diagonal blocks of R back to A
for( int i=0; i < min_mn-nb; i += nb ) {
magma_int_t ib = min( min_mn-i, nb );
magmablas_zlacpy( MagmaUpper, ib, ib, &dT[min_mn*nb + i*nb], nb, &d_A[ i + i*ldda ], ldda, opts.queue );
}
}
Then we get the A output (copied in h_R here) to be what we expect. version 1 is meant to be used to solve a system Ax = b, not to generate the Q and R matrices directly.
Sorry for the confusion. If you have a GitHub account, feel free to open an issue about the documentation/confusion (https://github.com/icl-utk-edu/magma).
If you are not a GitHub user, I can open an issue.
-- Natalie
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+unsubscribe@icl.utk.edu.
To unsubscribe from this group and stop receiving emails from it, send an email to magma-user+...@icl.utk.edu.