Hello all,
Out of curiosity, I ran some experiments on the dense matrix-vector product in Julia. My goal is to compare the performance of Julia and C, in serial and in parallel (without using BLAS/LAPACK, just the language itself).
I began with a naive implementation, then extracted the vector dot product into its own function. Comparing the naive version with the one that uses the extracted dot product gave me some interesting results:
extracted dot prod -> elapsed time: 2.0150580406188965 seconds
naive -> elapsed time: 14.084102869033813 seconds
The matrix is n*n and the vector has size n, with n=5000 for the results above. The Julia source code is attached to my post.
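In case the attachment is not visible, here is a minimal sketch of the two variants being compared. It illustrates the approach and is not the attached code verbatim: the function names (`matvec_naive`, `dot_row`, `matvec_dot`) and the smaller n used for the timing run are assumptions made for this example.

```julia
# Naive variant: both loops written inline.
function matvec_naive(A::Matrix{Float64}, x::Vector{Float64})
    n, m = size(A)
    y = zeros(n)
    for i in 1:n
        s = 0.0
        for j in 1:m
            s += A[i, j] * x[j]
        end
        y[i] = s
    end
    return y
end

# Dot product of row i of A with x, extracted into its own function.
function dot_row(A::Matrix{Float64}, x::Vector{Float64}, i::Int)
    s = 0.0
    for j in 1:length(x)
        s += A[i, j] * x[j]
    end
    return s
end

# Variant that calls the extracted dot product once per row.
function matvec_dot(A::Matrix{Float64}, x::Vector{Float64})
    n = size(A, 1)
    y = zeros(n)
    for i in 1:n
        y[i] = dot_row(A, x, i)
    end
    return y
end

# Timing run; n = 1000 keeps this quick (the results above used n = 5000).
n = 1000
A = rand(n, n)
x = rand(n)
@time matvec_naive(A, x)
@time matvec_dot(A, x)
```

Both variants accumulate in the same order, so they should return identical vectors and differ only in timing.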
I would like to improve these serial timings using some of the features Julia offers. Do you have any advice or ideas on how to speed up the matrix-vector product in Julia? And what about a parallel implementation?
Thanks.