Good afternoon!
I have tried to run the SVM example (found together with the OptiML sources). After successfully staging and producing the kernels and the .deg file, I tried to run them on 8 Scala threads. What I see is that the Delite runtime takes more than 1 hour to produce the needed files in the generatedCache/runtime directory. As far as I understand, this directory should contain the scheduled code that will be executed on the hardware. So I suspect that all this time Delite is busy with scheduling (i.e. splitting) the program across the required number of threads, because running on 1 thread goes very fast.
Is this expected behaviour, or am I doing something wrong? Is there any way to accelerate this process? Is this effect expected to disappear on subsequent runs, thanks to reuse of the already generated (cached) code?
Thank you in advance,
Alexander Filippov