I was wondering if there is any implementation of Support Vector Machines in H2O?
Been looking up Google for some time now, but I couldn't find anything comparable.
If there is no implementation, could you recommend any alternative (I'm using R by the way) that is especially capable of multithreaded SVM?
Thanks for your help,
Sarem
Hi Sarem,
The SVM algorithm has some inherent scalability challenges and although it's possible we could implement this algo in the future, we have chosen not to implement it thus far.
This post suggests that caret's SVM is multicore (can't verify this). https://geekoverdose.wordpress.com/2014/07/25/svm-classification-example-with-performance-measures-using-r-caret/ This may just mean that it's tuning itself via cross-validation, which is done in parallel, and the actual training of the SVM model is single-threaded...-- Erin LeDell Ph.D. Statistician & Machine Learning Scientist | H2O.ai
OR
L1-Regularized Distributed Optimization: A Communication-Efficient Primal-Dual Framework
As Erin mentioned we decided not to implement our own SVM, instead in Sparkling Water we expose Spark's SVM implementation (http://spark.apache.org/docs/latest/mllib-linear-methods.html#linear-support-vector-machines-svms).
This feature is only available through Sparkling Water (and I'm still working on a blog post about it), but if that's ok with you then just download one of the newest versions and you can try it out through FlowUI. I understand this might be a bit inconvenient for you as you're using R, maybe rsparkling could help here?
Regards,
Mateusz