]Hi all,
I am presently working on H2O in Python and R for building gradient boosting models(GBM).I have used functions like doParallel,foreach(in R ) and multiprocessing (in python) for parallelization of GBM. But my runtimes are longer than expected. I am not able measure/observe how H2O is utilizing my system cores and memory.Below is the help i need:
- How does H2O perform parallelization?
- what does "H2O is parallel at algorithm level and not at model level" mean?
- GBM is a sequential model. Can I parallelize a sequential process? If yes, how can we do it in R/python and how can we be sure/observe that parallel processing is happening as intended.
- If i am building decision trees will "H2O make sure that all tress all build parallely in different processors"? or do i need to specify that using in built functions?
Please give me an detailed explanation, i am unable to understand these concepts.
There is additional information about this topic in this SO
question:
https://stackoverflow.com/questions/43444333/parallel-processing-in-r-with-h2o
--
You received this message because you are subscribed to the Google Groups "H2O Open Source Scalable Machine Learning - h2ostream" group.
To unsubscribe from this group and stop receiving emails from it, send an email to h2ostream+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
-- Erin LeDell, Ph.D. Chief Machine Learning Scientist | H2O.ai