spark.driver.memory
resp., spark.yarn.am.memory
andspark.executor.memory
),spark.driver.extraJavaOptions
resp.,spark.yarn.am.extraJavaOptions
and spark.executor.extraJavaOptions
)spark.yarn.driver.memoryOverhead
, spark.yarn.am.memoryOverhead
,
orspark.yarn.executor.memoryOverhead
For running Sparkling Water on top of Yarn:
Furthermore, we recommend to configure the following Spark properties to speedup and stabilize creation of H2O services on top of Spark cluster:
Property | Context | Value | Explanation |
---|---|---|---|
spark.locality.wait |
all | 3000 |
Number of seconds to wait for task launch on data-local node. We recommend to increase since we would like to make sure that H2O tasks are processed locally with data. |
spark.scheduler.minRegisteredResourcesRatio |
all | 1 |
Make sure that Spark starts scheduling when it sees 100% of resources. |
spark.task.maxFailures |
all | 1 |
Do not try to retry failed tasks. |
spark...extraJavaOptions |
all | -XX:MaxPermSize=384m |
Increase PermGen size if you are running on Java7. Make sure to configure it on driver/executor/Yarn application manager. |
spark.yarn.....memoryOverhead |
yarn | increase | Increase memoryOverhead if it is necessary. |
spark.yarn.max.executor.failures |
yarn | 1 |
Do not try restart executors after failure and directly fail computation. |
--
You received this message because you are subscribed to the Google Groups "H2O Open Source Scalable Machine Learning - h2ostream" group.
To unsubscribe from this group and stop receiving emails from it, send an email to h2ostream+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.