--
You received this message because you are subscribed to the Google Groups "actionml-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to actionml-use...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/actionml-user/dcf94173-4de3-440f-a046-f8094c746292%40googlegroups.com.
We have a two-machine cluster: a Master and one Worker, both configured with 64 GB. The Harness log shows a successful connection. We were unable to get Harness and the Spark cluster to connect until we added these properties to our Engine Spark configuration and modified the compose .yml file with the same property values:

"spark.driver.host": "<some host>",
"spark.driver.port": "45678",

Both the Master and Worker Spark UIs show things executing, but the Worker stderr has the output below and stdout has nothing. The same events train successfully when Spark is configured for localhost.

Spark Executor Command: "/usr/lib/jvm/java-1.8-openjdk/bin/java" "-cp" "///conf:/spark/jars/*" "-Xmx61440M" "-Dspark.driver.port=45678" "org.apache.spark.executor.CoarseGrainedExecutorBackend" "--driver-url" "spark://CoarseGrainedScheduler@harness.vm:45678" "--executor-id" "0" "--hostname" "worker.spark.vm" "--cores" "4" "--app-id" "app-20200409134745-0003" "--worker-url" "spark://Wor...@worker.spark.vm:37145"
========================================
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
20/04/09 13:48:56 INFO CoarseGrainedExecutorBackend: Started daemon with process name: 1...@worker.spark.vm
20/04/09 13:48:56 INFO SignalUtils: Registered signal handler for TERM
20/04/09 13:48:56 INFO SignalUtils: Registered signal handler for HUP
20/04/09 13:48:56 INFO SignalUtils: Registered signal handler for INT
20/04/09 13:48:57 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
20/04/09 13:48:57 INFO SecurityManager: Changing view acls to: root
20/04/09 13:48:57 INFO SecurityManager: Changing modify acls to: root
20/04/09 13:48:57 INFO SecurityManager: Changing view acls groups to:
20/04/09 13:48:57 INFO SecurityManager: Changing modify acls groups to:
20/04/09 13:48:57 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(root); groups with view permissions: Set(); users with modify permissions: Set(root); groups with modify permissions: Set()
20/04/09 13:48:57 INFO TransportClientFactory: Successfully created connection to harness.vm/10.145.94.199:45678 after 83 ms (0 ms spent in bootstraps)
20/04/09 13:48:58 INFO SecurityManager: Changing view acls to: root
20/04/09 13:48:58 INFO SecurityManager: Changing modify acls to: root
20/04/09 13:48:58 INFO SecurityManager: Changing view acls groups to:
20/04/09 13:48:58 INFO SecurityManager: Changing modify acls groups to:
20/04/09 13:48:58 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(root); groups with view permissions: Set(); users with modify permissions: Set(root); groups with modify permissions: Set()
20/04/09 13:48:58 INFO TransportClientFactory: Successfully created connection to harness.vm/10.145.94.199:45678 after 2 ms (0 ms spent in bootstraps)
20/04/09 13:48:58 INFO DiskBlockManager: Created local directory at /tmp/spark-ecacf44e-92b8-4596-9917-1df157adef40/executor-c95689b4-5891-4852-9cc8-27e30385f4f1/blockmgr-22da189f-793d-466c-a836-37fb7b5bbafd
20/04/09 13:48:58 INFO MemoryStore: MemoryStore started with capacity 31.8 GB
20/04/09 13:48:58 INFO CoarseGrainedExecutorBackend: Connecting to driver: spark://CoarseGrainedScheduler@harness.vm:45678
We only put those configs in because Harness wasn't opening (or listening on) the port the worker was trying to connect to; we got a connection refused error on that port. So maybe we have a different underlying issue?

Thanks.
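
For anyone retracing the connection refused symptom described above, here is a minimal sketch of a TCP reachability check that could be run from the worker machine. The helper name can_connect is hypothetical and is not part of Harness or Spark; it uses only the Python standard library. It tells you whether something is listening on a given host and port, nothing more.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; not part of Harness or Spark.
    """
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False
```

Calling can_connect("harness.vm", 45678) on the worker would exercise the driver host and port from the executor command in this thread. A True result only confirms that the port accepts TCP connections, which the TransportClientFactory lines in the stderr output already suggest for 45678.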
On Thursday, April 9, 2020 at 11:15:35 AM UTC-4, bspr...@gmail.com wrote: