13/06/17 15:27:20 ERROR local.LocalScheduler: Exception in task 6 java.lang.OutOfMemoryError: Java heap space at java.lang.String.substring(String.java:1913) at java.lang.String.split(String.java:2288) at java.lang.String.split(String.java:2355) at sparktutorial.SparkLRhdfs2$.readPoint(SparkLRhdfs2.scala:20)
val sc = new SparkContext("local", "SparkLRhdfs2", "/home/jayyonamine/devel/spark", List("target/scala-2.9.2/spark-tutorial_2.9.2-0.1.jar"))when you are running it?, seems obvious but local there will mean it won't use your cluster nodes at all, the log references the local scheduler a lot and only storing data on one node.
-Jay--
You received this message because you are subscribed to the Google Groups "Spark Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to spark-users...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Another thing we aren't totally clear about in the docs - it's not possible to submit jobs over a WAN. The reason is that the driver also spawns a server and it needs to be able to receive incoming connections from the scheduler.
13/06/17 20:21:19 INFO slf4j.Slf4jEventHandler: Slf4jEventHandler started 13/06/17 20:21:20 INFO actor.ActorSystemImpl: RemoteServerStarted@akka://sparkE...@ip-10-232-52-182.ec2.internal:59847 13/06/17 20:21:20 INFO executor.StandaloneExecutorBackend: Connecting to driver: akka://sp...@10.170.9.137:39863/user/StandaloneScheduler 13/06/17 20:21:20 INFO actor.ActorSystemImpl: RemoteClientStarted@akka://sp...@10.170.9.137:39863 13/06/17 20:21:20 INFO executor.StandaloneExecutorBackend: Successfully registered with driver 13/06/17 20:21:20 INFO slf4j.Slf4jEventHandler: Slf4jEventHandler started 13/06/17 20:21:20 INFO actor.ActorSystemImpl: RemoteServerStarted@akka://sp...@ip-10-232-52-182.ec2.internal:57851 13/06/17 20:21:20 INFO spark.SparkEnv: Connecting to BlockManagerMaster: akka://sp...@10.170.9.137:39863/user/BlockManagerMaster 13/06/17 20:21:20 INFO storage.MemoryStore: MemoryStore started with capacity 3.8 GB. 13/06/17 20:21:20 INFO storage.DiskStore: Created local directory at /mnt/spark/spark-local-20130617202120-b19c 13/06/17 20:21:20 INFO storage.DiskStore: Created local directory at /mnt2/spark/spark-local-20130617202120-d058 13/06/17 20:21:20 INFO network.ConnectionManager: Bound socket to port 41050 with id = ConnectionManagerId(ip-10-232-52-182.ec2.internal,41050) 13/06/17 20:21:20 INFO storage.BlockManagerMaster: Trying to register BlockManager 13/06/17 20:21:20 INFO actor.ActorSystemImpl: RemoteClientStarted@akka://sp...@10.170.9.137:39863 13/06/17 20:21:30 WARN storage.BlockManagerMaster: Error sending message to BlockManagerMaster in 1 attempts java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds at akka.dispatch.DefaultPromise.ready(Future.scala:870) at akka.dispatch.DefaultPromise.result(Future.scala:874) at akka.dispatch.Await$.result(Future.scala:74) at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:136) at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:115) at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:46) at spark.storage.BlockManager.initialize(BlockManager.scala:138) at spark.storage.BlockManager.<init>(BlockManager.scala:123) at spark.storage.BlockManager.<init>(BlockManager.scala:130) at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:102) at spark.executor.Executor.<init>(Executor.scala:68) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36) at akka.actor.Actor$class.apply(Actor.scala:318) at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16) at akka.actor.ActorCell.invoke(ActorCell.scala:626) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197) at akka.dispatch.Mailbox.run(Mailbox.scala:179) at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516) at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259) at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975) at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479) at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104) 13/06/17 20:21:43 WARN storage.BlockManagerMaster: Error sending message to BlockManagerMaster in 2 attempts java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds at akka.dispatch.DefaultPromise.ready(Future.scala:870) at akka.dispatch.DefaultPromise.result(Future.scala:874) at akka.dispatch.Await$.result(Future.scala:74) at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:136) at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:115) at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:46) at spark.storage.BlockManager.initialize(BlockManager.scala:138) at spark.storage.BlockManager.<init>(BlockManager.scala:123) at spark.storage.BlockManager.<init>(BlockManager.scala:130) at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:102) at spark.executor.Executor.<init>(Executor.scala:68) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36) at akka.actor.Actor$class.apply(Actor.scala:318) at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16) at akka.actor.ActorCell.invoke(ActorCell.scala:626) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197) at akka.dispatch.Mailbox.run(Mailbox.scala:179) at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516) at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259) at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975) at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479) at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104) 13/06/17 20:21:56 WARN storage.BlockManagerMaster: Error sending message to BlockManagerMaster in 3 attempts java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds at akka.dispatch.DefaultPromise.ready(Future.scala:870) at akka.dispatch.DefaultPromise.result(Future.scala:874) at akka.dispatch.Await$.result(Future.scala:74) at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:136) at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:115) at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:46) at spark.storage.BlockManager.initialize(BlockManager.scala:138) at spark.storage.BlockManager.<init>(BlockManager.scala:123) at spark.storage.BlockManager.<init>(BlockManager.scala:130) at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:102) at spark.executor.Executor.<init>(Executor.scala:68) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36) at akka.actor.Actor$class.apply(Actor.scala:318) at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16) at akka.actor.ActorCell.invoke(ActorCell.scala:626) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197) at akka.dispatch.Mailbox.run(Mailbox.scala:179) at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516) at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259) at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975) at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479) at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104) 13/06/17 20:21:59 ERROR executor.StandaloneExecutorBackend: Error sending message to BlockManagerMaster [message = RegisterBlockManager(BlockManagerId(11, ip-10-232-52-182.ec2.internal, 41050),4081511301,Actor[akka://spark/user/BlockManagerActor1])] spark.SparkException: Error sending message to BlockManagerMaster [message = RegisterBlockManager(BlockManagerId(11, ip-10-232-52-182.ec2.internal, 41050),4081511301,Actor[akka://spark/user/BlockManagerActor1])] at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:150) at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:115) at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:46) at spark.storage.BlockManager.initialize(BlockManager.scala:138) at spark.storage.BlockManager.<init>(BlockManager.scala:123) at spark.storage.BlockManager.<init>(BlockManager.scala:130) at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:102) at spark.executor.Executor.<init>(Executor.scala:68) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39) at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36) at akka.actor.Actor$class.apply(Actor.scala:318) at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16) at akka.actor.ActorCell.invoke(ActorCell.scala:626) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197) at akka.dispatch.Mailbox.run(Mailbox.scala:179) at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516) at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259) at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975) at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479) at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104) Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds at akka.dispatch.DefaultPromise.ready(Future.scala:870) at akka.dispatch.DefaultPromise.result(Future.scala:874) at akka.dispatch.Await$.result(Future.scala:74) at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:136) ... 19 more 13/06/17 20:21:59 INFO executor.StandaloneExecutorBackend: Connecting to driver: akka://sp...@10.170.9.137:39863/user/StandaloneScheduler 13/06/17 20:21:59 INFO executor.StandaloneExecutorBackend: Got assigned task 22 13/06/17 20:21:59 ERROR executor.StandaloneExecutorBackend: Received launchTask but executor was null
package sparktutorialimport spark.SparkContextimport SparkContext._import spark._object WordCount2 {def main(args: Array[String]) {val sc = new SparkContext(args(1), "Wordcount2", "/root/spark/", List("target/scala-2.9.2/spark-tutorial_2.9.2-0.1.jar"))val file = sc.textFile(args(2)).cache()val counts = file.flatMap(line => line.split(" ")).map(word => (word, 1)).reduceByKey(_ + _)println(counts)}}I get this to end my log:[success] Total time: 4 s, completed Jun 28, 2013 8:22:31 PM > 13/06/28 20:22:32ERROR client.Client$ClientActor: Connection to master failed; stopping client 13/06/28 20:22:32ERROR cluster.SparkDeploySchedulerBackend: Disconnected from Spark cluster! 13/06/28 20:22:32ERROR cluster.ClusterScheduler: Exiting due to error from cluster scheduler: Disconnected from Spark clusterany ideas there?