scala> 13/04/02 16:00:55 INFO cluster.SparkDeploySchedulerBackend: Executor 1 disconnected, so removing it
13/04/02 16:00:55 ERROR cluster.ClusterScheduler: Lost an executor 1 (already removed): remote Akka client shutdown
13/04/02 16:00:55 INFO client.Client$ClientActor: Executor updated: app-20130402225934-0003/1 is now FAILED (Command exited with code 1)
13/04/02 16:00:55 INFO cluster.SparkDeploySchedulerBackend: Executor app-20130402225934-0003/1 removed: Command exited with code 1
13/04/02 16:00:55 INFO client.Client$ClientActor: Executor added: app-20130402225934-0003/2 on worker-20130402225332-u9-r1.mtv-53693 (u9-r1.mtv) with 8 cores
13/04/02 16:00:55 INFO cluster.SparkDeploySchedulerBackend: Granted executor ID app-20130402225934-0003/2 on host u9-r1.mtv with 8 cores, 512.0 MB RAM
13/04/02 16:00:55 INFO client.Client$ClientActor: Executor updated: app-20130402225934-0003/2 is now RUNNING
13/04/02 16:00:56 INFO cluster.SparkDeploySchedulerBackend: Registered executor: Actor[akka://sparkE...@u9-r1.mtv:48641/user/Executor] with ID 2
I found that each executor has its own log, and the executor logs contain entries like this:
2013-04-02 23:00:11 WARN [sparkExecutor-akka.actor.default-dispatcher-1]: BlockManagerMaster:64 - Error sending message to BlockManagerMaster in 3 attempts
java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds
at akka.dispatch.DefaultPromise.ready(Future.scala:870)
at akka.dispatch.DefaultPromise.result(Future.scala:874)
at akka.dispatch.Await$.result(Future.scala:74)
at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:154)
at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:133)
at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:64)
at spark.storage.BlockManager.initialize(BlockManager.scala:123)
at spark.storage.BlockManager.<init>(BlockManager.scala:108)
at spark.storage.BlockManager.<init>(BlockManager.scala:115)
at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:91)
at spark.executor.Executor.initialize(Executor.scala:72)
at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39)
at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36)
at akka.actor.Actor$class.apply(Actor.scala:318)
at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16)
at akka.actor.ActorCell.invoke(ActorCell.scala:626)
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
at akka.dispatch.Mailbox.run(Mailbox.scala:179)
at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
2013-04-02 23:00:14 ERROR [sparkExecutor-akka.actor.default-dispatcher-2]: StandaloneExecutorBackend:47 - Error sending message to BlockManagerMaster [message = RegisterBlockManager(BlockManagerId(0, u9-r1.mtv, 50205),339585269,Actor[akka://spark/user/BlockManagerActor1])]
spark.SparkException: Error sending message to BlockManagerMaster [message = RegisterBlockManager(BlockManagerId(0, u9-r1.mtv, 50205),339585269,Actor[akka://spark/user/BlockManagerActor1])]
at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:168)
at spark.storage.BlockManagerMaster.tell(BlockManagerMaster.scala:133)
at spark.storage.BlockManagerMaster.registerBlockManager(BlockManagerMaster.scala:64)
at spark.storage.BlockManager.initialize(BlockManager.scala:123)
at spark.storage.BlockManager.<init>(BlockManager.scala:108)
at spark.storage.BlockManager.<init>(BlockManager.scala:115)
at spark.SparkEnv$.createFromSystemProperties(SparkEnv.scala:91)
at spark.executor.Executor.initialize(Executor.scala:72)
at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:39)
at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:36)
at akka.actor.Actor$class.apply(Actor.scala:318)
at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:16)
at akka.actor.ActorCell.invoke(ActorCell.scala:626)
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
at akka.dispatch.Mailbox.run(Mailbox.scala:179)
at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000] milliseconds
at akka.dispatch.DefaultPromise.ready(Future.scala:870)
at akka.dispatch.DefaultPromise.result(Future.scala:874)
at akka.dispatch.Await$.result(Future.scala:74)
at spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:154)
... 19 more
2013-04-02 23:00:14 ERROR [sparkExecutor-akka.actor.default-dispatcher-3]: StandaloneExecutorBackend:47 - Slave registration failed: Duplicate executor ID: 0