[JIRA] (JENKINS-57555) Remote children jobs should be restartable in case the connection drops

2 views
Skip to first unread message

alexandru.balut@gmail.com (JIRA)

unread,
May 20, 2019, 4:05:03 AM5/20/19
to jenkinsc...@googlegroups.com
Alexandru Băluț created an issue
 
Jenkins / New Feature JENKINS-57555
Remote children jobs should be restartable in case the connection drops
Issue Type: New Feature New Feature
Assignee: Unassigned
Components: core
Created: 2019-05-20 08:04
Environment: Jenkins ver. 2.164.3
Labels: remote
Priority: Minor Minor
Reporter: Alexandru Băluț

When a job running on a remote node "fails" because the connection drops, it should be possible to restart it automatically. Currently it's cumbersome to get a list of the failed children jobs and checking how they failed by grepping for java.io.EOFException in their log files.

https://groups.google.com/forum/#!topic/jenkinsci-users/sjnF6zlPd3Q

This is how the failure looks in the children job console output:

Building remotely on instance-1 (tag1) in workspace /var/lib/jenkins/workspace/eval
[vmu-eval-single] $ /bin/sh -xe /tmp/jenkins4616924287086740166.sh
+ /path/to/evaluation-tool
FATAL: command execution failed
java.io.EOFException
	at java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2681)
	at java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:3156)
	at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:862)
	at java.io.ObjectInputStream.<init>(ObjectInputStream.java:358)
	at hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:49)
	at hudson.remoting.Command.readFrom(Command.java:140)
	at hudson.remoting.Command.readFrom(Command.java:126)
	at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:36)
	at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:63)
Caused: java.io.IOException: Unexpected termination of the channel
	at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:77)
Caused: java.io.IOException: Backing channel 'instance-1' is disconnected.
	at hudson.remoting.RemoteInvocationHandler.channelOrFail(RemoteInvocationHandler.java:214)
	at hudson.remoting.RemoteInvocationHandler.invoke(RemoteInvocationHandler.java:283)
	at com.sun.proxy.$Proxy78.isAlive(Unknown Source)
	at hudson.Launcher$RemoteLauncher$ProcImpl.isAlive(Launcher.java:1144)
	at hudson.Launcher$RemoteLauncher$ProcImpl.join(Launcher.java:1136)
	at hudson.tasks.CommandInterpreter.join(CommandInterpreter.java:155)
	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:109)
	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:66)
	at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
	at hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:744)
	at hudson.model.Build$BuildExecution.build(Build.java:206)
	at hudson.model.Build$BuildExecution.doRun(Build.java:163)
	at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:504)
	at hudson.model.Run.execute(Run.java:1816)
	at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
	at hudson.model.ResourceController.execute(ResourceController.java:97)
	at hudson.model.Executor.run(Executor.java:429)
Add Comment Add Comment
 
This message was sent by Atlassian Jira (v7.11.2#711002-sha1:fdc329d)
Reply all
Reply to author
Forward
0 new messages