| This is a fresher issue, with fewer things going on, this time the agent that got disconnected is called grub Job console output shows (jobConsoleOutput.txt) show at 17:27:54
hudson.remoting.ChannelClosedException: Channel "unknown": Remote call on grub failed. The channel is closing down or has closed down
at hudson.remoting.Channel.call(Channel.java:948)
at hudson.FilePath.act(FilePath.java:1089)
at hudson.FilePath.act(FilePath.java:1078)
.....
17:27:55 ERROR: Issue with creating launcher for agent grub. The agent has not been fully initialized yet
jenkins master log at that time (jenkins.log) shows the following lines:
Jul 04, 2018 5:27:54 PM hudson.remoting.SynchronousCommandTransport$ReaderThread run
SEVERE: I/O error in channel grub
java.io.IOException: Unexpected termination of the channel
at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:77)
Caused by: java.io.EOFException
at java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2328)
at java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2797)
at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:802)
... [trimmed stacktrace]
Jul 04, 2018 5:27:55 PM hudson.model.Slave reportLauncherCreateError
WARNING: Issue with creating launcher for agent grub. The agent has not been fully initialized yetProbably there is a race condition with Agent reconnection or disconnection, check other log entries
java.lang.IllegalStateException: No remoting channel to the agent OR it has not been fully initialized yet
at hudson.model.Slave.reportLauncherCreateError(Slave.java:524)
at hudson.model.Slave.createLauncher(Slave.java:496)
... [trimmed stacktrace]
Jul 04, 2018 5:27:55 PM hudson.model.Slave reportLauncherCreateError
WARNING: Issue with creating launcher for agent grub. The agent has not been fully initialized yetProbably there is a race condition with Agent reconnection or disconnection, check other log entries
java.lang.IllegalStateException: No remoting channel to the agent OR it has not been fully initialized yet
at hudson.model.Slave.reportLauncherCreateError(Slave.java:524)
at hudson.model.Slave.createLauncher(Slave.java:496)
... [trimmed stacktrace]
Jul 04, 2018 5:27:55 PM com.squareup.okhttp.internal.Platform$JdkWithJettyBootPlatform getSelectedProtocol
INFO: ALPN callback dropped: SPDY and HTTP/2 are disabled. Is alpn-boot on the boot class path?
Jul 04, 2018 5:27:55 PM org.jenkinsci.plugins.workflow.job.WorkflowRun finish
INFO: rndtest_vortexLibrary/master #289 completed: ABORTED
The agent remoting log that shows the error is the file created at 5:08 pm (remoting.log.2 inside grub.remoting.logs.zip)
but it does not have a timestamp in the message. it would be handy to have one. because I can not work out if the agent or jenkins master initiated the disconnection. I've also included
- The full support log (support_2018-07-04_07.35.22.zip)
- The logs under ${JENKINS_HOME}/logs/slaves/grub (slaveLogInMaster.grub.zip)
- Agent system Information that I grub just minutes after seeing the disconnection.
- System Information (grubSystemInformation.html) - Heap Dump (JavaMelodyGrubHeapDump_4_07_18.pdf) - threads (JavaMelodyNodeGrubThreads_4_07_18.pdf) - (MonitoringJavaelodyOnNodes.html) * A screenshot (NetworkAndMachineStats.png) of the stats of the master (jenkinssecure1) and the agent (grub) showing the netowrk activity, memory and cpu history. Hardly anything going on.
|