[JIRA] (JENKINS-52922) slave is getting disconnected in the middle of the job

g.sumanthnath@gmail.com (JIRA)

Aug 7, 2018, 12:04:02 PM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu created an issue
 
Jenkins / Bug JENKINS-52922
slave is getting disconnected in the middle of the job
Issue Type: Bug
Assignee: Unassigned
Components: slave-squatter-plugin
Created: 2018-08-07 16:03
Environment: Production
Priority: Major
Reporter: sumanthnath gangavarapu

FATAL: command execution failed
java.nio.channels.ClosedChannelException
    at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer.onReadClosed(ChannelApplicationLayer.java:209)
    at org.jenkinsci.remoting.protocol.ApplicationLayer.onRecvClosed(ApplicationLayer.java:222)
    at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.onRecvClosed(ProtocolStack.java:832)
    at org.jenkinsci.remoting.protocol.FilterLayer.onRecvClosed(FilterLayer.java:287)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.onRecvClosed(SSLEngineFilterLayer.java:181)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.switchToNoSecure(SSLEngineFilterLayer.java:283)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processWrite(SSLEngineFilterLayer.java:503)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processQueuedWrites(SSLEngineFilterLayer.java:248)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doSend(SSLEngineFilterLayer.java:200)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doCloseSend(SSLEngineFilterLayer.java:213)
    at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.doCloseSend(ProtocolStack.java:800)
    at org.jenkinsci.remoting.protocol.ApplicationLayer.doCloseWrite(ApplicationLayer.java:173)
    at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer$ByteBufferCommandTransport.closeWrite(ChannelApplicationLayer.java:314)
    at hudson.remoting.Channel.close(Channel.java:1450)
    at hudson.remoting.Channel.close(Channel.java:1403)
    at hudson.slaves.SlaveComputer.closeChannel(SlaveComputer.java:821)
    at hudson.slaves.SlaveComputer.access$800(SlaveComputer.java:105)
    at hudson.slaves.SlaveComputer$3.run(SlaveComputer.java:737)
    at jenkins.util.ContextResettingExecutorService$1.run(ContextResettingExecutorService.java:28)
    at jenkins.security.ImpersonatingExecutorService$1.run(ImpersonatingExecutorService.java:59)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Caused: java.io.IOException: Backing channel 'JNLP4-connect connection from myip/myip:49165' is disconnected.
    at hudson.remoting.RemoteInvocationHandler.channelOrFail(RemoteInvocationHandler.java:214)
    at hudson.remoting.RemoteInvocationHandler.invoke(RemoteInvocationHandler.java:283)
    at com.sun.proxy.$Proxy128.isAlive(Unknown Source)
    at hudson.Launcher$RemoteLauncher$ProcImpl.isAlive(Launcher.java:1137)
    at hudson.Launcher$RemoteLauncher$ProcImpl.join(Launcher.java:1129)
    at hudson.tasks.CommandInterpreter.join(CommandInterpreter.java:155)
    at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:109)
    at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:66)
    at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
    at hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:744)
    at hudson.model.Build$BuildExecution.build(Build.java:206)
    at hudson.model.Build$BuildExecution.doRun(Build.java:163)
    at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:504)
    at hudson.model.Run.execute(Run.java:1815)
    at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
    at hudson.model.ResourceController.execute(ResourceController.java:97)
    at hudson.model.Executor.run(Executor.java:429)
FATAL: Unable to delete script file C:\Windows\TEMP\jenkins880525539975639792.ps1
java.nio.channels.ClosedChannelException
    at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer.onReadClosed(ChannelApplicationLayer.java:209)
    at org.jenkinsci.remoting.protocol.ApplicationLayer.onRecvClosed(ApplicationLayer.java:222)
    at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.onRecvClosed(ProtocolStack.java:832)
    at org.jenkinsci.remoting.protocol.FilterLayer.onRecvClosed(FilterLayer.java:287)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.onRecvClosed(SSLEngineFilterLayer.java:181)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.switchToNoSecure(SSLEngineFilterLayer.java:283)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processWrite(SSLEngineFilterLayer.java:503)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processQueuedWrites(SSLEngineFilterLayer.java:248)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doSend(SSLEngineFilterLayer.java:200)
    at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doCloseSend(SSLEngineFilterLayer.java:213)
    at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.doCloseSend(ProtocolStack.java:800)
    at org.jenkinsci.remoting.protocol.ApplicationLayer.doCloseWrite(ApplicationLayer.java:173)
    at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer$ByteBufferCommandTransport.closeWrite(ChannelApplicationLayer.java:314)
    at hudson.remoting.Channel.close(Channel.java:1450)
    at hudson.remoting.Channel.close(Channel.java:1403)
    at hudson.slaves.SlaveComputer.closeChannel(SlaveComputer.java:821)
    at hudson.slaves.SlaveComputer.access$800(SlaveComputer.java:105)
    at hudson.slaves.SlaveComputer$3.run(SlaveComputer.java:737)
    at jenkins.util.ContextResettingExecutorService$1.run(ContextResettingExecutorService.java:28)
    at jenkins.security.ImpersonatingExecutorService$1.run(ImpersonatingExecutorService.java:59)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Caused: hudson.remoting.ChannelClosedException: Channel "unknown": Remote call on JNLP4-connect connection from Myip/Myip:49165 failed. The channel is closing down or has closed down
    at hudson.remoting.Channel.call(Channel.java:948)
    at hudson.FilePath.act(FilePath.java:1070)
    at hudson.FilePath.act(FilePath.java:1059)
    at hudson.FilePath.delete(FilePath.java:1563)
    at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:123)
    at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:66)
    at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
    at hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:744)
    at hudson.model.Build$BuildExecution.build(Build.java:206)
    at hudson.model.Build$BuildExecution.doRun(Build.java:163)
    at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:504)
    at hudson.model.Run.execute(Run.java:1815)
    at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
    at hudson.model.ResourceController.execute(ResourceController.java:97)
    at hudson.model.Executor.run(Executor.java:429)
Build step 'Windows PowerShell' marked build as failure
[htmlpublisher] Archiving HTML reports...
ERROR: Build step failed with exception
java.lang.NullPointerException
    at htmlpublisher.HtmlPublisher.publishReports(HtmlPublisher.java:226)
    at htmlpublisher.HtmlPublisher.perform(HtmlPublisher.java:193)
    at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
    at hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:744)
    at hudson.model.AbstractBuild$AbstractBuildExecution.performAllBuildSteps(AbstractBuild.java:690)
    at hudson.model.Build$BuildExecution.post2(Build.java:186)
    at hudson.model.AbstractBuild$AbstractBuildExecution.post(AbstractBuild.java:635)
    at hudson.model.Run.execute(Run.java:1840)
    at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
    at hudson.model.ResourceController.execute(ResourceController.java:97)
    at hudson.model.Executor.run(Executor.java:429)
Build step 'Publish HTML reports' marked build as failure
ERROR: Step ‘Publish NUnit test result report’ failed: no workspace for morpheus_qa_intgr_incentive #614
Email was triggered for: Always
Sending email for trigger: Always
ERROR: Error: No workspace found!
Sending email to: sumanthnath
Connection error sending email, retrying once more in 10 seconds...
Connection error sending email, retrying once more in 10 seconds...
Failed after second try sending email
Finished: FAILURE

This message was sent by Atlassian JIRA (v7.10.1#710002-sha1:6efc396)

o.v.nenashev@gmail.com (JIRA)

Aug 7, 2018, 12:12:02 PM
to jenkinsc...@googlegroups.com
Oleg Nenashev updated an issue
Change By: Oleg Nenashev
Component/s: remoting (added)
Component/s: slave-squatter-plugin (removed)

g.sumanthnath@gmail.com (JIRA)

Aug 7, 2018, 12:16:01 PM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu updated an issue
Change By: sumanthnath gangavarapu
Attachment: image-2018-08-07-21-45-04-675.png

g.sumanthnath@gmail.com (JIRA)

Aug 7, 2018, 12:16:02 PM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu commented on Bug JENKINS-52922
 
Re: slave is getting disconnected in the middle of the job

Jenkins version: 2.134
Jenkins Master installed in Linux
Jenkins Slave installed in Windows
Below is the Jenkins slave properties window (see attachment image-2018-08-07-21-45-04-675.png).

g.sumanthnath@gmail.com (JIRA)

Aug 8, 2018, 1:30:02 AM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu updated an issue
Change By: sumanthnath gangavarapu
Attachment: image-2018-08-08-10-59-23-440.png

g.sumanthnath@gmail.com (JIRA)

Aug 8, 2018, 1:31:02 AM
to jenkinsc...@googlegroups.com
 
Re: slave is getting disconnected in the middle of the job

We are using the SSH Slaves and Windows Slaves plugins to connect to the slaves (see attachment image-2018-08-08-10-59-23-440.png).

g.sumanthnath@gmail.com (JIRA)

Aug 10, 2018, 5:04:02 AM
to jenkinsc...@googlegroups.com

Hi Team,

Can I get help on this issue? If you need any further information, please comment in this thread and I will provide the details.

Thanks & Regards
Sumanthnath G

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 8:52:01 AM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu updated an issue
Change By: sumanthnath gangavarapu
Priority: Major → Blocker
This message was sent by Atlassian Jira (v7.11.2#711002-sha1:fdc329d)

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 8:54:02 AM
to jenkinsc...@googlegroups.com
 
Re: slave is getting disconnected in the middle of the job

Hi Team,

This is a complete blocker for us.
Can you please look into this as a priority?

Thanks & Regards
Sumanthnath G

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 9:13:01 AM
to jenkinsc...@googlegroups.com

Jeff Thompson Allen Polak Erina Gomi Gemini A abang herbal Hiroshi Ujishi Paul Weber Muthamizh K dfgjshej jshrehrj Alyson Henry Abhijit Paul Bruno Henrique da Silva Carlos Henrique de Sousa Junior Henry Senior Allen Jeng

Can any one of you please guide me on how we can resolve this issue?
This has been escalated in my current organisation.
I opened this ticket a month ago, anticipating help from this team.
Please let me know if I missed anything in the process to get help.
Thank you

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 9:14:01 AM
to jenkinsc...@googlegroups.com
[~jthompson] [~00500005] [~0000144223] [~007g3m1n1] [~abang] [~0007924021] [~000panther] [~0023] [~fdnsjdghjhb] [~alhen123] [~abhijitpaul_02] [~brunohe] [~bode] [~delphboy] [~ajeng]

Can any one of you please guide me on how we can resolve this issue?

This has been escalated in my current organisation.
I opened this ticket a month ago, anticipating help from this team.
Please let me know if I missed anything in the process to get help.
Thank you

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 9:15:02 AM
to jenkinsc...@googlegroups.com
[~jthompson] [~00500005] [~0000144223] [~007g3m1n1] [~abang] [~0007924021] [~000panther] [~0023] [~fdnsjdghjhb] [~alhen123] [~abhijitpaul_02] [~brunohe] [~bode] [~delphboy] [~ajeng]

Can any one of you please guide me on how we can resolve this issue?

This has been escalated in my current organisation.
I opened this ticket a month ago, anticipating help from this team.
Please let me know if I missed anything in the process to get help.
Thank you

Sumanth

g.sumanthnath@gmail.com (JIRA)

Sep 6, 2018, 9:28:01 AM
to jenkinsc...@googlegroups.com
[~jthompson] [~00500005] [~0000144223] [~007g3m1n1] [~abang] [~0007924021] [~000panther] [~0023] [~fdnsjdghjhb] [~alhen123] [~abhijitpaul_02] [~brunohe] [~bode] [~delphboy] [~ajeng]

Can any one of you please guide me on how we can resolve this issue?
This has been escalated in my current organisation.
I opened this ticket a month ago, anticipating help from this team.
Please let me know if I have missed anything in the process to get help.
Thank you
Sumanth

g.sumanthnath@gmail.com (JIRA)

Sep 7, 2018, 3:51:01 AM
to jenkinsc...@googlegroups.com
[~jthompson] [~00500005] [~0000144223] [~007g3m1n1] [~abang] [~0007924021] [~000panther] [~0023] [~fdnsjdghjhb] [~alhen123] [~abhijitpaul_02] [~brunohe] [~bode] [~delphboy] [~ajeng]

Can any one of you please guide me on how we can resolve this issue?
This has been escalated in my current organisation.
I opened this ticket a month ago, anticipating help from this team.
Please let me know if I have missed anything in the process to get help.
Thank you
Sumanth

jthompson@cloudbees.com (JIRA)

Sep 11, 2018, 4:21:02 PM
to jenkinsc...@googlegroups.com

sumanthnath gangavarapu, there isn't enough information in your report to perform a meaningful diagnosis. A ClosedChannelException merely indicates that the channel the agent and master were communicating on has been closed. Nothing in the report provides any indication as to how or why the channel was closed.

Generally, these sorts of failures are due to system or environment issues. It isn't a problem in the code but something external that causes the connection to close. You should investigate your systems and networks so that you can isolate and determine the cause of the failure.

A short, abbreviated sample of the things you should examine:

  1. Check agent logs.
  2. Check master logs.
  3. Check system logs, such as Windows Event logs.
  4. Look for high resource usage (CPU, disk, network, etc.) on master and agent machines.
  5. Investigate network issues.
  6. Look for any commonalities among the failures such as
    1. Time of day
    2. Duration of job
    3. Operating system
    4. System uptime
  7. Check the health and logs of the hosting environments.
  8. Match all of the timestamps and events up together with the failures to try and develop a picture of what is occurring.
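
One possible way to approach item 8 is to merge the relevant logs into a single timeline around the moment of a failure. The following is only a minimal sketch in Python, not a Jenkins tool: it assumes every log entry begins with a fixed-width timestamp, and the format string, the `Event`, `parse_log` and `events_near` names, and the sample lines are all hypothetical. Real master, agent and Windows Event logs use different timestamp formats, so `fmt` would need adjusting per log.

```python
from datetime import datetime, timedelta
from typing import Iterable, List, NamedTuple


class Event(NamedTuple):
    when: datetime
    source: str   # which log the line came from, e.g. "master" or "agent"
    message: str


def parse_log(source: str, lines: Iterable[str],
              fmt: str = "%Y-%m-%d %H:%M:%S") -> List[Event]:
    """Parse lines that start with a timestamp in `fmt` (an assumed format).

    Lines that do not start with a timestamp, such as stack frames,
    are skipped rather than attached to the previous entry.
    """
    # Width of a rendered timestamp; only valid for fixed-width formats.
    width = len(datetime(2000, 1, 1).strftime(fmt))
    events = []
    for line in lines:
        try:
            when = datetime.strptime(line[:width], fmt)
        except ValueError:
            continue
        events.append(Event(when, source, line[width:].strip()))
    return events


def events_near(events: Iterable[Event], failure: datetime,
                window: timedelta) -> List[Event]:
    """Return the events within `window` of `failure`, ordered by time."""
    return sorted(e for e in events if abs(e.when - failure) <= window)


# Usage with made-up sample lines (not taken from the attached logs).
master = parse_log("master", ["2018-08-07 16:03:10 channel closed"])
agent = parse_log("agent", ["2018-08-07 16:03:05 connection lost"])
for event in events_near(master + agent,
                         datetime(2018, 8, 7, 16, 3, 10),
                         timedelta(minutes=1)):
    print(event.when, event.source, event.message)
```

Reading the merged output in time order makes it easier to see which machine noticed the problem first, which is often a hint about where to look next.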

As a simple example, I saw some similar behavior on my Windows machine when it decided it was time to go to sleep. (Windows server OSes don't usually have this behavior.)

I'm sorry that I'm not able to solve this problem for you, but the investigation needs to be performed at your end. If you manage to figure out the cause, I'd be curious to learn what you discovered.

g.sumanthnath@gmail.com (JIRA)

Sep 11, 2018, 11:12:02 PM
to jenkinsc...@googlegroups.com
sumanthnath gangavarapu updated an issue
Change By: sumanthnath gangavarapu
Attachment: MasterNode-Logs.txt
Attachment: SlaveNode-Logs.txt

g.sumanthnath@gmail.com (JIRA)

Sep 11, 2018, 11:17:04 PM