[JIRA] (JENKINS-58301) Kubernetes Plugin Repeated Socket Ping Timeout Exceptions


autarch@outlook.com (JIRA)

Jul 2, 2019, 10:12:02 AM
to jenkinsc...@googlegroups.com
autarch princeps created an issue
 
Jenkins / Bug JENKINS-58301
Kubernetes Plugin Repeated Socket Ping Timeout Exceptions
Issue Type: Bug
Assignee: Carlos Sanchez
Components: kubernetes-plugin
Created: 2019-07-02 14:11
Environment: Jenkins version: 2.182
Kubernetes Plugin: 1.6.2
Kubernetes: v1.13.7-eks-c57ff8
Labels: plugin kubernetes-plugin kubernetes timeout
Priority: Major
Reporter: autarch princeps

I often get issues like this:
java.net.SocketTimeoutException: sent ping but didn't receive pong within 1000ms (after 330 successful ping/pongs)

A single failure breaks the entire task and makes it hard to even cancel it. Should this not be retried rather than aborting execution? Our Jenkins also runs long-running tasks, so any task that stops partway through is a real problem, and I don't see why one network failure after 330 successful ping/pongs (in this case) should be fatal.

I set the timeout (both Connection and Read) in the settings of the Kubernetes Plugin as well, but it still only has a timeout of one second.
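
The retry behaviour asked for above can be sketched as a small counter that tolerates a few consecutive missed pongs before declaring the connection dead. This is a hypothetical illustration only: the class `PongTolerance`, its methods, and the threshold of 3 are assumptions made for the example, not how the Kubernetes plugin or its WebSocket client actually behaves.

```java
final class PongTolerance {
    private final int maxConsecutiveMisses;
    private int consecutiveMisses;

    PongTolerance(int maxConsecutiveMisses) {
        if (maxConsecutiveMisses < 1) {
            throw new IllegalArgumentException("maxConsecutiveMisses must be >= 1");
        }
        this.maxConsecutiveMisses = maxConsecutiveMisses;
    }

    /** A pong arrived in time: the connection is healthy, so reset the counter. */
    void onPong() {
        consecutiveMisses = 0;
    }

    /** A pong deadline passed. Returns true only once the miss budget is used up. */
    boolean onMissedPong() {
        consecutiveMisses++;
        return consecutiveMisses >= maxConsecutiveMisses;
    }

    public static void main(String[] args) {
        PongTolerance tolerance = new PongTolerance(3); // hypothetical threshold
        for (int i = 0; i < 330; i++) {
            tolerance.onPong(); // 330 successful ping/pongs, as in the report
        }
        System.out.println(tolerance.onMissedPong()); // false: a single miss is tolerated
        tolerance.onPong();                           // recovery resets the counter
        System.out.println(tolerance.onMissedPong()); // false
        System.out.println(tolerance.onMissedPong()); // false
        System.out.println(tolerance.onMissedPong()); // true: three misses in a row
    }
}
```

With such a policy, one missed pong after 330 successful ones would not fail the build; only a sustained outage would.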

 
This message was sent by Atlassian Jira (v7.11.2#711002-sha1:fdc329d)

deiwin.sarjas@gmail.com (JIRA)

Jul 2, 2019, 3:42:02 PM
to jenkinsc...@googlegroups.com
Deiwin Sarjas commented on Bug JENKINS-58301
 
Re: Kubernetes Plugin Repeated Socket Ping Timeout Exceptions

We've also been seeing quite a bit of this recently on EKS with Jenkins 2.164.2, Kubernetes plugin 1.15.2.

autarch@outlook.com (JIRA)

Jul 3, 2019, 3:33:02 AM
to jenkinsc...@googlegroups.com
autarch princeps updated an issue
 
Change By: autarch princeps
Environment: Jenkins version: 2.182
Kubernetes Plugin: 1.6.2 -> 1.16.2
Kubernetes: v1.13.7-eks-c57ff8

jglick@cloudbees.com (JIRA)

Jul 16, 2019, 3:06:03 PM
to jenkinsc...@googlegroups.com
Jesse Glick updated an issue
Change By: Jesse Glick
Labels: kubernetes kubernetes-plugin plugin timeout

jglick@cloudbees.com (JIRA)

Jul 16, 2019, 3:43:47 PM
to jenkinsc...@googlegroups.com
Jesse Glick assigned an issue to Unassigned
Change By: Jesse Glick
Assignee: Carlos Sanchez (removed)

jtcheng@alauda.io (JIRA)

Jul 18, 2019, 5:43:04 AM
to jenkinsc...@googlegroups.com

grechtyrone1989@gmail.com (JIRA)

Aug 1, 2019, 2:34:02 AM
to jenkinsc...@googlegroups.com

We are also encountering this issue fairly often in our CI system running:

  • On premises Kubernetes cluster on version 1.14.1
  • Jenkins version 2.186
  • Kubernetes Plugin version 1.17.2

deiwin.sarjas@gmail.com (JIRA)

Sep 18, 2019, 5:57:04 AM
to jenkinsc...@googlegroups.com

We configured -Dkubernetes.websocket.ping.interval=30000 for Jenkins based on this comment on another issue. I'll report back if it helps or not.
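
For context, a `-D` flag sets a JVM system property, which Java code typically reads with a fallback default. The sketch below only illustrates that mechanism. The class name, the helper method, and the 1000 ms fallback (chosen to mirror the timeout in the reported exception) are assumptions for the example, not the Kubernetes plugin's or its client library's actual code.

```java
final class PingIntervalProperty {
    // Property name taken from the comment above.
    static final String PROPERTY = "kubernetes.websocket.ping.interval";
    // Assumed fallback for illustration; mirrors the 1000ms in the reported exception.
    static final long ASSUMED_DEFAULT_MS = 1000L;

    static long pingIntervalMillis() {
        // Long.getLong reads the named system property and parses it as a long,
        // returning the default when the property is unset or not a valid number.
        return Long.getLong(PROPERTY, ASSUMED_DEFAULT_MS);
    }

    public static void main(String[] args) {
        System.out.println(PROPERTY + " = " + pingIntervalMillis() + " ms");
    }
}
```

Started with `-Dkubernetes.websocket.ping.interval=30000` on the `java` command line, this reports 30000 ms; without the flag it falls back to the assumed default.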


juha.tiensyrja@arm.com (JIRA)

Sep 18, 2019, 6:05:05 AM
to jenkinsc...@googlegroups.com

That option helped for us. But the reason the pings started to fail was actually the JVM garbage collector, which caused the master to hang for more than 1 second. We switched from the default collector to G1GC to reduce the time the master is blocked, and this helped with other timeouts too.
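
To check which collector a JVM is actually running and how much time it has spent collecting, the standard `java.lang.management` API can be queried. The following is a minimal sketch; the class name `GcReport` is made up for the example. On HotSpot, G1 is selected with the `-XX:+UseG1GC` flag.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.ArrayList;
import java.util.List;

final class GcReport {
    /** One line per collector: name, collection count, cumulative collection time. */
    static List<String> describeCollectors() {
        List<String> lines = new ArrayList<>();
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            // getCollectionCount/getCollectionTime return -1 when the value is undefined.
            lines.add(gc.getName() + ": " + gc.getCollectionCount()
                    + " collections, " + gc.getCollectionTime() + " ms total");
        }
        return lines;
    }

    public static void main(String[] args) {
        describeCollectors().forEach(System.out::println);
    }
}
```

The totals are cumulative, so they do not show the length of individual pauses; GC logs are the better source for confirming a stall longer than one second.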

Lee.McClintock@fisglobal.com (JIRA)

Sep 19, 2019, 8:26:04 AM
to jenkinsc...@googlegroups.com
Lee McClintock updated an issue
 
Change By: Lee McClintock
Description: unchanged from the original report, except for a stray " x" appended to the final sentence.

aburdajewicz@cloudbees.com (JIRA)

Dec 16, 2019, 8:54:03 PM
to jenkinsc...@googlegroups.com