Hi,
We're currently running Gerrit 2.13.4 (via a docker container) and one of our users has reported numerous hangs where multiple attempts are required to push a patch via ssh for review.
Looking at show-connections and show-queue I'm able to see a number of connections/tasks stuck corresponding with the times they reported the issue.
Session User Remote Host
--------------------------------------------------------------
decdfa53 user1 10.37.229.16
2b2c2314 user1 10.37.229.16
920d4eb5 user1 10.37.229.16
71b69c75 user1 10.37.229.16
60fc8833 user1 10.37.229.16
ea4521d1 user1 10.37.229.16
218b64e8 user1 10.37.229.16
75f708e7 user1 10.37.229.16
bce9fb98 user2 10.37.229.130
d3cce359 user2 10.37.229.130
7178972e user2 10.37.229.130
0f6722b6 <me> 10.37.78.218
--
SSHD Backend: nio2
Task State StartTime Command
------------------------------------------------------------------------------
de789af3 Jun-07 12:28 git-receive-pack ... (user1)
ab17b3c3 Jun-07 12:37 git-receive-pack ... (user1)
d21746c7 Jun-07 12:41 git-receive-pack ... (user1)
b19f34e4 Jun-07 12:45 git-receive-pack ... (user1)
20345011 Jun-07 12:50 git-receive-pack ... (user1)
ea50c19c Jun-08 15:47 git-receive-pack ... (user1)
61807cc5 Jun-08 15:51 git-receive-pack ... (user1)
b50100b8 Jun-09 12:03 git-receive-pack ... (user1)
fcf3f38a Jun-09 12:23 git-receive-pack ... (user2)
13c6fb3b Jun-09 12:26 git-receive-pack ... (user2)
f196c782 Jun-09 12:32 git-receive-pack ... (user2)
c9e00b24 23:00:00.004 Jun-07 08:38 Log File Compressor
------------------------------------------------------------------------------
15 tasks
I've pruned some connections relating to our CI systems, because they are watching stream events.
Looking at the SSH log I can see the connections on Jun 7th hanging:
[2017-06-07 08:09:22,253 +0000] f3e92ad2 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:09:23,206 +0000] f3e92ad2 user1 a/109 git-upload-pack.... 7ms 489ms 0
[2017-06-07 08:09:23,546 +0000] f3e92ad2 user1 a/109 LOGOUT
[2017-06-07 08:09:24,890 +0000] f80b47ac user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:13:26,583 +0000] f80b47ac user1 a/109 LOGOUT
[2017-06-07 08:13:26,583 +0000] f80b47ac user1 a/109 git-receive-pack.... 0ms 241238ms killed
[2017-06-07 08:13:31,829 +0000] a2fb5e96 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:13:32,770 +0000] a2fb5e96 user1 a/109 git-upload-pack.... 7ms 479ms 0
[2017-06-07 08:13:33,106 +0000] a2fb5e96 user1 a/109 LOGOUT
[2017-06-07 08:13:34,515 +0000] c2ef72d2 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:13:35,438 +0000] c2ef72d2 user1 a/109 git-receive-pack.... 1ms 467ms 0 git/2.7.4
[2017-06-07 08:13:35,774 +0000] c2ef72d2 user1 a/109 LOGOUT
[2017-06-07 08:18:27,645 +0000] e53165bc user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:18:28,593 +0000] e53165bc user1 a/109 git-upload-pack.... 7ms 484ms 0
[2017-06-07 08:18:28,930 +0000] e53165bc user1 a/109 LOGOUT
[2017-06-07 08:18:30,417 +0000] a54e0d31 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:18:31,329 +0000] a54e0d31 user1 a/109 git-receive-pack.... 0ms 457ms 0 git/2.7.4
[2017-06-07 08:18:31,665 +0000] a54e0d31 user1 a/109 LOGOUT
[2017-06-07 08:19:06,584 +0000] aaefbad9 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:19:07,576 +0000] aaefbad9 user1 a/109 git-upload-pack.... 7ms 529ms 0
[2017-06-07 08:19:07,913 +0000] aaefbad9 user1 a/109 LOGOUT
[2017-06-07 08:19:09,594 +0000] ea3132bc user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:19:10,574 +0000] ea3132bc user1 a/109 git-receive-pack.... 1ms 524ms 0 git/2.7.4
[2017-06-07 08:19:10,914 +0000] ea3132bc user1 a/109 LOGOUT
[2017-06-07 08:19:42,131 +0000] 2a1aeadd user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:19:43,077 +0000] 2a1aeadd user1 a/109 git-upload-pack.... 7ms 483ms 0
[2017-06-07 08:19:43,415 +0000] 2a1aeadd user1 a/109 LOGOUT
[2017-06-07 08:19:44,736 +0000] 3f030e92 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 08:19:45,983 +0000] 3f030e92 user1 a/109 git-receive-pack.... 0ms 788ms 0 git/2.7.4
[2017-06-07 08:19:46,323 +0000] 3f030e92 user1 a/109 LOGOUT
[2017-06-07 12:28:35,642 +0000] 5e926a3c user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:28:36,629 +0000] 5e926a3c user1 a/109 git-upload-pack.... 5ms 521ms 0
[2017-06-07 12:28:36,968 +0000] 5e926a3c user1 a/109 LOGOUT
[2017-06-07 12:28:38,303 +0000] decdfa53 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:37:54,941 +0000] eb4a8bc3 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:37:55,974 +0000] eb4a8bc3 user1 a/109 git-upload-pack.... 7ms 539ms 0
[2017-06-07 12:37:56,337 +0000] eb4a8bc3 user1 a/109 LOGOUT
[2017-06-07 12:37:57,706 +0000] 2b2c2314 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:41:12,567 +0000] 062d9f10 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:41:13,648 +0000] 062d9f10 user1 a/109 git-upload-pack.... 8ms 593ms 0
[2017-06-07 12:41:13,987 +0000] 062d9f10 user1 a/109 LOGOUT
[2017-06-07 12:41:15,328 +0000] 920d4eb5 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:45:37,841 +0000] d1c508c7 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:45:38,796 +0000] d1c508c7 user1 a/109 git-upload-pack.... 7ms 492ms 0
[2017-06-07 12:45:39,133 +0000] d1c508c7 user1 a/109 LOGOUT
[2017-06-07 12:45:40,542 +0000] 71b69c75 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:50:07,232 +0000] 40ae2428 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:50:08,178 +0000] 40ae2428 user1 a/109 git-upload-pack.... 6ms 484ms 0
[2017-06-07 12:50:08,516 +0000] 40ae2428 user1 a/109 LOGOUT
[2017-06-07 12:50:09,838 +0000] 60fc8833 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:54:20,431 +0000] 9dcb283f user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:54:21,376 +0000] 9dcb283f user1 a/109 git-upload-pack.... 5ms 483ms 0
[2017-06-07 12:54:21,716 +0000] 9dcb283f user1 a/109 LOGOUT
[2017-06-07 12:54:23,097 +0000] 3d41dcae user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:54:36,584 +0000] 3d41dcae user1 a/109 git-receive-pack.... 0ms 12997ms 0 git/2.7.4
[2017-06-07 12:54:36,920 +0000] 3d41dcae user1 a/109 LOGOUT
The connections that hung:
[2017-06-07 12:28:38,303 +0000] decdfa53 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:37:57,706 +0000] 2b2c2314 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:41:15,328 +0000] 920d4eb5 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:45:40,542 +0000] 71b69c75 user1 a/109 LOGIN FROM 10.37.229.16
[2017-06-07 12:50:09,838 +0000] 60fc8833 user1 a/109 LOGIN FROM 10.37.229.16
I haven't seen anything obvious from the changelog for 2.13.5+ that indicates a bug being fixed that would prevent these from hanging, and I'm unsure on how to debug further. So far the people affected appear to be about 2 out of 60+.
We've asked the user affected the most to run 'git-review' with GIT_SSH_COMMAND='ssh -vvv' for a while to see if there is a clear indication as to what might be happening from the client perspective. But given these connections appear to be permanently stuck on the server, it doesn't seem likely that will provide much more info.
Any suggestions on what to look for next? Any debug that can be turned on that might be useful?
--
Darragh Bailey