HiWe are experiencing sporadic errors with ssh connections to gerrit server running 2.5.1. Normally there is no impact on gerrit users, but continuous integration engines like Jenkins constantly fail due to this error. The environment we use for our gerrit server is:Red Hat Enterprise Linux Server release 5.3 (Tikanga)Linux hostname 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT 2010 x86_64 x86_64 x86_64 GNU/LinuxOpenSSH_4.3p2OpenSSL 0.9.8e-fips-rhel5
We ran a script that loops ssh connection to port 29418 with one second delay for about 30 hours, we got around 200 errors.The error message:hash mismatchdebug1: ssh_rsa_verify: signature incorrectkey_verify failed for server_host_key
You can see the full ssh output attached.Investigating this issue led us to a conclusion that this might be related to OpenSSH, OpenSSL, or OS Kernel versions. Anyone experienced such behavior before ? Any hints to resolve this ?
Are you using control master?
I've just confirmed that I can reproduce the bug on a clean CentOS install, running both Gerrit 2.4.2 and Gerrit 2.7. Also, since it doesn't seem to matter what SSH client is connecting, almost seems like something deep in the kernel. Which is definitely less-than-good news.
A bit more info. This may be related to the version of OpenJDK that ships with CentOS/RHEL. I went ahead and installed the Bouncy Castle library, and that appears to resolve the issue.
Doug, could you run more tests, especially when the server is busy? IMHO Bouncy Castle has nothing to do with this issue.
My suspicion is that this comes from jsch library. In our experience the error happens when there are concurrent ssh connections, the busier the server gets the more often we get this error.For us it started on master instance and we couldn't reproduce the error on any mirror up to a point when some of the mirrors started to be heavily used and this was the moment when the error popped up there as well.
Personally I think this issue is similar to the topic discussed in this thread a while ago:You can try to use GIT_SSH environment variable as Swan suggests in that topic and this is also described in the Scaling Gerrit Installations Wiki:
OK, I just confirmed both boxes are running the Oracle JRE -- the Ubuntu box is running "Java(TM) SE Runtime Environment (build 1.6.0_27-b07)" and the CentOS box is running "Java(TM) SE Runtime Environment (build 1.7.0_25-b15)". Additionally, I also reproduced this in a VM with a clean CentOS install running the latest version of the OpenJDK that CentOS has packaged (1.7.0_25 -- with a JRE string of "OpenJDK Runtime Environment (rhel-2.3.10.4.el6_4-x86_64)"). So, back to the drawing board with "what in the world did RedHat do?" ;) Personally, I wouldn't be surprised to see something funny like SELinux playing into this, but anything's possible.
HiWe are experiencing sporadic errors with ssh connections to gerrit server running 2.5.1. Normally there is no impact on gerrit users, but continuous integration engines like Jenkins constantly fail due to this error. The environment we use for our gerrit server is:
/ ... snip
Investigating this issue led us to a conclusion that this might be related to OpenSSH, OpenSSL, or OS Kernel versions. Anyone experienced such behavior before ? Any hints to resolve this ?
For our issue, it has been resolved by adding the bouncy castle library thanks to Doug's suggestion. We tested this with both Jetty and Tomcat.
For more options, visit https://groups.google.com/d/optout.--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to a topic in the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/repo-discuss/JE7OM6o7DMs/unsubscribe.
To unsubscribe from this group and all its topics, send an email to repo-discuss...@googlegroups.com.
FWIW we hit this problem in Stash recently, and this thread came up. Turns out the latest version of Mina (0.11) hadn't made the appropriate change yet.If anyone is interested the relevant issue is here (fixed in 0.12):
On Tuesday, November 5, 2013 8:26:32 AM UTC+11, Doug Kelly wrote:From the notes I found while looking into the JSch side of this problems, it seems some things changed in JCE according to the JSch developer's notes in the issue tracker. Specifically, he added this little note in KeyExchange.java:+ /*+ * It seems JCE included in Oracle's Java7u6(and later) has suddenly changed+ * its behavior. The secrete generated by KeyAgreement#generateSecret()+ * may start with 0, even if it is a positive value.+ */So, I guess this would have to be something for the Apache Mina developers to take up accordingly... but for now, Bouncy Castle is humming along fine.--Doug
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
On Thu, Jun 26, 2014 at 3:22 PM, Charles O'Farrell <char...@gmail.com> wrote:
FWIW we hit this problem in Stash recently, and this thread came up. Turns out the latest version of Mina (0.11) hadn't made the appropriate change yet.If anyone is interested the relevant issue is here (fixed in 0.12):Thanks for the info!We also hit this problem several times in last couple of weeks... Looks like it started after we switchedto Java 7.
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to a topic in the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/repo-discuss/JE7OM6o7DMs/unsubscribe.
To unsubscribe from this group and all its topics, send an email to repo-discuss...@googlegroups.com.