Good day everyone!
At my company we have a HA gerrit installion, and we are struggling to properly configure gerrit to use full host potential.
Please find or typical 'high' load during nighlty builds in the attachments.
As you can see on a screenshot, nearly half of the cores are doing nothing (25% total load at peak)
On the other hand, all of our sshd.threads are busy with git clones, gerrit ls-projects taking forever to show result and even web ui starts giving 504 error and taking long time to show changes/repos.
Also, it seems related and concerning that only around 800 fd's are open during nightly fetch, while max is at 8192
How can we improve this situation?Any opinion will be much helpful. Thank you.
Host specs:
60 CPUs
180Gb RAM
SSD on NFS mount
Here's our config:
[core]
packedGitWindowSize = 64k
packedGitLimit = 16g
packedGitOpenFiles = 8192
[container]
javaOptions = -Xms160g
javaOptions = -Xmx160g
javaOptions = -XX:-UseAdaptiveSizePolicy
javaOptions = -XX:+AlwaysPreTouch
javaOptions = -XX:+UseParallelGC
user = root
heapLimit = 160g
[sshd]
batchThreads = 110
commandStartThreads = 10
waitTimeout = 60m
idleTimeout = 60m
maxConnectionsPerUser = 1000
threads = 120
[sshd]
batchThreads = 110
commandStartThreads = 10
waitTimeout = 60m
idleTimeout = 60m
maxConnectionsPerUser = 1000
threads = 120
P.S. There was a discussion about adding some cheat sheet to gerrit reppo, but I've failed to find one. And it might have been very useful
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/0290dd1a-2567-4ce1-a842-2c31641e5f3dn%40googlegroups.com.
Good day everyone!
At my company we have a HA gerrit installion, and we are struggling to properly configure gerrit to use full host potential.
Please find or typical 'high' load during nighlty builds in the attachments.
As you can see on a screenshot, nearly half of the cores are doing nothing (25% total load at peak)
On the other hand, all of our sshd.threads are busy with git clones, gerrit ls-projects taking forever to show result and even web ui starts giving 504 error and taking long time to show changes/repos.
Also, it seems related and concerning that only around 800 fd's are open during nightly fetch, while max is at 8192
How can we improve this situation?Any opinion will be much helpful. Thank you.
On Thu, Mar 14, 2024 at 2:41 PM Dmytro Rodionov <smpli...@gmail.com> wrote:Good day everyone!
At my company we have a HA gerrit installion, and we are struggling to properly configure gerrit to use full host potential.
Please find or typical 'high' load during nighlty builds in the attachments.
As you can see on a screenshot, nearly half of the cores are doing nothing (25% total load at peak)
On the other hand, all of our sshd.threads are busy with git clones, gerrit ls-projects taking forever to show result and even web ui starts giving 504 error and taking long time to show changes/repos.
Also, it seems related and concerning that only around 800 fd's are open during nightly fetch, while max is at 8192
How can we improve this situation?Any opinion will be much helpful. Thank you.First of all install a monitoring solution e.g. https://gerrit.googlesource.com/gerrit-monitoring/+/refs/heads/masterMonitor the time the JVM spends on running Java gc. If percentage spent on Java gc goes through the roofyou are overloading the process. In that case it's typically better to reduce the sshd thread pool sizes toprevent overload.
Check the hit rates of Gerrit caches and increase their size if necessary.Run git gc on a regular schedule on all repos. Run it more frequently on busy repositories.
On Thursday, March 14, 2024 at 3:05:26 PM UTC+1 Matthias Sohn wrote:On Thu, Mar 14, 2024 at 2:41 PM Dmytro Rodionov <smpli...@gmail.com> wrote:Good day everyone!
At my company we have a HA gerrit installion, and we are struggling to properly configure gerrit to use full host potential.
Please find or typical 'high' load during nighlty builds in the attachments.
As you can see on a screenshot, nearly half of the cores are doing nothing (25% total load at peak)
On the other hand, all of our sshd.threads are busy with git clones, gerrit ls-projects taking forever to show result and even web ui starts giving 504 error and taking long time to show changes/repos.
Also, it seems related and concerning that only around 800 fd's are open during nightly fetch, while max is at 8192
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/a8706fef-7d8a-4079-8233-88ee53f70e8fn%40googlegroups.com.
+1 the usual suspect, well worth investigating (but I don't think it should affect `ls-projects` which smells more like cache issue).