--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/fc96d2a8-3efa-4f50-8821-d844b74f943c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
I went through one of the earlier post where this issue was discussed but unfortunately no solutions were provided. I am facing the exact same issue very frequently, at least twice a week. Whenever this happens, only solution is to restart gerrit after which everything is back to normal. This is causing huge productivity loss.I am using Gerrit 2.13.9 version that is running on HTTPS/SSL. Whenever this happens, I see that httpd thread count is maxed out [256 threads] and CPU is at 100%. My java melody plugin shows a huge spike in JDBC threads whenever we see this issue, not sure if this is due to this gerrit bug. All the java melody graphs are attached below.
To unsubscribe, email rep...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-d...@googlegroups.com.
Stack-trace
at org.eclipse.jetty.servlet.ServletHandler.doScope(<a href="https://git-master.nvidia.com/r/monitoring?part=source&class=org.eclipse.jetty.servlet.ServletHandler#515" type="external" title="org.eclipse.jetty.servlet.ServletHandler" style="color:rgb(102,0,187)" target="_blank" rel="nofollow" onmousedown="this.href='https://www.google.com/url?q\x3dhttps%3A%2F%2Fgit-master.nvidia.com%2Fr%2Fmonitoring%3Fpart%3Dsource%26class%3Dorg.eclipse.jetty.servlet.ServletHandler%23515\x26sa\x3dD\x26sntz\x3d1\x26usg\x3dAFQjCNFkqLZmfNSjvCsb20AlMVmWOUJOFA';return true;" onclick="this.href='https://www.google.com/url?q\x3dhttps%3A%2F%2Fgit-master.nvidia.com%2Fr%2Fmonitoring%3Fpart%3Dsource%26class%3Dorg.eclipse.jetty.servlet.ServletHandler%235
On 31 May 2019, at 10:10, hari <bhh...@gmail.com> wrote:Hi Luca,Thanks for your quick reply. Regarding your questions,1) you have an excessive heap utilization: are you using slaves for the Git clone traffic? How big are your repos? Have you tried to get a heap analysis in JavaMelody to see where you consume all that memory?- Yes, we are using read-only slaves worldwide for git clone traffic. However we have certain super users who will fetch directly from the gerrit master server ( these super users contribute to around 3K-4K fetches every day on gerrit master). And regarding the heap analysis, I am trying to generate heap dump as we speak. I can share it once it is generated and analyzed.
2)looks like you have some peaks of 60k queries per minute !!! (up to 1k queries a second) I am not surprised you end up eating all your TCP ports and then not able to connect to MySQL anymore. Why don't you use a co-located MySQL on localhost?- Pardon my ignorance, but what you exactly mean by co-located mysql on localhost? Do you mean having multi-master ?
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/9deab1f8-d8da-45d1-9b5e-7ce28d21fb96%40googlegroups.com.
On 31 May 2019, at 10:18, hari <bhh...@gmail.com> wrote:Yes, upgrade is in pipeline. We are about to upgrade to 2.16.8 next quarter.
But I am just making sure, we are not hitting the same issue again may be due to poor workflow.
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/9171efd6-260d-4f6a-b4a3-5e4cb7896767%40googlegroups.com.
Yes, upgrade is in pipeline. We are about to upgrade to 2.16.8 next quarter. But I am just making sure, we are not hitting the same issue again may be due to poor workflow. And regarding the 'suexec', we indeed have given the 'RunAs' to few super-users (mostly service accounts). But I am unsure how to trace their fetch calls to confirm if they are using suexec parameters.
--
Also our repos are quite big. The biggest repo is around 60 G and entire repos constitute around 1000Gigs
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/b465bc23-d4a0-4a41-b4af-e9d6cc6a9b9f%40googlegroups.com.
To unsubscribe, email repo-d...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-d...@googlegroups.com.
Here's our current settings.[core]packedGitOpenFiles = 4096packedGitLimit = 8gpackedGitWindowSize = 16k
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/26d4211e-c344-4c3c-8539-1decd52dac94%40googlegroups.com.
On Fri, May 31, 2019 at 11:40 AM hari <bhh...@gmail.com> wrote:Here's our current settings.[core]packedGitOpenFiles = 4096packedGitLimit = 8gpackedGitWindowSize = 16kyour packedGitLimit is way too small if you have 1TB of repositories and your largest repo has 60GB.This means you can only read 8GB worth of git objects from cache, for all objects not in the cacheyou need to read from the filesystem and (re-)parse git objects. If someone clones your 60GBrepository the cache content will be displaced at least 5 times and all requests for other repositoriesneed to go back and read from disk and (re-)parse objects.If you increase it don't forget to also increase container.heapLimit.Probably also packedGitOpenFiles is too small.We usecore.packedGitOpenFiles =30000core.packedGitLimit=96gcontainer.heapLimit=256g
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/CAKSZd3TUp80RdqEuqodyrTq-74rNFj873JQsvjfeMwVgqAQNRg%40mail.gmail.com.
Sent from my iPhoneOn Fri, May 31, 2019 at 11:40 AM hari <bhh...@gmail.com> wrote:Here's our current settings.[core]packedGitOpenFiles = 4096packedGitLimit = 8gpackedGitWindowSize = 16kyour packedGitLimit is way too small if you have 1TB of repositories and your largest repo has 60GB.This means you can only read 8GB worth of git objects from cache, for all objects not in the cacheyou need to read from the filesystem and (re-)parse git objects. If someone clones your 60GBrepository the cache content will be displaced at least 5 times and all requests for other repositoriesneed to go back and read from disk and (re-)parse objects.If you increase it don't forget to also increase container.heapLimit.Probably also packedGitOpenFiles is too small.We usecore.packedGitOpenFiles =30000core.packedGitLimit=96gcontainer.heapLimit=256gWow, what JVM are you using? With Java 8 and a 256Gb of heap the STW GC would take minutes !!!!

On May 31, 2019, at 3:26 AM, Luca Milanesio <luca.mi...@gmail.com> wrote:On 31 May 2019, at 10:18, hari <bhh...@gmail.com> wrote:Yes, upgrade is in pipeline. We are about to upgrade to 2.16.8 next quarter.Plan small steps, not skipping any release in between.But I am just making sure, we are not hitting the same issue again may be due to poor workflow.Don't expect an upgrade to solve your scalability issues: first understand what is the problem and how to change your setup to make it more scalable. Then, do small steps to improve things out.Upgrade is one of them, but isn't the "silver bullet" to solve scalability issues.
To view this discussion on the web visit https://groups.google.com/d/msgid/repo-discuss/A5A2C484-DC17-4D68-9023-84D9A272DBE3%40gmail.com.
during week days we serve around 100k http requests + 100k ssh requests per hourfrom the master (we also use slaves mostly to offload load from build servers)
and GC pauses are typically between 10ms up to around 1s: