Gerrit Code Review 2.11.3 now 10:13:57 CDTuptime 13 hrs 30 minName |Entries | AvgGet |Hit Ratio|| Mem Disk Space| |Mem Disk|--------------------------------+---------------------+---------+---------+accounts | 499 | 2.7ms | 97% |accounts_byemail | 217 | 2.3ms | 87% |accounts_byname | 502 | 1.8ms | 86% |adv_bases | | | |changes | | 54.5ms | 0% |groups | 785 | 1.4ms | 91% |groups_byinclude | | 1.6ms | 16% |groups_byname | | | |groups_byuuid | 321 | 2.7ms | 44% |groups_external | | 3.7ms | 22% |groups_members | 535 | 3.0ms | 99% |ldap_group_existence | | | |ldap_groups | 6 | 45.9ms | 33% |ldap_groups_byinclude | | | |ldap_usernames | 5 | 1.5ms | 88% |permission_sort | 18055 | | 98% |plugin_resources | | | |project_list | 1 | 2.8s | 91% |projects | 10552 | 3.4ms | 98% |sshkeys | 17 | 5.0ms | 98% |D change_kind | 121 121 55.54k| 4.7ms | 65% 100%|D conflicts | 18 18 16.28k| | 57% 100%|D diff | 308 308 384.38k| 111.6ms | 79% 100%|D diff_intraline | 41 41 19.05k| 8.0ms | 14% |D git_tags | 235 235 60.02m| | 35% 99%|D mergeability | 28 28 18.65k| 339.8ms | 68% 100%|D web_sessions | 917 4039 1.66m| | 98% 6%|SSH: 8 users, oldest session started 8 hrs 18 min agoTasks: 4 total = 2 running + 0 ready + 2 sleepingMem: 27.22g total = 23.09g used + 1.76g free + 2.37g buffers64.00g max4096 open filesThreads: 32 CPUs available, 164 threadsNEW RUNNABLE BLOCKED WAITING TIMED_WAITING TERMINATEDSSH git-receive-pack 0 0 0 1 0 0HTTP 0 8 0 0 5 0SSH-Interactive-Worker 0 0 0 30 0 0Other 0 11 0 44 31 0ReceiveCommits 0 0 0 32 0 0SshCommandStart 0 0 0 2 0 0JVM: Oracle Corporation Java HotSpot(TM) 64-Bit Server VM 24.76-b04on Linux 2.6.18-348.el5 amd64
"HTTP-4983" prio=10 tid=0x00002b98ada82800 nid=0x21b6 runnable [0x00002b98a9c4a000]java.lang.Thread.State: RUNNABLEat org.apache.lucene.search.TermScorer.nextDoc(TermScorer.java:65)at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:192)
- locked <0x00002b93ab0686a8> (a com.google.inject.servlet.GuiceFilter$Context)
at com.google.inject.servlet.GuiceFilter.doFilter(GuiceFilter.java:130)at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:221)at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)at org.eclipse.jetty.server.handler.RequestLogHandler.handle(RequestLogHandler.java:95)at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)at org.eclipse.jetty.server.Server.handle(Server.java:497)at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)at java.lang.Thread.run(Unknown Source)
2017-05-11T03:26:33.297-0500: 24190.995: [GC [PSYoungGen: 20841162K->324230K(19940352K)] 37491735K->17271351K(40344064K), 0.3186810 secs] [Times: user=1.40 sys=0.19, real=0.32 secs]2017-05-11T03:27:52.141-0500: 24269.839: [GC [PSYoungGen: 19939974K->64623K(18904064K)] 36887095K->17041180K(39307776K), 0.0518430 secs] [Times: user=0.55 sys=0.03, real=0.05 secs]2017-05-11T03:28:55.948-0500: 24333.646: [GC [PSYoungGen: 18903663K->76096K(18176000K)] 35880220K->17055388K(38579712K), 0.0550160 secs] [Times: user=0.49 sys=0.05, real=0.06 secs]2017-05-11T03:29:43.879-0500: 24381.577: [GC [PSYoungGen: 18175808K->103826K(17499136K)] 35155100K->17101686K(37902848K), 0.0825140 secs] [Times: user=0.72 sys=0.07, real=0.08 secs]2017-05-11T03:30:37.197-0500: 24434.895: [GC [PSYoungGen: 17499026K->269229K(16993792K)] 34496886K->17267089K(37397504K), 0.1586370 secs] [Times: user=1.07 sys=0.28, real=0.16 secs]2017-05-11T03:31:26.637-0500: 24484.335: [GC [PSYoungGen: 16993709K->245822K(16331776K)] 33991569K->17243682K(36735488K), 0.0896720 secs] [Times: user=0.94 sys=0.39, real=0.09 secs]
[ip 1] - [user a] [11/May/2017:03:29:13 -0500] "GET /accounts/[user g]%40[domain1].com/avatar?s=26 HTTP/1.1" 404 30 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/comments HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 1] - [user a] [11/May/2017:03:29:13 -0500] "GET /accounts/[user h]%40[domain2].com/avatar?s=26 HTTP/1.1" 404 30 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/drafts HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 3] - [user c] [11/May/2017:03:29:13 -0500] "GET /changes/992960/detail?O=404 HTTP/1.1" 200 2165 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/files?reviewed HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/files HTTP/1.1" 200 216 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /projects/[project path 1]%2[project path 1]/config HTTP/1.1" 200 499 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:13 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/commit?links HTTP/1.1" 200 529 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 4] - - [11/May/2017:03:29:13 -0500] "GET /favicon.ico HTTP/1.1" 200 - - "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:47.0) Gecko/20100101 Firefox/47.0"[ip 5] - [user d] [11/May/2017:03:29:13 -0500] "GET /changes/992994/detail?O=404 HTTP/1.1" 304 - "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:53.0) Gecko/20100101 Firefox/53.0"[ip 6] - [user e] [11/May/2017:03:29:14 -0500] "GET /changes/969435/detail?O=404 HTTP/1.1" 304 - "https://[gerrit server]/" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/51.0.2704.103 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:14 -0500] "GET /changes/958601/revisions/1/comments HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:14 -0500] "GET /changes/?q=project:[project path 1]+change:I325d6b393bae120c184bb0426b445f779d0b5192+-change:958601+-is:abandoned&O=a HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 7] - [user f] [11/May/2017:03:29:14 -0500] "GET /accounts/self/capabilities?q=createProject&q=createGroup&q=viewPlugins HTTP/1.1" 200 28 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 2] - [user b] [11/May/2017:03:29:14 -0500] "GET /changes/958601/revisions/a746eb84b70e90aad53ccab25fb71f54cbbb53db/related HTTP/1.1" 200 40 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 7] - [user f] [11/May/2017:03:29:14 -0500] "GET /config/server/top-menus HTTP/1.1" 200 645 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 7] - [user f] [11/May/2017:03:29:14 -0500] "GET /accounts/self/preferences HTTP/1.1" 200 315 "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"[ip 4] - - [11/May/2017:03:29:14 -0500] "GET /config/server/top-menus HTTP/1.1" 200 645 "https://[gerrit server]/" "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:47.0) Gecko/20100101 Firefox/47.0"[ip 4] - - [11/May/2017:03:29:14 -0500] "GET /gerrit_ui/gwt/chrome/images/hborder.png HTTP/1.1" 200 - "https://[gerrit server]/gerrit_ui/gwt/chrome/7CF1DE6EF2AABFEFAE4D469A16D60071.cache.css" "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:47.0) Gecko/20100101 Firefox/47.0"[ip 7] - [user f] [11/May/2017:03:29:14 -0500] "GET /changes/958601/edit?list HTTP/1.1" 204 - "https://[gerrit server]/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"
logs> cat error_log |grep 03:29:[2017-05-11 03:29:26,968] INFO com.google.gerrit.httpd.auth.ldap.LdapLoginServlet : '[user i]' failed to sign in: Incorrect username or passwordlogs> cat error_log |grep 03:30:logs>
logs> cat sshd_log |grep 03:29:10[2017-05-11 03:29:10,400 -0500] 3d007e4b [user j] a/1001028 LOGIN FROM [ip 8][2017-05-11 03:29:10,581 -0500] 3d007e4b [user j] a/1001028 git-upload-pack./home/repo/scm/pre_build_scripts.git 1ms 175ms 0[2017-05-11 03:29:10,581 -0500] 3d007e4b [user j] a/1001028 LOGOUT[2017-05-11 03:29:10,620 -0500] 5d09b278 [user k] a/1004944 LOGIN FROM [ip 9][2017-05-11 03:29:10,621 -0500] 5d09b278 [user k] a/1004944 LOGOUT[2017-05-11 03:29:10,764 -0500] bdf30e9a [user j] a/1001028 LOGIN FROM [ip 8][2017-05-11 03:29:10,827 -0500] bdf30e9a [user j] a/1001028 git-upload-pack./home/repo/scm/pre_build_scripts.git 1ms 56ms 0[2017-05-11 03:29:10,865 -0500] bdf30e9a [user j] a/1001028 LOGOUTlogs> cat sshd_log |grep 03:29:11logs> cat sshd_log |grep 03:29:12logs> cat sshd_log |grep 03:29:13logs> cat sshd_log |grep 03:29:14logs> cat sshd_log |grep 03:29:15logs> cat sshd_log |grep 03:29:16logs>
"HTTP-601" prio=10 tid=0x00002b98ac4bb800 nid=0x653b runnable [0x00002b98a8b0e000]
java.lang.Thread.State: RUNNABLEat org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)at org.apache.lucene.util.PriorityQueue.updateTop(PriorityQueue.java:204)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.updateBottom(TopFieldCollector.java:398)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.collect(TopFieldCollector.java:427)at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:193)
at org.apache.lucene.search.Weight$DefaultBulkScorer.score(Weight.java:163)at org.apache.lucene.search.BulkScorer.score(BulkScorer.java:35)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:621)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:581)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)
"HTTP-680" prio=10 tid=0x00002b98a49c0000 nid=0x6981 runnable [0x00002b989ab69000]java.lang.Thread.State: RUNNABLEat org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.collect(TopFieldCollector.java:424)at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:193)
at org.apache.lucene.search.Weight$DefaultBulkScorer.score(Weight.java:163)at org.apache.lucene.search.BulkScorer.score(BulkScorer.java:35)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:621)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:581)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)at com.google.gerrit.lucene.LuceneChangeIndex$QuerySource.read(LuceneChangeIndex.java:408)
"HTTP-4983" prio=10 tid=0x00002b98ada82800 nid=0x21b6 runnable [0x00002b98a9c4a000]java.lang.Thread.State: RUNNABLE
at org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)at org.apache.lucene.util.PriorityQueue.pop(PriorityQueue.java:177)at org.apache.lucene.search.TopFieldCollector.populateResults(TopFieldCollector.java:1203)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:156)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:93)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:582)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)
"HTTP-5822" prio=10 tid=0x00002b98a44ff800 nid=0xd5b runnable [0x00002b98b00fd000]
java.lang.Thread.State: RUNNABLEat org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)at org.apache.lucene.util.PriorityQueue.updateTop(PriorityQueue.java:204)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.updateBottom(TopFieldCollector.java:398)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.collect(TopFieldCollector.java:427)at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:193)
at org.apache.lucene.search.Weight$DefaultBulkScorer.score(Weight.java:163)at org.apache.lucene.search.BulkScorer.score(BulkScorer.java:35)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:621)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:581)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)
"HTTP-6086" prio=10 tid=0x00002b98ac06e800 nid=0x7c5a runnable [0x00002b98ab27d000]
java.lang.Thread.State: RUNNABLEat org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)
at org.apache.lucene.util.PriorityQueue.pop(PriorityQueue.java:177)at org.apache.lucene.search.TopFieldCollector.populateResults(TopFieldCollector.java:1203)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:156)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:93)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:582)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)at com.google.gerrit.lucene.LuceneChangeIndex$QuerySource.read(LuceneChangeIndex.java:408)
"HTTP-640" prio=10 tid=0x000000001a288000 nid=0x60a8 runnable [0x00002b989631d000]
java.lang.Thread.State: RUNNABLEat org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)at org.apache.lucene.util.PriorityQueue.updateTop(PriorityQueue.java:204)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.updateBottom(TopFieldCollector.java:398)at org.apache.lucene.search.TopFieldCollector$MultiComparatorNonScoringCollector.collect(TopFieldCollector.java:427)at org.apache.lucene.search.Weight$DefaultBulkScorer.scoreAll(Weight.java:193)
--
--
To unsubscribe, email repo-discuss...@googlegroups.com
More info at http://groups.google.com/group/repo-discuss?hl=en
---
You received this message because you are subscribed to the Google Groups "Repo and Gerrit Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to repo-discuss...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
On 11 May 2017, at 21:40, Leandro Fonseca <lean...@motorola.com> wrote:
Hi Luca,Yes, I did, pasting it here again anyway:
[gerrit]basePath = [path to projects]canonicalWebUrl = https://[server name]/replicateOnStartup = false[core]
packedGitOpenFiles = 4096 <<<<====
Another information: currently we are just running git-gc on our project, not gerrit-gc.Would that be a problem as well?
if that's one of the bad threads it looks like you have a performance issue with lucene.
Hi Matthew,Correct, version 2.11.3 has some gerrit-gc problems (ack-nak problems while trying to sync the projects if I remember correctly), and that's the reason why we use git-gc.As per what Martin reported, we also run git-gc weekly with Gerrit running and have no problems with that. Do you think it could be causing our issue?Hi Martin & Matthias,Yes, our server specs are underpowered for our need, and we are already working on some upgrades.We have a customized & independent replication system that is not disputing resources with the Gerrit JVM, so our SSH threads are not being used by the replication plugin.For the usage data on this specific instance, that has 1.6k active users, these are the daily averages:14k git-upload-packs40k git-receive-packs3.2k http logins (GET on https://<server-address>/login)1k patch-sets createdWe are not using bitmap indexes on our git-gc operations.
Just one more information: we really started experiencing this problems after we deployed SSL to this instance. And as per Zivkov comment on the related post below this could be related to Jetty. Would there be any setting/log we could use to confirm that? Increasing the heap limit would help anyhow?Also, Matthias have pointed before that the problem could be related to the Lucene index:if that's one of the bad threads it looks like you have a performance issue with lucene.Would the be any way to confirm it besides re-indexing all changes again? Like something I could look into the logs?
"HTTP-63170" prio=10 tid=0x000000000c561000 nid=0x6eae runnable [0x00002b58ea0bb000]
java.lang.Thread.State: RUNNABLEat org.apache.lucene.util.PriorityQueue.downHeap(PriorityQueue.java:246)
at org.apache.lucene.util.PriorityQueue.pop(PriorityQueue.java:177)at org.apache.lucene.search.TopFieldCollector.populateResults(TopFieldCollector.java:1203)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:156)at org.apache.lucene.search.TopDocsCollector.topDocs(TopDocsCollector.java:93)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:582)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:533)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:510)at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:378)at com.google.gerrit.lucene.LuceneChangeIndex$QuerySource.read(LuceneChangeIndex.java:408)at com.google.gerrit.server.index.IndexedChangeQuery.read(IndexedChangeQuery.java:95)at com.google.gerrit.server.index.IndexedChangeQuery.restart(IndexedChangeQuery.java:138)at com.google.gerrit.server.query.change.AndSource.readImpl(AndSource.java:133)at com.google.gerrit.server.query.change.AndSource.read(AndSource.java:99)at com.google.gerrit.server.query.change.QueryProcessor.queryChanges(QueryProcessor.java:153)at com.google.gerrit.server.query.change.QueryProcessor.queryChanges(QueryProcessor.java:102)at com.google.gerrit.server.query.change.QueryChanges.query0(QueryChanges.java:143)at com.google.gerrit.server.query.change.QueryChanges.query(QueryChanges.java:132)at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:99)
...244: if (heap[i] == element) {245: heap[i] = heap[size];246: heap[size] = null; // permit GC of objects247: size--;...
On 12 Jul 2017, at 08:09, 'Björn Pedersen' via Repo and Gerrit Discussion <repo-d...@googlegroups.com> wrote:Hi,
if java gc is the culprit, maybe take a look at
https://www.cloudbees.com/blog/joining-big-leagues-tuning-jenkins-gc-responsiveness-and-stability
and try to tune it.
Björn
at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:99)
Hi Luca,I just checked the stack-trace of multiple examples of these "bad threads".Except for the first lines, that may differ a little but are all under lucene, AND the thread id on the "locked line" (both blocks are highlighted below), everything else looks the same for all threads.Any idea what that could be? Maybe a result of a REST call?Thanks again,Leandro.
Hi everyoneOur Gerrit 2.11.3 installation, that is currently using HTTPS/SSL, is repeatedly increasing its CPU usage until a point that all the server CPUs are 100% used and the Gerrit process obviously breaks. The only solution to free up the used CPU is to restart the Gerrit service.I used jstack to found that the CPU high usage is coming from HTTP threads (Yes, the Gerrit process was using ~ 900% of the server CPU at the time this data was collected):Process id CPU utilization Thread4156 99.9 "HTTP-688" prio=10 tid=0x00000000041cd000 nid=0x103c runnable [0x00002baa61d19000]6476 99.9 "HTTP-785" prio=10 tid=0x00002baa5c796000 nid=0x194c runnable [0x00002baa602ff000]17037 99.9 "HTTP-845" prio=10 tid=0x00002baa5c794800 nid=0x428d runnable [0x00002baa60703000]8466 99.9 "HTTP-882" prio=10 tid=0x00002baa645d5800 nid=0x2112 runnable [0x00002baa6c703000]17404 99.9 "HTTP-924" prio=10 tid=0x00002baa5c0b7000 nid=0x43fc runnable [0x00002baa6201c000]4376 99.9 "HTTP-929" prio=10 tid=0x0000000004650000 nid=0x1118 runnable [0x00002baa6dc14000]2965 99.9 "HTTP-963" prio=10 tid=0x0000000004f28800 nid=0xb95 runnable [0x00002baa5670a000]8810 99.9 "HTTP-10958" prio=10 tid=0x00000000055f1000 nid=0x226a runnable [0x00002baa6edbf000]690 99.9 "HTTP-12100" prio=10 tid=0x0000000005373800 nid=0x2b2 runnable [0x00002baa6ecae000]I could not find much information on the thread full information (using thread 0x226a as example, can provide the info of the others):"HTTP-10958" prio=10 tid=0x00000000055f1000 nid=0x226a runnable [0x00002baa6edbf000]
- locked <0x00002b9a8b8561c8> (a com.google.inject.servlet.GuiceFilter$Context)
at com.google.inject.servlet.GuiceFilter.doFilter(GuiceFilter.java:130)at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:221)at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)at org.eclipse.jetty.server.handler.RequestLogHandler.handle(RequestLogHandler.java:95)at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)at org.eclipse.jetty.server.Server.handle(Server.java:497)at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)at java.lang.Thread.run(Unknown Source)
All these threads were running using 0.0% of CPU since the last time Gerrit process was restarted, some have some small usage and then return to 0.0%, but then all of the sudden they ramp up to 100% CPU usage and stay around this usage rate (97, 98 & 99%) during all their existence, until it gets to a critical level and I need to restart Gerrit again. For the above detailed thread, according to my monitors, this ramp up happened around 09/May/2017:13:04:24.I looked on the httpd_log for any suspicious activity, but there was nothing wrong (not only for this specific case, but also for the others). I even accessed the below listed changes and couldn't reproduce the problem:<ip1> - <user1> [09/May/2017:13:04:22 -0500] "GET /changes/990298/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"<ip2> - <user2> [09/May/2017:13:04:23 -0500] "GET /changes/990656/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.96 Safari/537.36"<ip3> - <user3> [09/May/2017:13:04:23 -0500] "GET /changes/991102/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"<ip4> - <user4> [09/May/2017:13:04:23 -0500] "GET /changes/991792/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:53.0) Gecko/20100101 Firefox/53.0"100.64.193.60 - - [09/May/2017:13:04:23 -0500] "GET /changes/?n=25&O=81 HTTP/1.1" 200 28 "https://<gerrit server name>/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"<ip5> - <user5> [09/May/2017:13:04:24 -0500] "GET /changes/991116/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36"<ip6> - <user6> [09/May/2017:13:04:24 -0500] "GET /changes/978360/detail?O=404 HTTP/1.1" 304 - "https://<gerrit server name>/" "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:53.0) Gecko/20100101 Firefox/53.0"Nothing was reported on error_log at this specific period:[logs]$ cat error_log | grep "13:04:20"[logs]$ cat error_log | grep "13:04:21"[logs]$ cat error_log | grep "13:04:22"[logs]$ cat error_log | grep "13:04:23"[logs]$ cat error_log | grep "13:04:24"[logs]$ cat error_log | grep "13:04:25"[logs]$Only few pushes reported on sshd_log at this specific period:[2017-05-09 13:04:20,111 -0500] 8008524b <userA> a/1001028 LOGIN FROM <ip X>[2017-05-09 13:04:20,526 -0500] 203c06f2 <userB> a/1010806 LOGIN FROM <ip Y>[2017-05-09 13:04:21,456 -0500] 203c06f2 <userB> a/1010806 git-receive-pack./<project 1> 1ms 379ms 0[2017-05-09 13:04:21,798 -0500] 8008524b <userA> a/1001028 git-receive-pack./<project 2> 1ms 1671ms 0[2017-05-09 13:04:21,799 -0500] 8008524b <userA> a/1001028 LOGOUT[2017-05-09 13:04:21,857 -0500] 203c06f2 <userB> a/1010806 LOGOUT[2017-05-09 13:04:23,118 -0500] 4016da78 <userA> a/1001028 LOGIN FROM <ip X>[2017-05-09 13:04:23,207 -0500] e0182ea3 <userB> a/1010806 LOGIN FROM <ip Y>[2017-05-09 13:04:23,776 -0500] 4016da78 <userA> a/1001028 git-receive-pack./<project 3> 0ms 643ms 0[2017-05-09 13:04:23,777 -0500] 4016da78 <userA> a/1001028 LOGOUT[2017-05-09 13:04:24,178 -0500] e0182ea3 <userB> a/1010806 git-receive-pack./<project 4> 1ms 421ms 0[2017-05-09 13:04:24,587 -0500] e0182ea3 <userB> a/1010806 LOGOUTAs I mentioned, this is happening repeatedly, so it is a very big problem.I have found a similar problem reported on this forum at the page below but unfortunately no conclusive information on how to fix or reproduce the problem was given:Does anyone know what could be happening here?Thanks in advance,Leandro.