Hi,
I have a few questions.
1. Does this problem occur after you upgrade MR3 1.5 to 1.6-SNAPSHOT? I would like to know if there was any change made to your installation that caused this problem.
2. Does HiveServer2 always fail with the same error?
3. Is the number of concurrent users larger than 20? For example, Metastore client says 25 concurrent connections.
2022-10-10T19:03:54,596 INFO [Heartbeater-4] metastore.HiveMetaStoreClient: Opened a connection to metastore, current connections: 25
4. HiveServer2 stops after this message (when it has 40 sessions). Does
2022-10-10T19:05:59,140 INFO [org.apache.hadoop.hive.common.JvmPauseMonitor$Monitor@45f8415b] common.JvmPauseMonitor: Detected pause in JVM or host machine (eg GC): pause of approximately 4095ms
GC pool 'PS MarkSweep' had collection(s): count=2 time=4281ms
GC pool 'PS Scavenge' had collection(s): count=3 time=72ms
2022-10-10T19:05:59,140 INFO [HiveServer2-Handler-Pool: Thread-1045] service.CompositeService: Session closed, SessionHandle [979b5e81-79f1-493c-8f40-24d1fa98475c], current sessions:40
...
2022-10-10T19:05:59,162 INFO [HiveServer2-Handler-Pool: Thread-1045] server.HiveServer2: Shutting down HiveServer2