Hi Thomas,
Thanks for reaching out, and for your fantastic work on the KeyCloak chart! I’m personally loving the chart. We’ve been using KeyCloak under K8s since 2016, but our setup was very basic and super-flaky, losing sessions when pods restarted etc. I moved everything over to the Codecentric chart about a year ago, and it was a massive improvement. We started out on v8 of the chart, I believe, and it didn’t support horizontal scaling out of the box, but I wrote my own HPA and it worked reasonably well.
The biggest issue we faced was when a minor deployment accidentally upgraded us to version 9 of the chart (we weren’t pinning our chart versions!) and broke "All Teh Things!!1!”. While fixing that, I noticed that scaling support had been added, so I removed my HPA and configured it as per the documentation. This was late November last year, and it’s been running without a problem since, serving many thousands of users each day.
The thing I found most difficult was configuring the autoscaling behaviour. For a long time we either had too many pods, or things were crawling due to memory pressure, but the metrics would show RAM usage under 50%, so the autoscaling wouldn’t kick in. IIRC, it wasn’t until I added the ‘behaviours’ section that things really stabilised and started running well. It’s been great ever since, though: