Hi SIG-arch,
Internally we have been discussing how we can improve the safety and reliability of Kubernetes in general, and in upgrades and the use of new features in particular. We thought it may be time to introduce production readiness reviews for Kubernetes features, and would like feedback on this idea.
The goal here is to ensure that new versions and features can be rolled out to clusters in a safe, controlled manner, and can be monitored, debugged, and disabled if necessary. This starter KEP provides a bit more detail:
Please feel free to comment here or, preferably, on the PR.
While this does add some process overhead, I think that at this stage in Kubernetes' development, it's critical that upgrades do not cause outages for workloads (or for the control plane as much as possible). Without the kind of supports described here, that will be impossible to achieve.
John