No, there is not. That is one of the reasons why we decided to create
this working group. We have a lot of ideas and plans, but those were always
discussed among a small number of people, and mainly in private. With
the working group we want to change that. I am currently waiting on some
On Sat, Nov 15, 2025 at 04:48:50AM -0800, Camilla Montonen wrote:
> Hello everyone!
>
> > We are already discussing the future of
> > Checkpoint/Restore in Kubernetes at conferences, in private email
> > threads, private Slack channels, in GitHub issues and pull requests, and
> > an increasing number of people propose good ideas on how to move
> > forward.
>
> I was wondering if there is a single document or maybe documents that give
> an overview of the current
> state of the design/sketches/ideas/discussions of how this functionality
> would work from the perspective of a user (someone running workloads on k8s)?
>
> I have an idea on what I'd like the user-facing design of
> checkpoint-restore to look like based on my own very narrow experiences of
> running ML pipelines and model training jobs on k8s,
> but I'd like to expand my context and thinking on this by reading what
> others have already proposed in this space.
>
> Thanks in advance!
> Camilla
>
>
> On Wednesday, October 29, 2025 at 8:29:20 PM UTC+1 Dhruv Bindra wrote:
>
> > Hello, +1 for WG, I'm Dhruv from Manifold Labs. I'm interested in
> > contributing
> >
> > On Saturday, August 16, 2025 at 8:59:15 AM UTC-5 Parthiba Hazra wrote:
> >
> >> +1 for the WG, Hi I'm Parthiba from DevZero (devzero.io). We're building a custom Kubernetes