Red Hat GmbH, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany Commercial register: Amtsgericht Muenchen/Munich, HRB 153243, Managing Directors: Ryan Barnhart, Charles Cachera, Avril Crosse O'Flaherty
On 6/06/25 18:53, 'Dmitry Tantsur' via Metal3 Development List wrote:
> Hi team,
>
> I'm not sure about everyone here, but I regularly get complaints from
> customers that use forced deletion of BMH objects as a way to "fix"
> problems like stuck cleaning. As explained in our troubleshooting FAQ
> [1], it may cause issues on re-registration because of the stale node
> record in Ironic.
>
> The FAQ suggests deleting and re-creating the Ironic pod. But this
> "solution" is not just pretty inconvenient, it won't work when a real
> database backed by a persistent volume is used. I'd like to open a
> brainstorming session on solving this issue.
>
> My immediate idea is to introduce a new internal CRD IronicNode that is
> created for any node that a BMH is backed by. It will be linked to the
> BMH by an ownerReference and will also have a finalizer. If the
> IronicNode controller finds that the BMH is gone, it will try to get rid
> of the corresponding Node as soon as possible. This will include
> aborting cleaning/inspection/deployment to get the Node into some
> deletable state.
Users will just delete the finalizer on the IronicNode.
> This addition may have a positive side effect of splitting the BMH
> controller into two controllers, one handling high-level and one low-
> level provisioning logic. It does provide an easy way to bypass
> cleaning, which I don't particularly like. But so does the current
> workaround of removing the pod.
>
> Does anyone have any better ideas? And what do you think about this idea?
Since it doesn't appear we will ever try to do discovery, what about
having some sort of process that periodically scans the ironic nodes and
deletes any that aren't associated with a BMH?
Or even simpler, store the UID of the BMH somewhere in the ironic Node
(which you'd probably have to do to safely implement the above anyway),
and then when someone creates a new BMH with the same name delete any
existing node that references a different UID.
(I want to say that some version of this was first suggested by Mahnoor,
but I can't find where so I assume that any good parts in the idea were
hers and the bad parts were mine.)
- ZB
--
You received this message because you are subscribed to the Google Groups "Metal3 Development List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to metal3-dev+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/metal3-dev/e1c30d9d-e823-4dd7-9668-5ab935b431ea%40redhat.com.
On 7/06/25 03:27, Dmitry Tantsur wrote:
> Thank you for chiming in!
>
> On Fri, Jun 6, 2025 at 2:19 PM 'Zane Bitter' via Metal3 Development List
> <metal...@googlegroups.com <mailto:metal...@googlegroups.com>> wrote:
>
> On 6/06/25 18:53, 'Dmitry Tantsur' via Metal3 Development List wrote:
> > Hi team,
> >
> > I'm not sure about everyone here, but I regularly get complaints
> from
> > customers that use forced deletion of BMH objects as a way to "fix"
> > problems like stuck cleaning. As explained in our troubleshooting
> FAQ
> > [1], it may cause issues on re-registration because of the stale
> node
> > record in Ironic.
> >
> > The FAQ suggests deleting and re-creating the Ironic pod. But this
> > "solution" is not just pretty inconvenient, it won't work when a
> real
> > database backed by a persistent volume is used. I'd like to open a
> > brainstorming session on solving this issue.
<snip>
> Dmitry
>
>
> (I want to say that some version of this was first suggested by
> Mahnoor,
> but I can't find where so I assume that any good parts in the idea were
> hers and the bad parts were mine.)
>
> - ZB
>
> --
> You received this message because you are subscribed to the Google
> Groups "Metal3 Development List" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to metal3-dev+...@googlegroups.com
> <mailto:metal3-dev%2Bunsu...@googlegroups.com>.
> To view this discussion visit https://groups.google.com/d/msgid/
> metal3-dev/e1c30d9d-e823-4dd7-9668-5ab935b431ea%40redhat.com
> <https://groups.google.com/d/msgid/metal3-dev/e1c30d9d-
> e823-4dd7-9668-5ab935b431ea%40redhat.com>.
>
>
>
> --
>
> Red Hat GmbH <https://www.redhat.com/de/global/dach>, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany
> Commercial register: Amtsgericht Muenchen/Munich, HRB 153243,
> Managing Directors: Ryan Barnhart, Charles Cachera, Avril Crosse O'Flaherty
>
--
You received this message because you are subscribed to the Google Groups "Metal3 Development List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to metal3-dev+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/metal3-dev/dc8a3063-545e-4703-95ce-c0357da879f1%40redhat.com.
Red Hat GmbH, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany Commercial register: Amtsgericht Muenchen/Munich, HRB 153243, Managing Directors: Ryan Barnhart, Charles Cachera, Avril Crosse O'Flaherty