Marc Sluiter
He / Him / His
Principal Software Engineer
Red Hat GmbH, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany
Commercial register: Amtsgericht Muenchen/Munich, HRB 153243,
Managing Directors: Ryan Barnhart, Charles Cachera, Michael O'Neill, Amy Ross
--
You received this message because you are subscribed to the Google Groups "medik8s" group.
To unsubscribe from this group and stop receiving emails from it, send an email to medik8s+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/medik8s/87f2284a-58e3-416c-a2e3-0739db7a80e5n%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi.. Apologies for the delay in reply, as project work has been very busy. We have been able to make changes and validate behavior, and I wanted to be sure that we were happy with the implementation before continuing the discussion.
Existing behavior is that if there is only a single worker node known, it would always be marked as healthy in apicheck/check.go, and no remediation actions could occur even if is actually unhealthy. In our case, we have deployments that by design could consist of 3 control plane nodes + 1 worker node. For us, even in that situation, if the worker node goes unhealthy for connectivity or other reasons, we still want to attempt remediation.
So, to accomplish this, we have added the following configuration value to SNR. Default is one, which preserves existing behavior as the default.
minPeersForRemediation: 1 - Specify the number of worker peers needed to be able to contact before determining a node is unhealthySo, what I'd like to potentially do is, if at a high level there is no conflict here with project direction, validate the naming of the property we have added, and then submit a PR.
To view this discussion on the web visit https://groups.google.com/d/msgid/medik8s/9fa764ad-2ac2-487f-a5f0-b423304110bbn%40googlegroups.com.
Michael Shitrit
Principal Software Engineer
Hi.For the existing described functionality, we are preparing a PR (I've created an issue for it already), but we need to get the unit tests fully running. We were able to run unit tests for validating configuration updates, but not the actual change to apicheck - all the tests there currently fail.
- Yes
- This would be a good addition as well. We will consider supporting this for our internal customers, and pushing relevant changes
To view this discussion on the web visit https://groups.google.com/d/msgid/medik8s/8211a81e-a18d-4666-a971-6f61f7c8d6afn%40googlegroups.com.
Marc Sluiter
He / Him / His
Principal Software Engineer
Red Hat GmbH, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany
Commercial register: Amtsgericht Muenchen/Munich, HRB 153243,
Managing Directors: Ryan Barnhart, Charles Cachera, Michael O'Neill, Amy Ross
To view this discussion visit https://groups.google.com/d/msgid/medik8s/d857243b-8574-45bd-9c7c-8fd520ef8c47n%40googlegroups.com.