NHC and List of supported problems


Safadi Mohi

Feb 26, 2025, 11:41:04 AM
to medik8s
What is the list of supported problems that Node Health Check (NHC) can detect? As I read in the FAQ:

"Can I create my own definition of what counts as a healthy node?

Yes. Node Healthcheck determines node health based on NodeConditions. There are a set of basic conditions built into Kubernetes, but additional conditions can be defined and then referenced by Node Healthcheck. Node Problem Detector is a common tool for creating and updating NodeConditions based on log scraping."

Does this mean that in the case of a read-only filesystem, or any other problem on the node, NHC will not report that the node is not ready?

So it supports just these node conditions:
["Ready", "DiskPressure", "MemoryPressure", "PIDPressure" and "NetworkUnavailable"]

So a problem such as [ReadonlyFilesystem] won't be detected by NHC?

So to get better results, do we have to run both NHC and NPD (Node Problem Detector) on the node, together with Self Node Remediation?

What would a best-practice setup look like if we need to detect all known issues on the node?


Node Problem Detector has its own list of problems that it detects.

Marc Sluiter

Feb 27, 2025, 3:43:42 AM
to Safadi Mohi, medik8s
Hi.

Sorry, it's not clear to me what the actual question is.
NHC is looking at node conditions indeed. The condition types which trigger NHC can be configured in the NodeHealthCheck CR.
NHC does not care if those are "built in" condition types, or if they are set on the node by a 3rd party tool like Node Problem Detector.
Just be aware that NPD is not supported by Red Hat, in case that matters.

Hope this helps,

Marc




Safadi Mohi

Feb 27, 2025, 3:52:05 AM
to medik8s
OK, got it.

This is what I wanted to clarify.

NHC only checks node conditions that are actually set on the node. These conditions can be set by a third-party tool (such as Node Problem Detector). NHC just monitors the configured conditions and takes action if one of them indicates a failure.
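
For anyone reading this later, the behaviour summarized in this thread can be sketched roughly as below. This is only an illustrative Python model, not NHC's actual code (NHC is a Kubernetes operator and its real logic lives in the NodeHealthCheck controller). The class names, the `is_unhealthy` helper, the 300-second durations, and the assumption that each configured rule is a condition type, a status, and a minimum duration are all hypothetical choices for this example. The `ReadonlyFilesystem` rule only has an effect if some tool such as Node Problem Detector actually sets that condition on the node.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeCondition:
    """One condition as reported on a node (hypothetical simplified model)."""
    type: str
    status: str               # "True", "False" or "Unknown"
    seconds_in_status: float  # how long the condition has held this status


@dataclass(frozen=True)
class UnhealthyRule:
    """One configured rule: which condition type/status counts as unhealthy."""
    type: str
    status: str
    duration_seconds: float   # assumed minimum time before the rule triggers


def is_unhealthy(node_conditions, rules):
    """Return True if any node condition matches a rule for long enough.

    The check does not care who set the condition: built-in Kubernetes
    conditions and third-party ones are treated the same way.
    """
    for cond in node_conditions:
        for rule in rules:
            if (cond.type == rule.type
                    and cond.status == rule.status
                    and cond.seconds_in_status >= rule.duration_seconds):
                return True
    return False


rules = [
    UnhealthyRule("Ready", "False", 300),
    UnhealthyRule("Ready", "Unknown", 300),
    # Only meaningful if something (e.g. Node Problem Detector) sets it.
    UnhealthyRule("ReadonlyFilesystem", "True", 300),
]

node = [
    NodeCondition("Ready", "True", 86400),
    NodeCondition("ReadonlyFilesystem", "True", 600),
]

print(is_unhealthy(node, rules))
```

In this model the node still reports Ready, but it is treated as unhealthy because the third-party `ReadonlyFilesystem` condition matches a configured rule. If that rule were left out of the list, the same node would be considered healthy.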