LBNL Node Health Check
Ryan Novosielski
5/21/24
Negating a mount or files check, NHC 1.4.2
Hi there, I'm not sure if this is possible and I'm missing it in the docs, or if it's not
Michael Jennings
2/1/23
NHC Status Update
Hi folks! I hope you all are having a great 2023 so far. Those of you who've been paying
Jason Simms, … Erik Ellestad (4 messages)
7/21/21
SLURM Integration
For what it is worth, I had the best luck adding "NHC_RM=slurm" to /etc/sysconfig/nhc and
Ole Holm Nielsen (2 messages)
4/24/21
A new check_all_fs_used function?
I've created a pull request https://github.com/mej/nhc/pull/101 proposing these new functions: *
Michael Jennings, Heitor (2 messages)
4/23/21
Re: [slurm-users] NHC and slurm
Hello Michael, On Tue, 20 Apr 2021 19:17:57 -0600 Michael Jennings <m...@lanl.gov> wrote: >
Jennings, Michael E
12/6/18
NHC Project Update
Hi everyone! It's been awhile, so I wanted to provide an update on where things stand with NHC.
Dockendorf, Trey; John Hearns (4 messages)
10/31/18
Running GPFS commands with check_cmd_output
Thanks for pointing me at mmhealth, looks like a system not using RDMA will show as unhealthy which
Belgin, Mehmet; … Michael Jennings (4 messages)
7/16/18
Best way to setup env for NHC?
On Thu, Jul 5, 2018 at 3:22 PM, Belgin, Mehmet <mehmet...@oit.gatech.edu> wrote: > I
Belgin, Mehmet; Michael Jennings (2 messages)
7/16/18
NHC loglevels?
On Fri, Jul 6, 2018 at 2:44 PM, Belgin, Mehmet <mehmet...@oit.gatech.edu> wrote: > I
Brian Kircher
7/6/17
Simple NHC test install on an SGE managed node
Good morning, I have what I hope is a simple question on an initial configuration issue I ran into.
Pariksheet Nanda, Michael Jennings (2 messages)
5/25/17
How does one process .spec.in file to .spec?
Hi Pariksheet, As with *.in files in general, at least with projects that use GNU Autotools, the
Dockendorf, Trey
3/28/17
File descriptor issue with services started by NHC when invoked by pbs_mom
I've hit an issue that occurs when pbs_mom executes NHC and NHC restarts a failed service. The
Johan Guldmyr, … Michael Jennings (8 messages)
3/2/17
Q&A
How to best use NHC to check that time is in sync?
Nice! Thanks Michael :) I think some of these should work quite nicely. // Johan On Thursday, March 2
John Griffin-Wiesner, … John Griffin-Wiesner (5 messages)
2/17/17
nhc keeps appending to note
Thanks everyone for the feedback. Turns out we had 6 settings on pbs_server that we'd set for our
Eisa Hedayati, … Gowtham (3 messages)
1/11/17
NHC Configuration on SGE
Thank you for helping us out, Michael. Much appreciated! Best regards, Gowtham -- Gowtham, PhD
Dockendorf, Trey; Michael Jennings (2 messages)
10/11/16
NHC gets hung when IB gets into bad state
Hi Trey! On Wed, Oct 5, 2016 at 1:33 PM, Dockendorf, Trey <tdock...@osc.edu> wrote: >
Simpson Lachlan, Michael Jennings (4 messages)
9/26/16
joining list? Also: hardware distinction
On Sun, Sep 25, 2016 at 6:08 PM, Simpson Lachlan <Lachlan...@petermac.org> wrote: >
Ryan Novosielski; … Bidwell, Matt (4 messages)
8/31/16
check_hw_ib minimum rate (not exact)
Created Issue #21 for this. Let me know if I can be of some help somehow. > On Aug 23, 2016, at 12
Cam
4/22/16
NHC and SGE implementations
While NHC seems to be geared more towards SLURM and TORQUE, reading through documentation and the