Thanks Josh. I think we may be talking about two different situations.
What I am asking about is the bogus sar data reported after a node has
been power cycled to restore it to use, that is, power cycled without
the benefit of a clean shutdown first.
We know that the %user logged immediately after the power cycle is not
correct.
What we are interested in doing is resetting the sar counters upon
startup immediately following the power cycle to restore the sar
information to a sane state, so that only accurate data is reported by
the sar command.
I've tried an /etc/rc.d/rc3.d script that runs '/usr/sbin/sadc -'
before cron or accounting is started (it runs as an S07 script), but
that did not do the job.
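A minimal sketch of that kind of early-boot script, for illustration only and not the actual script used here. The function name and the SADC override variable are assumptions, and the sadc path is simply the one quoted above (it differs between distributions):

```shell
#!/bin/sh
# Hypothetical S07 start script; names here are illustrative assumptions.
# SADC defaults to the path quoted above; override it if sadc lives elsewhere.
SADC="${SADC:-/usr/sbin/sadc}"

write_restart_marker() {
    # Return non-zero if sadc is not executable at the expected path.
    [ -x "$SADC" ] || return 1
    # With "-" as the output file and no interval given, sadc appends a
    # single dummy (restart) record to the standard daily data file.
    "$SADC" -
}

# Only act on "start", as an rc3.d S-script would be invoked at boot.
case "${1:-}" in
    start) write_restart_marker ;;
esac
```

This only appends a restart marker to the daily file. As noted, it did not clear the bad %user entry in our case.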
Thanks,
Joe
-----Original Message-----
From: lnx...@googlegroups.com [mailto:lnx...@googlegroups.com] On Behalf Of Joshua Aune
Sent: Thursday, July 31, 2008 4:22 PM
To: lnx...@googlegroups.com
Subject: [lnxiug] Re: Corrupted sar entries after power cycle of hung nodes
Hi Joe,
When I see this, it generally means that a user's process ran away,
frequently into swap. I suspect the %usage values are real (though off
a little bit).
Josh
On Jul 31, 2008, at 5:14 PM, Joseph Dowell wrote:
>
> Periodically we've seen nodes hang in a way that they can only be
> restored to use via an ungraceful power cycle (pm -0 node)...