Return code 128. Nagios service "Unknown" W2k3 guest monitored.

258 views
Skip to first unread message

edos77

unread,
Jan 31, 2007, 9:07:28 AM1/31/07
to Nagios Config
One day on my Nagios webinterface i find a guest W2k3 monitored that
output "Unknown" in status service. Only this server had this strange
problem, other server function correctly ...

After trying various basic procedures to resolve it, as restart nagios
service on guest, recreate service, comment all command on nrpe.cfg,
but none of these resolve my problem, status of these two guest rest
in a "Unknown" state.

So, on my nrpe_nt config files (%systemroot%\...\bin\nrpe.cfg) i
activate debug option as below:

--------
# DEBUGGING OPTION
# This option determines whether or not debugging messages are logged
to the
# eventlog.
# Values: 0=debugging off, 1=debugging on

debug=1
---------

This function create a file .log (in same folder as before) that says
to me what appens when nagios server starts to control my guest and
also starts any command related to it. So i'm able to debug any error
that occours.

In a quickly view of this log i see a strange row:

--------

2007-00-30 12:32:24; 198844; 6;Listening on port 5666
2007-00-30 12:32:31; 198732; 7;Host is asking for command
'nt_check_disk_c' to be run...
2007-00-30 12:32:31; 198732; 7;Running command: C:\nrpe_nt\plugins\bin
\diskspace_nrpe_nt.exe c: 80 90
2007-00-30 12:32:31; 198732; 7;Command completed with return code 128
2007-00-30 12:32:31; 198732; 4;Bad return code for [No output
available from command...]: 128
2007-00-30 12:32:56; 195064; 7;Host is asking for command
'nt_check_disk_e' to be run...
2007-00-30 12:32:56; 195064; 7;Running command: C:\nrpe_nt\plugins\bin
\diskspace_nrpe_nt.exe e: 80 90
2007-00-30 12:32:56; 195064; 7;Command completed with return code 128
2007-00-30 12:32:56; 195064; 4;Bad return code for [No output
available from command...]: 128
2007-00-30 12:33:11; 198844; 7;Exited listener loop

---------

"Command completed with return code 128" "Bad return code for ...:
128" so i search in internet. On various forum that i see, nobody say
nothing about it and how to resolve this problem, that seems born
automatically itself. And in appearence seems a communication problem.

Debug doesn't help us to resolve this problem, so we have to try
another tools (File monitor) to known what windows process create this
problem related to nrpe check.

The process that create it, at the end, is "csrss.exe" and is related
with a terminal server session that is opened from few days and never
(in this time) has been closed by a "logoff" action. In fact when I
close this opened terminal server session all turn on normally and
Nagios's checks start to control guest correctly!!!!

Edo

Reply all
Reply to author
Forward
0 new messages