[Spacewalk-list] Spacewalk monitoring - lost connection to monitored host

Jeremy Tan jeremy.tan at msn.com
Fri Aug 19 06:54:16 UTC 2011


Hi All,
 
I have Spacewalk client 1.4 installed, with monitoring enabled. My monitoring scout config is pushed successfully, but the probes that I set up always seem to be in an “Unknown” state. When I drill down to the status of the monitored host, it says “Lost connection to the monitored host”.
 
rhn-runprobe gives me the following output. Does anyone have an idea what is going wrong?
 
Any help is appreciated.
 
Cheers
Jem
 
 
[nocpulse at xxxxxx ~]$ rhn-catalog
30 ServiceProbe on : Grid Linux: Disk Usage
31 ServiceProbe on : Linux: Load
 
[nocpulse at xxxxxx ~]$ rhn-runprobe 30
2011-08-19 06:37:43     Items changed or removed:
2011-08-19 06:37:43             space_avail '27762' is OK
2011-08-19 06:37:43             space_used '459' is OK
2011-08-19 06:37:43             pctused '2' is CRITICAL
2011-08-19 06:37:43             NOCpulse::Probe::Shell::LostConnectionError '' isUNKNOWN
2011-08-19 06:37:43     Would notify because:
2011-08-19 06:37:43             pctused '2' is CRITICAL
2011-08-19 06:37:43             NOCpulse::Probe::Shell::LostConnectionError '' is OK
2011-08-19 06:37:43     NOTE: Running in test mode; no changes saved, nothing enqueued
2011-08-19 06:37:43
============================================================
CRITICAL: Filesystem /dev/mapper/rootvg-varvol (/var): Filesystem pct used 2% (above critical threshold of 1%); Space available 27,762 MB; Space used 459 MB
============================================================
 
[nocpulse at xxxxxx ~]$ rhn-runprobe 31
2011-08-19 06:39:10     Items changed or removed:
2011-08-19 06:39:10             load1 '0.02' is OK
2011-08-19 06:39:10             load15 '0.00' is OK
2011-08-19 06:39:10             load5 '0.03' is OK
2011-08-19 06:39:10             NOCpulse::Probe::Shell::LostConnectionError '' isUNKNOWN
2011-08-19 06:39:10     Would notify because:
2011-08-19 06:39:10             NOCpulse::Probe::Shell::LostConnectionError '' is OK
2011-08-19 06:39:10     NOTE: Running in test mode; no changes saved, nothing enqueued
2011-08-19 06:39:10
============================================================
OK: CPU load 1-min ave 0.02; CPU load 5-min ave 0.03; CPU load 15-min ave 0.00
============================================================
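
For comparing runs, the output above can be reduced to a list of metric states with a short script. This is only a sketch in Python: parse_probe_states and its regular expression are hypothetical, inferred from the lines shown here rather than taken from Spacewalk, and the pattern may not cover other probe types.

```python
import re

# Hypothetical pattern, inferred only from the rhn-runprobe lines in this post.
# Matches e.g. "2011-08-19 06:37:43   pctused '2' is CRITICAL".
# "is\s*" tolerates the missing space seen in "isUNKNOWN".
LINE_RE = re.compile(r"^\S+ \S+\s+(\S+) '([^']*)' is\s*([A-Z]+)\s*$")


def parse_probe_states(text):
    """Group (metric, value, state) tuples under the section header above them."""
    sections = {}
    current = None
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match and current is not None:
            sections[current].append(match.groups())
            continue
        # Section headers look like "<date> <time>   Items changed or removed:"
        parts = line.split(None, 2)
        if len(parts) == 3 and parts[2].rstrip().endswith(":"):
            current = parts[2].rstrip().rstrip(":")
            sections[current] = []
    return sections


if __name__ == "__main__":
    import sys

    # Usage (assumed): rhn-runprobe 30 | python parse_probe.py
    for section, items in parse_probe_states(sys.stdin.read()).items():
        print(section)
        for metric, value, state in items:
            print("  %s=%r -> %s" % (metric, value, state))
```

Run against the probe 30 output, this would list LostConnectionError as UNKNOWN under "Items changed or removed" and as OK under "Would notify because", which is the inconsistency I am asking about.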
