[Spacewalk-list] spacewalk 1.3 monitoring scout status
Muhammad Atif
m_atif_s at yahoo.com
Wed Feb 23 04:28:54 UTC 2011
Thanks for the log locations. I have looked at the logs, which I will paste
after some further background on the issue.
First, even when I add a new probe (say, load) to the client system, the
monitoring scout does not display "push required". Shouldn't it notify me
that a push is required?
When I do the push manually, I start to get information about the probe.
However, the status of the monitoring scout remains "Request Failed".
I have looked at the logs and could not figure out the exact issue. It seems
that there is one odd command that fails to execute. This keeps happening for
some time, and then Spacewalk seemingly gives up and I get the "Request Failed"
status.
I am stuck as to where to start debugging. Any help is appreciated.
Output from /var/log/nocpulse/execute_commands.log:
============================================================================
2011-02-23 15:13:21 96: Starting queue run
2011-02-23 15:13:21 95: Freshening heartbeat file
/var/lib/nocpulse/commands/heartbeat
2011-02-23 15:13:21 104: Attempting to fetch commands (attempt 0)
2011-02-23 15:13:21 95: Queue server URL:
https://spacewalkX/satconfig/cgi-mod-perl/fetch_commands.cgi?cluster_id=cd5832fc859b&node_id=2&role=lead&version=1.0
2011-02-23 15:13:21 95: Successfully got command list
2011-02-23 15:13:21 95: CommandQueueParser parsing 410 bytes
2011-02-23 15:13:21 95: Parser::parse returning
NOCpulse::CommandQueue=HASH(0x118d5030)
2011-02-23 15:13:21 95: CQ::fetch_commands returning
NOCpulse::CommandQueue=HASH(0x118d5030)
2011-02-23 15:13:21 121: Fetched 1 commands
2011-02-23 15:13:21 130: Running command 45
2011-02-23 15:13:21 131: Command line is: /usr/bin/scheduleEvents
2011-02-23 15:13:21 133: Command is:
$VAR1 = bless( {
'restartable' => '0',
'cluster_id' => 'cd5832fc859b',
'command_line' => '/usr/bin/scheduleEvents',
'heartbeatfreq' => 60,
'effective_group' => 'nocpulse',
'timeout' => '600',
'debug' => bless( {
'_maxlevel' => 3,
'_streams' => [
bless( {
'timestamps' => 1,
'linenumbers' =>
0,
'level' => 0,
'stamper' => sub {
"DUMMY" },
'buffering' => 0,
'fh' => bless(
\*Symbol::GEN1, 'IO::Handle' ),
'active' => 1,
'append' => '>'
},
'NOCpulse::Debug::Stream::literal' ),
bless( {
'timestamps' => 1,
'linenumbers' =>
1,
'level' => 3,
'filename' =>
'/var/log/nocpulse/execute_commands.log',
'stamper' =>
$VAR1->{'debug'}{'_streams'}[0]{'stamper'},
'buffering' => 0,
'fh' => bless(
\*Symbol::GEN6, 'IO::Handle' ),
'active' => 1,
'append' => '>>'
},
'NOCpulse::Debug::Stream::literal' )
]
}, 'NOCpulse::Debug' ),
'target_type' => 'cluster',
'expdate' => '1298434960',
'id' => '45',
'laststartedfile' => '/var/lib/nocpulse/commands/last_started',
'netsaint_id' => '1',
'effective_user' => 'nocpulse',
'lastcompletedfile' =>
'/var/lib/nocpulse/commands/last_completed',
'heartbeatfile' => '/var/lib/nocpulse/commands/heartbeat',
'queue' => 'commands'
}, 'NOCpulse::CommandQueue::Command' );
2011-02-23 15:13:21 95: Attempting to run command 45
2011-02-23 15:13:21 95: Last started: 44; Last completed: 44
2011-02-23 15:13:21 95: 45 is executable (new)
2011-02-23 15:13:21 95: Executing command ...
2011-02-23 15:13:21 95: Freshening heartbeat file
/var/lib/nocpulse/commands/heartbeat
2011-02-23 15:13:24 95: Shell command exited with 0 exit status
2011-02-23 15:13:24 95: STDOUT:<?xml version="1.0"?>
<sat_config id="cd5832fc859b" version="1.1">
<step name="requestConfigData">
<status>ok</status>
<message>Nolog probe table synchronized
</message>
</step>
<step name="loadCommandParameters">
<status>ok</status>
</step>
<step name="buildProbeRecordDB">
<status>ok</status>
</step>
<step name="buildProbeDB">
<status>ok</status>
</step>
<step name="generateEvents">
<status>ok</status>
<message>Generated 1 events on satellite</message>
</step>
<step name="schedulerReload">
<status>ok</status>
<message>Scheduler reloaded
</message>
</step>
<status>ok</status>
<message>Scheduler reloaded
</message>
</step>
<status>ok</status>
</sat_config>
2011-02-23 15:13:24 95: STDERR:
2011-02-23 15:13:25 155: Finished with command 45
2011-02-23 15:13:25 158: Finished processing commands
2011-02-23 15:13:25 95: Freshening heartbeat file
/var/lib/nocpulse/commands/heartbeat
2011-02-23 15:13:25 174: Not sleeping after processing commands
2011-02-23 15:13:25 96: Starting queue run
2011-02-23 15:13:25 95: Freshening heartbeat file
/var/lib/nocpulse/commands/heartbeat
2011-02-23 15:13:25 104: Attempting to fetch commands (attempt 0)
2011-02-23 15:13:25 95: Queue server URL:
https://spacewalkX/satconfig/cgi-mod-perl/fetch_commands.cgi?cluster_id=cd5832fc859b&node_id=2&role=lead&version=1.0
2011-02-23 15:13:25 95: Successfully got command list
2011-02-23 15:13:25 95: CommandQueueParser parsing 410 bytes
2011-02-23 15:13:25 95: Parser::parse returning
NOCpulse::CommandQueue=HASH(0x118d5030)
2011-02-23 15:13:25 95: CQ::fetch_commands returning
NOCpulse::CommandQueue=HASH(0x118d5030)
2011-02-23 15:13:25 121: Fetched 1 commands
2011-02-23 15:13:25 130: Running command 45
2011-02-23 15:13:25 131: Command line is: /usr/bin/scheduleEvents
2011-02-23 15:13:25 133: Command is:
$VAR1 = bless( {
'restartable' => '0',
'cluster_id' => 'cd5832fc859b',
'command_line' => '/usr/bin/scheduleEvents',
'heartbeatfreq' => 60,
'effective_group' => 'nocpulse',
'timeout' => '600',
'debug' => bless( {
'_maxlevel' => 3,
'_streams' => [
bless( {
'timestamps' => 1,
'linenumbers' =>
0,
'level' => 0,
'stamper' => sub {
"DUMMY" },
'buffering' => 0,
'fh' => bless(
\*Symbol::GEN1, 'IO::Handle' ),
'active' => 1,
'append' => '>'
},
'NOCpulse::Debug::Stream::literal' ),
bless( {
'timestamps' => 1,
'linenumbers' =>
1,
'level' => 3,
'filename' =>
'/var/log/nocpulse/execute_commands.log',
'stamper' =>
$VAR1->{'debug'}{'_streams'}[0]{'stamper'},
'buffering' => 0,
'fh' => bless(
\*Symbol::GEN6, 'IO::Handle' ),
'active' => 1,
'append' => '>>'
},
'NOCpulse::Debug::Stream::literal' )
]
}, 'NOCpulse::Debug' ),
'target_type' => 'cluster',
'expdate' => '1298434960',
'id' => '45',
'laststartedfile' => '/var/lib/nocpulse/commands/last_started',
'netsaint_id' => '1',
'effective_user' => 'nocpulse',
'lastcompletedfile' =>
'/var/lib/nocpulse/commands/last_completed',
'heartbeatfile' => '/var/lib/nocpulse/commands/heartbeat',
'queue' => 'commands'
}, 'NOCpulse::CommandQueue::Command' );
2011-02-23 15:13:25 95: Attempting to run command 45
2011-02-23 15:13:25 95: Last started: 45; Last completed: 45
2011-02-23 15:13:25 95: 45 is not executable (old, last started 45, last
completed 45)
2011-02-23 15:13:25 151: Not executing command 45: 45 is not executable (old,
last started 45, last completed 45)
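In the excerpt above, command 45 is accepted as "executable (new)" on the first queue run, exits with status 0, and is then fetched again and rejected as "not executable (old ...)" on the next run. To check whether the same pattern repeats across the whole log, something like the following could be used. It is a minimal sketch that assumes only the line shapes visible in the excerpt; the function names `command_verdicts` and `skipped_after_run` are made up for illustration and are not part of NOCpulse or Spacewalk.

```python
import re

# Matches lines such as:
#   2011-02-23 15:13:21 95: 45 is executable (new)
#   2011-02-23 15:13:25 95: 45 is not executable (old, last started 45, last
# Anchoring at the start skips the "Not executing command 45: ..." echo line.
VERDICT_RE = re.compile(
    r"^\S+ \S+ \d+: (\d+) is (not executable|executable) \((\w+)"
)


def command_verdicts(log_text):
    """Return (command_id, verdict, reason) tuples in the order they were logged."""
    verdicts = []
    for line in log_text.splitlines():
        match = VERDICT_RE.match(line)
        if match:
            verdicts.append((int(match.group(1)), match.group(2), match.group(3)))
    return verdicts


def skipped_after_run(verdicts):
    """Return ids that were accepted once and later rejected as not executable."""
    executed = set()
    skipped = set()
    for command_id, verdict, _reason in verdicts:
        if verdict == "executable":
            executed.add(command_id)
        elif command_id in executed:
            skipped.add(command_id)
    return sorted(skipped)
```

Reading the whole file into a string and passing it to `command_verdicts`, then to `skipped_after_run`, lists the command ids that the queue server keeps handing out after they have already run once.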
Best Regards,
Atif
________________________________
From: Miroslav Suchy <msuchy at redhat.com>
To: spacewalk-list at redhat.com
Sent: Thu, February 17, 2011 6:47:51 PM
Subject: Re: [Spacewalk-list] spacewalk 1.3 monitoring scout status
On 16.2.2011 06:44, Muhammad Atif wrote:
> Hi,
> We have just installed Spacewalk 1.3 on CentOS 5.5. The system so far
> seems to be working fine; however, on the monitoring side we have a little
> confusion. We are able to push the probes to the desired clients and
> can monitor them, but the
> Spacewalk Monitoring Scout gives the status of Request Expired (to
> add, it does seem to try: the status remains pending for a while and
> then expires).
>
> My question is: where in the logs can I find which part of the
> configuration is failing? Can someone point me to the appropriate log
> files so that we can start to debug the issue, please?
/var/log/nocpulse/*
/var/log/notification/*
Log files of Spacewalk itself are in:
/var/log/rhn/*
And very often there is useful information in the Tomcat log:
/var/log/tomcat*/catalina.out
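As a starting point for going through these locations, the paths above can be scanned for suspicious lines in one pass. This is only a sketch: the glob patterns are the ones listed above, but the keyword list in `SUSPECT_RE` and the function name `find_suspect_lines` are assumptions chosen for illustration, not anything Spacewalk provides.

```python
import glob
import re

# Log locations listed above.
LOG_GLOBS = [
    "/var/log/nocpulse/*",
    "/var/log/notification/*",
    "/var/log/rhn/*",
    "/var/log/tomcat*/catalina.out",
]

# Hypothetical keyword choice; adjust to whatever the logs actually contain.
SUSPECT_RE = re.compile(r"error|fail|exception|traceback", re.IGNORECASE)


def find_suspect_lines(patterns=LOG_GLOBS, suspect_re=SUSPECT_RE):
    """Yield (path, line_number, line) for every line matching suspect_re."""
    for pattern in patterns:
        for path in sorted(glob.glob(pattern)):
            try:
                with open(path, errors="replace") as handle:
                    for number, line in enumerate(handle, start=1):
                        if suspect_re.search(line):
                            yield path, number, line.rstrip("\n")
            except OSError:
                # Directories and unreadable files are skipped; many of
                # these logs are readable only by root.
                continue
```

Iterating over `find_suspect_lines()` and printing each tuple gives a grep-like listing; passing a narrower pattern list limits the scan to a single directory.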
Miroslav Suchy
_______________________________________________
Spacewalk-list mailing list
Spacewalk-list at redhat.com
https://www.redhat.com/mailman/listinfo/spacewalk-list