Hi,<br><br>Could you please send your <a href="http://lvs.cf">lvs.cf</a> file?<br><br>--<br><br>Bigo <br><br><div class="gmail_quote">On Thu, Jun 11, 2009 at 6:00 PM, <span dir="ltr"><<a href="mailto:piranha-list-request@redhat.com">piranha-list-request@redhat.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Send Piranha-list mailing list submissions to<br>
<a href="mailto:piranha-list@redhat.com">piranha-list@redhat.com</a><br>
<br>
To subscribe or unsubscribe via the World Wide Web, visit<br>
<a href="https://www.redhat.com/mailman/listinfo/piranha-list" target="_blank">https://www.redhat.com/mailman/listinfo/piranha-list</a><br>
or, via email, send a message with subject or body 'help' to<br>
<a href="mailto:piranha-list-request@redhat.com">piranha-list-request@redhat.com</a><br>
<br>
You can reach the person managing the list at<br>
<a href="mailto:piranha-list-owner@redhat.com">piranha-list-owner@redhat.com</a><br>
<br>
When replying, please edit your Subject line so it is more specific<br>
than "Re: Contents of Piranha-list digest..."<br>
<br>Today's Topics:<br>
<br>
1. Re: lvsd kills off all nannies! (Dan Yocum)<br>
<br><br>---------- Forwarded message ----------<br>From: Dan Yocum <<a href="mailto:yocum@fnal.gov">yocum@fnal.gov</a>><br>To: Piranha clustering/HA technology <<a href="mailto:piranha-list@redhat.com">piranha-list@redhat.com</a>><br>
Date: Wed, 10 Jun 2009 15:58:09 -0500<br>Subject: Re: lvsd kills off all nannies!<br>I just had the same experience again when attempting to add another service to our LVS director and reloading pulse, so upgrading to piranha-0.8.4-11.el5 did not help.<br>
<br>
One thing that I noticed was that the monitor process to one real servers failed right away (the service on that system was actually down). I think this caused the nanny to falter which brought everything down, too. Not good.<br>
<br>
Here's what I saw in /var/log/messages:<br>
<br>
lvs[19604]: rereading configuration file<br>
lvs[19604]: create_monitor for saz-admin:8443/fg5x3 running as pid 31729<br>
lvs[19604]: create_monitor for saz-admin:8443/fg6x3 running as pid 31730<br>
lvs[19604]: nanny for child saz-admin:8443/fg5x3 died! shutting down lvs<br>
lvs[19604]: shutting down virtual service MYSQL:3306<br>
lvs[19604]: shutting down virtual service SAZ:8888<br>
lvs[19604]: shutting down virtual service SAZ:8881<br>
lvs[19604]: shutting down virtual service SAZ:8882<br>
lvs[19604]: shutting down virtual service voms:8443<br>
lvs[19604]: shutting down virtual service voms-osg:8443<br>
lvs[19604]: shutting down virtual service gums:8443<br>
nanny[19614]: Terminating due to signal 15<br>
nanny[19617]: Terminating due to signal 15<br>
nanny[19622]: Terminating due to signal 15<br>
nanny[19644]: Terminating due to signal 15<br>
nanny[19645]: Terminating due to signal 15<br>
nanny[19647]: Terminating due to signal 15<br>
etc.<br>
<br>
Thanks,<br>
Dan<br>
<br>
<br>
<br>
Dan Yocum wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Hi Barry,<br>
<br>
We're on piranha-0.8.4-9.3.el5. I will upgrade to release 11 and see if that helps.<br>
<br>
Thanks,<br>
Dan<br>
<br>
<br>
Barry Brimer wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Quoting Dan Yocum <<a href="mailto:yocum@fnal.gov" target="_blank">yocum@fnal.gov</a>>:<br>
<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Hi all,<br>
<br>
Here's the situation we're running into - after setting a real server to<br>
active = 0 and weight = 0 and reloading pulse, <perform some work on the<br>
RS>, set active = 1 and weight = 3 and reloading pulse, lvsd first<br>
creates the monitor for the process, which dies for some strange reason,<br>
then proceeds to shutdown *all* virtual services!!<br>
<br>
Here's what I see in /var/log/messages:<br>
</blockquote>
<br>
<snip><br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Performing a 'service pulse restart' brings everything back online just<br>
fine.<br>
<br>
What's going on here?<br>
<br>
The OS is Scientific Linux 5.2 (i.e., RHELv5.2) on a Xen VM, kernel<br>
2.6.18-128.1.6.el5xen.<br>
</blockquote>
<br>
What version of piranha do you have installed? The latest version seems to<br>
correct some nanny/pulse related issues<br>
<<a href="https://rhn.redhat.com/errata/RHBA-2009-0095.html" target="_blank">https://rhn.redhat.com/errata/RHBA-2009-0095.html</a>><br>
<br>
Barry<br>
<br>
_______________________________________________<br>
Piranha-list mailing list<br>
<a href="mailto:Piranha-list@redhat.com" target="_blank">Piranha-list@redhat.com</a><br>
<a href="https://www.redhat.com/mailman/listinfo/piranha-list" target="_blank">https://www.redhat.com/mailman/listinfo/piranha-list</a><br>
</blockquote>
<br>
</blockquote>
<br>
-- <br>
Dan Yocum<br>
Fermilab 630.840.6509<br>
<a href="mailto:yocum@fnal.gov" target="_blank">yocum@fnal.gov</a>, <a href="http://fermigrid.fnal.gov" target="_blank">http://fermigrid.fnal.gov</a><br>
Fermilab. Just zeros and ones.<br>
<br>
<br>
<br>_______________________________________________<br>
Piranha-list mailing list<br>
<a href="mailto:Piranha-list@redhat.com">Piranha-list@redhat.com</a><br>
<a href="https://www.redhat.com/mailman/listinfo/piranha-list" target="_blank">https://www.redhat.com/mailman/listinfo/piranha-list</a><br></blockquote></div><br>