[Linux-cluster] Can anyone help tackling issues with custom resource agent for RHCS?

Digimer lists at alteeve.ca
Fri Feb 22 18:01:41 UTC 2013


Would this help?

https://fedorahosted.org/cluster/wiki/ResourceActions

digimer

On 02/22/2013 12:29 PM, Ralph.Grothe at itdz-berlin.de wrote:
> Hello,
>
> I have written a custom resource agent for Informix servers in
> accordance with the OCF RA Developer's Guide
> http://www.linux-ha.org/doc/dev-guides/ra-dev-guide.html
> but I have some doubts whether it integrates properly with
> RHCS's idiosyncrasies.
>
> I would have liked to save myself the work of finding out all
> the Red Hat deviations, and would have used a ready-made agent
> for Informix Server resources if only I had found one.
>
> I found that I not only had to write the agent and place it in
> /usr/share/cluster
> (as far as I understand, this would have sufficed in a Pacemaker
> cluster, albeit under a different directory path),
> but I also had to tinker with
> /usr/share/system-config-cluster/misc/cluster.ng in order to have
> my extensions pass the XML validity check.
>
> Now my new <ifxdb/> tag is accepted in cluster.conf along with
> its parameterization, as declared in the agent's meta-data dump
> and in my definitions in cluster.ng,
> and I was able to successfully commit a ccs_tool update.
>
>
> These are the attributes and actions my agent supports so far:
>
> # /usr/share/cluster/ifxdb.sh meta-data | grep -E '(parameter|action) name'
>          <parameter name="server" primary="1" unique="1">
>          <parameter name="home" required="1">
>          <parameter name="config" required="1">
>          <parameter name="version">
>          <parameter name="locale">
>          <parameter name="user">
>          <parameter name="dbdate">
>          <parameter name="dbmoney">
>          <parameter name="log">
>          <action name="start" timeout="60"/>
>          <action name="stop" timeout="60"/>
>          <action name="recover" timeout="180"/>
>          <action name="status" depth="1" timeout="10" interval="15"/>
>          <action name="monitor" depth="10" timeout="20" interval="30"/>
>          <action name="meta-data" timeout="5"/>
>          <action name="verify-all" timeout="5"/>
>
> rg_test shows its rules, and I am able to start, stop and monitor
> (or "status", in RHCS parlance) my Informix resources,
> e.g.:
>
> # rg_test test /etc/cluster/cluster.conf start ifxdb ju_09tcp
>
>
> I can also enable, stop and relocate every service that contains
> an instance of my ifxdb resource via clusvcadm.
>
>
> As I made ample use of the ocf_log function
> (I prefer to have some meaningful output from the agents in the
> logs),
> the agent verbosely reports what it is doing when invoked at the
> command line through rg_test.
>
> What puzzles me is that I cannot see any of these log entries
> when my agent is run by clurgmgrd.
> I can verify that it is run at all because, e.g. on service
> startup, the Informix server instance is successfully started.
>
> What puzzles me even more is that I cannot see any entries in
> /var/log/messages from the regular monitoring invocations of the
> agent at the intervals given in the above meta-data dump, as if
> no resource monitoring checks were performed at all.
>
> And even worse, when I shut down an Informix instance manually
> (e.g. onmode -ky) it doesn't get restarted by clurgmgrd even
> though *no* __independent_subtree attribute is defined in the
> ifxdb tags of cluster.conf.
>
> So although I can manually start, stop and relocate the affected
> services through clusvcadm, the rgmanager HA handling as a whole
> seems dysfunctional to me.
>
> Is there anything else, anywhere, that I have forgotten to
> adjust in order to fully enable my custom resource agent?
>
>
>
> Regards,
> Ralph
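
One way to narrow down the symptoms described above (no log entries
and no restart after a manual shutdown) might be to give the agent's
status path its own syslog line, independent of ocf_log, so it becomes
visible whether clurgmgrd calls the action at all. The following is
only a minimal sketch, not the original ifxdb.sh: the function names,
the IFXDB_PROBE_CMD variable and the placeholder health check are all
hypothetical, and it assumes the "server" parameter reaches the agent
as OCF_RESKEY_server.

```shell
#!/bin/sh
# Hypothetical sketch of a status/monitor path for an agent like ifxdb.sh.

# Standard OCF exit codes.
OCF_SUCCESS=0
OCF_ERR_UNIMPLEMENTED=3
OCF_NOT_RUNNING=7

# Log to stderr and, if logger(1) is available, to syslog as well.
# This bypasses ocf_log on purpose, purely as a diagnostic aid.
ifxdb_log() {
    level=$1
    shift
    echo "ifxdb[$level]: $*" >&2
    if command -v logger >/dev/null 2>&1; then
        logger -t ifxdb -p "daemon.$level" -- "$*" 2>/dev/null || true
    fi
}

# Placeholder health check (assumption): runs whatever command is in
# IFXDB_PROBE_CMD. A real agent would query the Informix instance here.
ifxdb_probe() {
    sh -c "${IFXDB_PROBE_CMD:-false}"
}

# Return 0 when the instance looks healthy, non-zero otherwise.
ifxdb_status() {
    ifxdb_log info "status check for ${OCF_RESKEY_server:-unknown}"
    if ifxdb_probe; then
        return $OCF_SUCCESS
    fi
    ifxdb_log err "instance ${OCF_RESKEY_server:-unknown} not running"
    return $OCF_NOT_RUNNING
}

# Dispatch on the action name; start/stop/meta-data are omitted here.
ifxdb_main() {
    case "$1" in
        status|monitor) ifxdb_status ;;
        *)              return $OCF_ERR_UNIMPLEMENTED ;;
    esac
}

# A real agent would end with:  ifxdb_main "$@"; exit $?
```

If the "status check" line never shows up in /var/log/messages while
the service is running, that would suggest the status action is not
being scheduled at all, rather than ocf_log output being filtered.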



-- 
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without 
access to education?



