[Linux-cluster] Fencing issues with fence_apc_snmp (APC Firmware 6.x)
Digimer
lists at alteeve.ca
Tue Oct 14 11:01:42 UTC 2014
On 13/10/14 03:10 PM, Thomas Meier wrote:
> Hi
>
> When configuring PDU fencing in my 2-node-cluster I ran into some problems with
> the fence_apc_snmp agent. Turning a node off works fine, but
> fence_apc_snmp then exits with error.
>
>
>
> When I do this manually (from node2):
>
> fence_apc_snmp -a node1 -n 1 -o off
>
> the output of the command is not an expected:
>
> Success: Powered OFF
>
> but in my case:
>
> Returned 2: Error in packet.
> Reason: (genError) A general failure occured
> Failed object: .1.3.6.1.4.1.318.1.1.4.4.2.1.3.21
>
>
> When I check the PDU, the port is without power, so this part works.
> But it seems that the fence agent can't read the status of the PDU
> and then exits with error. The same seems to happen when fenced
> is calling the agent. The agent also exits with an error and fencing can't succeed
> and the cluster hangs.
>
>>From the logfile:
>
> fenced[2100]: fence node1 dev 1.0 agent fence_apc_snmp result: error from agent
>
>
> My Setup: - CentOS 6.5 with fence-agents-3.1.5-35.el6_5.4.x86_64 installed.
> - APC AP8953 PDU with firmware 6.1
> - 2-node-cluster based on https://alteeve.ca/w/AN!Cluster_Tutorial_2
> - fencing agents in use: fence_ipmilan (working) and fence_apc_snmp
>
>
> I did some recherche, and for me it looks like that my fence-agents package is too old for my APC firmware.
>
> I've already found the fence-agents repo: https://git.fedorahosted.org/cgit/fence-agents.git/
>
> Here https://git.fedorahosted.org/cgit/fence-agents.git/commit/?id=55ccdd79f530092af06eea5b4ce6a24bd82c0875
> it says: "fence_apc_snmp: Add support for firmware 6.x"
>
>
> I've managed to build fence-agents-4.0.11.tar.gz on a CentOS 6.5 test box, but my build
> of fence_apc_snmp doesn't work.
>
> It gives:
>
> [root at box1]# fence_apc_snmp -v -a node1 -n 1 -o status
> Traceback (most recent call last):
> File "/usr/sbin/fence_apc_snmp", line 223, in <module>
> main()
> File "/usr/sbin/fence_apc_snmp", line 197, in main
> options = check_input(device_opt, process_input(device_opt))
> File "/usr/share/fence/fencing.py", line 705, in check_input
> logging.getLogger().addHandler(logging.StreamHandler(stream=sys.stderr))
> TypeError: __init__() got an unexpected keyword argument 'stream'
>
>
> I'd really like to see if a patched fence_apc_snmp agent fixes my problem, and if so,
> install the right version of fence_apc_snmp on the cluster without breaking things,
> but I'm a bit clueless how to build me a working version.
>
>
> Maybe you have some tips?
>
>
>
> Thanks in advance
>
> Thomas
Hi Marek et. al.,
This is a RHEL 6.5 install, so Kristoffer's comment about needing a
newer version of python is a bit of a concern. Has this been tested on
RHEL 6 with an APC with the 6.x firmware?
cheeps
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
More information about the Linux-cluster
mailing list