I'm trying to get fence_drac5 working on a cluster I'm setting up of two Dell R410's. The primary issue I'm seeing are timeouts. The fence does seem to work as the other node will get shut down, but the script always exits 1.
Here's the output:
# fence_drac5 -a 192.168.1.100 --power-timeout 30 -x -l root -p calvin -c 'admin1->' -o reboot
Connection timed out
# fence_drac5 -a 192.168.1.100 --power-timeout 30 -v -x -l root -p calvin -c 'admin1->' -o reboot
root@192.168.1.100's password:
/admin1-> racadm serveraction powerstatus
Server power status: ON
/admin1->
/admin1-> racadm serveraction powerdown
Server power operation successful
/admin1->Traceback (most recent call last):
File "/usr/sbin/fence_drac5", line 154, in <module>
main()
File "/usr/sbin/fence_drac5", line 137, in main
result = fence_action(conn, options, set_power_status, get_power_status, get_list_devices)
File "/usr/share/fence/fencing.py", line 838, in fence_action
if wait_power_status(tn, options, get_power_fn) == 0:
File "/usr/share/fence/fencing.py", line 744, in wait_power_status
if get_power_fn(tn, options) != options["-o"]:
File "/usr/sbin/fence_drac5", line 38, in get_power_status
status = re.compile("(^|: )(ON|OFF|Powering ON|Powering OFF)\s*$", re.IGNORECASE | re.MULTILINE).search(conn.before).group(2)
AttributeError: 'NoneType' object has no attribute 'group'
Even though I pass "-o reboot", it still powers off. It does the same even if I don't pass that option.
I added --power-timeout 30 in the latest test to see if that'd help but no dice. Doesn't work without it either.
I have tried fence_ipmilan & it works great, but the iDRAC interfaces are somewhat exposed & need to use SSH for security reasons, which limits me to fence_drac5.
Thanks.
-- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster