I got the following errors after "reboot -fn" on erd-tt-eproof1, which script do I need to change?
Aug 9 15:35:40 erd-tt-eproof2 kernel: CMAN: removing node erd-tt-eproof1 from t
he cluster : Missed too many heartbeats
Aug 9 15:35:40 erd-tt-eproof2 fenced[3437]: erd-tt-eproof1 not a cluster member
after 0 sec post_fail_delay
Aug 9 15:35:40 erd-tt-eproof2 fenced[3437]: fencing node "erd-tt-eproof1"
Aug 9 15:35:42 erd-tt-eproof2 fenced[3437]: agent "fence_drac" reports: WARNING
: unable to detect DRAC version ' Dell Embedded Remote Access Controller (ERA) F
irmware Version 3.31 (Build 07.15) ' WARNING: unsupported DRAC version '__unknow
n__' failed: unable to determine power state
This is DRAC on Dell PE2650.
he cluster : Missed too many heartbeats
Aug 9 15:35:40 erd-tt-eproof2 fenced[3437]: erd-tt-eproof1 not a cluster member
after 0 sec post_fail_delay
Aug 9 15:35:40 erd-tt-eproof2 fenced[3437]: fencing node "erd-tt-eproof1"
Aug 9 15:35:42 erd-tt-eproof2 fenced[3437]: agent "fence_drac" reports: WARNING
: unable to detect DRAC version ' Dell Embedded Remote Access Controller (ERA) F
irmware Version 3.31 (Build 07.15) ' WARNING: unsupported DRAC version '__unknow
n__' failed: unable to determine power state
This is DRAC on Dell PE2650.
Thanks,
Hai
On 8/9/06, Lon Hohberger <lhh@xxxxxxxxxx> wrote:
On Wed, 2006-08-09 at 13:44 -0500, hai wu wrote:
> Thanks Lon. We got redundant power here.
>
> How can I test this fence_drac? How to simulate a failure on one node
> and know for sure that it does kick in and restarts the failed node in
> the cluster?
After both nodes join the cluster, try doing 'reboot -fn' on the node.
Oh, also, you should be booting with acpi=off when using integrated
power management.
-- Lon
--
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
-- Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster