[Linux-HA] STONITH device failure and then what?
dejanmm at fastmail.fm
Thu Sep 27 10:12:11 MDT 2007
On Thu, Sep 27, 2007 at 01:56:04PM +0100, Wojciech Turek wrote:
> Dear All,
> I have 2 nodes HA cluster configuration. Both nodes are connected to
> shared storage. Is it critical that nodes will not mount the same
> LUN at the same time. That is why I am using STONITH for node
> fencing. My STONITH is based on IPMI device.
> I am considering scenario that IPMI device on node that need to be
> power down fails:
> Sep 27 13:10:12 storage09 heartbeat: : ERROR: STONITH device
> external/ipmi not operational!
> Sep 27 13:10:12 storage09 heartbeat: : WARN: Exiting STONITH-
> stat process 17161 returned rc 1.
> Sep 27 13:10:12 storage09 heartbeat: : ERROR: STONITH status
> operation failed.
> Sep 27 13:10:12 storage09 heartbeat: : info: This may mean that
> the STONITH device has failed!
> Is there a way to configure STONITH in such a way that if status of
> the device exit with code 1 then it will for example stop heartbeat
> and wait for administrator intervention?
Is this v1 or v2 configuration? Looks like v1. With v1, AFAIK,
this is not possible. With v2 it should be possible, but perhaps
not easy. You can also open a bugzilla for an enhancement.
> Best Regards
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> See also: http://linux-ha.org/ReportingProblems
More information about the Linux-HA