[Linux-HA] Preventing STONITH deathmatch
Daniel X Moore
dxm at sgi.com
Tue Sep 4 04:00:14 MDT 2007
Is there any way to rate-limit STONITH attempts? We occasionally cause a
problem in one of our plugins that causes the status & stop actions to
always fail. This causes both nodes to continually kill the other node
(and themselves, I suspect).
This "deathmatch" behaviour makes it pretty difficult to get in and
reconfigure/fix things since the nodes are being killed almost
immediately they come back up.
Is there any way to force a delay (with associated lack of availability)
between STONITH attempts?
Constantly rebooting machines are actually less available than machines
not running a specific service :)
--
-------------------------------------------------------------------
Daniel Moore dxm at sgi.com
Engineering Manager: AppMan + HA Phone: +61-3-9963-1957
SGI Australian Software Group Mobile: +61-4-1360-4720
-------------------------------------------------------------------
More information about the Linux-HA
mailing list