[Linux-HA] Preventing STONITH deathmatch

Daniel X Moore dxm at sgi.com
Tue Sep 4 04:00:14 MDT 2007


Is there any way to rate-limit STONITH attempts? We occasionally 
introduce a bug in one of our plugins that makes the status & stop 
actions always fail. Both nodes then continually kill each other 
(and themselves, I suspect).

This "deathmatch" behaviour makes it pretty difficult to get in and 
reconfigure/fix things, since the nodes are killed almost as soon as 
they come back up.

Is there any way to force a delay (with associated lack of availability) 
between STONITH attempts?

Constantly rebooting machines are actually less available than machines 
not running a specific service :)
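
A minimal sketch of the kind of delay being asked about, in Python. It is 
purely illustrative: StonithRateLimiter, may_fence and the 300-second 
interval are hypothetical names and values, not part of Heartbeat or of 
any STONITH plugin. It also keeps its state in memory only; a real 
mechanism would presumably need to persist the last attempt time across 
reboots, since the node doing the fencing may itself be killed.

```python
import time


class StonithRateLimiter:
    """Hypothetical gate: permit a fencing attempt only if at least
    min_interval_s seconds have passed since the last permitted one."""

    def __init__(self, min_interval_s, clock=time.monotonic):
        self.min_interval_s = min_interval_s
        self.clock = clock          # injectable so the logic can be tested
        self.last_attempt = None    # time of the last permitted attempt

    def may_fence(self):
        now = self.clock()
        too_soon = (
            self.last_attempt is not None
            and now - self.last_attempt < self.min_interval_s
        )
        if too_soon:
            # Refuse: the peer stays up (and the service stays down)
            # until the interval expires.
            return False
        self.last_attempt = now
        return True


if __name__ == "__main__":
    limiter = StonithRateLimiter(min_interval_s=300)
    print(limiter.may_fence())  # first attempt is allowed
    print(limiter.may_fence())  # an immediate retry is refused
```

The trade-off is the one described above: while a retry is being refused 
the resource is unavailable, but the nodes stay up long enough for 
someone to log in and fix the configuration.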

-- 
-------------------------------------------------------------------
  Daniel Moore                              dxm at sgi.com
  Engineering Manager: AppMan + HA          Phone:  +61-3-9963-1957
  SGI Australian Software Group             Mobile: +61-4-1360-4720
-------------------------------------------------------------------

