[Linux-HA] Preventing STONITH deathmatch

Dejan Muhamedagic dejanmm at fastmail.fm
Tue Sep 4 06:10:58 MDT 2007


Hi,

On Tue, Sep 04, 2007 at 08:00:14PM +1000, Daniel X Moore wrote:
> Is there any way to rate-limit STONITH attempts?

Not that I know of.

> We occasionally cause a 
> problem in one of our plugins that causes the status & stop actions to 
> always fail. This causes both nodes to continually kill the other node 
> (and themselves, I suspect).
> 
> This "deathmatch" behaviour makes it pretty difficult to get in and 
> reconfigure/fix things since the nodes are being killed almost 
> immediately they come back up.

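The only way out I can offer is to prevent Heartbeat from
starting on boot (by disabling its init script), so that the
nodes stay up long enough for you to log in and fix the plugin.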

> Is there any way to force a delay (with associated lack of availability) 
> between STONITH attempts?

No. But you could file an enhancement request in the Linux-HA
bugzilla.
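
To make such a request concrete, here is a rough sketch of the
kind of delay being asked for. It is only an illustration in
Python, not Heartbeat code and not an existing option: the
function name may_fence, the state file and the interval are
all made up for this example. It assumes the time of the last
attempt has to be kept on disk, since a node that has just been
shot loses anything it held in memory.

```python
import os
import time


def may_fence(state_file, min_interval, now=None):
    """Return True if a fencing attempt is allowed right now.

    Hypothetical helper: allows at most one attempt per
    `min_interval` seconds and records each allowed attempt in
    `state_file` so the limit survives a reboot.
    """
    now = time.time() if now is None else now

    # Read the time of the last allowed attempt, if any.
    try:
        with open(state_file) as f:
            last = float(f.read().strip())
    except (OSError, ValueError):
        last = None  # missing or unreadable record: treat as "never"

    # Refuse if the last attempt is too recent. A negative difference
    # means the clock went backwards; the attempt is allowed then.
    if last is not None and 0 <= now - last < min_interval:
        return False

    # Record this attempt; write to a temp file and rename so the
    # record is replaced in one step.
    tmp = state_file + ".tmp"
    with open(tmp, "w") as f:
        f.write(repr(now))
    os.replace(tmp, state_file)
    return True
```

A caller would check may_fence(path, 300) before shooting the
peer and skip the attempt when it returns False. The price is
exactly the lack of availability mentioned in the question:
resources on the unfenced node stay in limbo until the interval
has passed.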

> Constantly rebooting machines are actually less available than machines 
> not running a specific service :)

:D

Dejan

> -- 
> -------------------------------------------------------------------
>  Daniel Moore                              dxm at sgi.com
>  Engineering Manager: AppMan + HA          Phone:  +61-3-9963-1957
>  SGI Australian Software Group             Mobile: +61-4-1360-4720
> -------------------------------------------------------------------

