[Linux-HA] STONITH keeps rebooting node over and over

Dejan Muhamedagic dejanmm at fastmail.fm
Wed Jan 2 11:41:34 MST 2008


Hi,

On Mon, Dec 31, 2007 at 10:07:00AM -0500, David S. Madole wrote:
> I have set up STONITH on my two-node cluster using a Baytech
> RPC-3 as the underlying hardware.
>  
> It works in that if node B fails, then node A performs a power
> cycle on it. However, it continues to power-cycle the node
> every 30 seconds or so and node B never gets a chance to come
> back up again.
>  
> I have been searching for a setting to change the amount of
> time that node A waits for node B to become available before
> killing it again but I can't find anything.

There's no such setting. The surviving node doesn't wait for the
other one to come up; it only wants to make sure that the other
node was reset. For some reason, the stonith device doesn't report
that the reset operation was successful, so the reset is tried
again. The logs should reveal what is going on.
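
To illustrate the behaviour described above, here is a minimal
conceptual sketch in Python. It is not Heartbeat code: fence(),
reset_node and max_attempts are hypothetical names, and the
attempt cap exists only so that the demo terminates.

```python
# Conceptual model only; NOT taken from Heartbeat or any stonith plugin.
from typing import Callable


def fence(reset_node: Callable[[], bool], max_attempts: int) -> int:
    """Call reset_node until it reports success; return attempts used.

    max_attempts is a hypothetical cap added so this demo terminates.
    """
    attempts = 0
    while attempts < max_attempts:
        attempts += 1
        # The only thing checked is the device's own report of the
        # reset. Whether the peer has booted again is never examined.
        if reset_node():
            break
    return attempts


def reporting_device() -> bool:
    """Hypothetical device that confirms the reset."""
    return True


def silent_device() -> bool:
    """Hypothetical device that cycles power but never confirms it."""
    return False


if __name__ == "__main__":
    print(fence(reporting_device, 5))
    print(fence(silent_device, 5))
```

With a device that confirms the reset, the loop stops after one
attempt. With one that never confirms it, every attempt is another
power cycle, which would match the symptom you describe.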

Thanks,

Dejan

> A way to make it
> only try once even if node B does not come up would be fine
> also, after all if it doesn't come back after one power cycle,
> it's not likely to after ten either.

>  
> Any tips?
>  
> David
>  
> 

