[Linux-HA] Automatic resource restart/clone resources

Andrew Beekhof beekhof at gmail.com
Tue Oct 3 05:05:11 MDT 2006


On 10/3/06, Max Hofer <max.hofer at apus.co.at> wrote:
> In the current configuration I have a cloned resource (one instance for each
> cluster node, 2 nodes in the cluster).
>
> The resource cannot fail over to the other node because it is an ISDN card
> which apparently cannot be relocated. The application running on the cluster
> is a resource called APPL and depends on a running ISDN resource.
>
> Currently when an ISDN resource fails on one node it is stopped on this node
> and it is never restarted without manual intervention (setting failcount to
> 0 or using crm_resource -C).

Try setting resource_failure_stickiness = 0 for just the ISDN resource.

>
> I have the following questions:
> * Do I have to write a rule so that a certain clone can only run on a certain
> node?

You can't do that at all.

Are both ISDN connections ever active at the same time?  If not, you
probably shouldn't be using a clone.

> Or do clone_max and clone_node_max implicitly create those rules for
> me? What happens when I add an additional node with no ISDN card at a later
> stage?

By default, symmetric_cluster = true.  This means you have to use
rsc_location constraints to tell us where we CAN'T run the resources.

If you set that to false, then you need to use rsc_location
constraints to tell us where we _CAN_ run the resources.

The idea being that you only need to specify the exception(s).
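
As a rough illustration of the two modes, here is a small Python sketch.
It is a simplified model, not the CRM's actual placement code; the node
names and the "can"/"cannot" labels are made up for the example and only
stand in for rsc_location constraints.

```python
def allowed_nodes(nodes, symmetric_cluster, constraints):
    """Return the nodes a resource may run on.

    constraints maps a node name to "can" or "cannot" (hypothetical
    labels standing in for rsc_location constraints).
    """
    if symmetric_cluster:
        # Opt-out: every node is a candidate unless a constraint excludes it.
        return [n for n in nodes if constraints.get(n) != "cannot"]
    # Opt-in: no node is a candidate unless a constraint allows it.
    return [n for n in nodes if constraints.get(n) == "can"]


# node3 is a hypothetical node added later, without an ISDN card.
nodes = ["node1", "node2", "node3"]

# symmetric_cluster = true: only the exception (node3) needs a constraint.
print(allowed_nodes(nodes, True, {"node3": "cannot"}))

# symmetric_cluster = false: each permitted node needs a constraint.
print(allowed_nodes(nodes, False, {"node1": "can", "node2": "can"}))
```

In this model both calls select node1 and node2; the difference is only
which nodes you have to write constraints for.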

> * Is there a way to tell the CRM to try a restart of the failed resource (X
> times) before giving up?

If you mean "before giving up on node A", yes.  That's what
resource_failure_stickiness is for.  The problem is you set it to
-INFINITY, which means X = 1.
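
To see why -INFINITY gives X = 1, here is a deliberately simplified
Python model. It assumes the node's score for the resource is
base_score + failcount * resource_failure_stickiness and that the node
is given up on once that score drops below zero; the real CRM also
weighs the other nodes' scores, so treat the function and its
parameters as illustrative only.

```python
import math


def failures_before_giving_up(base_score, failure_stickiness):
    """Smallest failcount X for which
    base_score + X * failure_stickiness < 0, or None if that never happens.

    Assumes base_score is finite and >= 0 (simplified model).
    """
    if failure_stickiness >= 0:
        # Failures never lower the node's score, so the node is never given up on.
        return None
    # Solve base_score + X * failure_stickiness < 0 for the smallest integer X.
    return math.floor(base_score / -failure_stickiness) + 1


# -INFINITY: a single failure is enough, whatever the base score is.
print(failures_before_giving_up(250, float("-inf")))

# A finite negative value allows several failures first.
print(failures_before_giving_up(250, -100))

# Zero: failures do not count against the node at all.
print(failures_before_giving_up(250, 0))
```

Under these assumptions the three calls give 1, 3 and None.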

>
> The behaviour I would really like to have:
> * If the ISDN resource on the node where APPL is running fails, fail over to
> the other node (if the ISDN resource is running there)

That would happen automatically, depending on your values of
resource_failure_stickiness.

> * If the ISDN resource fails, try to restart it again periodically

A cron job resetting the failure count would achieve this.

> Can anyone give me a clue how to implement this?
>
> Attached is a minimal CIB and the status after the monitor operation failed.
>
> kind regards
> Max