[Linux-HA] how to restart a failed resource
Andrew Beekhof
beekhof at gmail.com
Sun Sep 30 11:07:49 MDT 2007
On 9/28/07, dxj_600 <dxj_600 at 126.com> wrote:
> Thanks to Andrew Beekhof and Joseph Lamoree, I can now trigger heartbeat to start the resource,
> but I have run into a new problem:
> 1. I configured a clone resource, which should run on two nodes:
> node host_192.168.2.220 host_192.168.2.221
> 2. At first my resource ran on only one node:
> # crm_resource -W -r rsc_lcm_0:0
> resource rsc_lcm_0:0 is running on: host_192.168.2.220
> # crm_resource -W -r rsc_lcm_0:1
> resource rsc_lcm_0:1 is NOT running
Why is rsc_lcm_0:1 not running at this point?
> 3. Then I triggered heartbeat to start rsc_lcm_0:1:
> # crm_resource -C -r rsc_lcm_0:1
> 4. Heartbeat then did something like this.
> On node host_192.168.2.221:
Please attach the actual logs; they convey far more information than
what you have reproduced here.
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra monitor
> probe
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra start
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra stop <---- because my RA timed out
> On node host_192.168.2.220:
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> probe
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra monitor
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra stop <---- why was it stopped?
> [2007-09-28 18:20]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra monitor
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra stop -.___ in fact, it's a restart
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra start -'
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify
> [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra monitor
> [2007-09-28 18:22]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra monitor
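
Until the real logs are available, a trace like the one above can at least be
checked mechanically for stop/start pairs. The following is only a minimal
sketch: it assumes every trace line has the form "[timestamp]: <path> <operation>"
as in the quoted output, and the names TRACE_LINE, parse_ops and find_restarts
are made up for illustration (they are not part of heartbeat).

```python
import re

# Assumed line format, taken from the quoted trace:
#   [2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra stop
TRACE_LINE = re.compile(r"\[(?P<ts>[^\]]+)\]:\s+\S+\s+(?P<op>\w+)")


def parse_ops(lines):
    """Return (timestamp, operation) pairs; lines that do not match are skipped."""
    ops = []
    for line in lines:
        match = TRACE_LINE.search(line)
        if match:
            ops.append((match.group("ts"), match.group("op")))
    return ops


def find_restarts(ops):
    """Return the indexes of each 'stop' that is immediately followed by a 'start'."""
    return [
        i
        for i in range(len(ops) - 1)
        if ops[i][1] == "stop" and ops[i + 1][1] == "start"
    ]


# A few lines copied from the trace for host_192.168.2.220.
sample = [
    "[2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify",
    "[2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra stop",
    "[2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra start",
    "[2007-09-28 18:21]: /usr/lib/ocf/resource.d//heartbeat/owlcm-ra notify",
]

ops = parse_ops(sample)
restarts = find_restarts(ops)
print(ops)
print(restarts)
```

This only shows which operations the RA was asked to run and in what order; it
says nothing about why the cluster decided to run them, which is what the
actual logs are needed for.
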
> 5. The final result is:
> # crm_resource -W -r rsc_lcm_0:0
> resource rsc_lcm_0:0 is running on: host_192.168.2.220
> # crm_resource -W -r rsc_lcm_0:1
> resource rsc_lcm_0:1 is NOT running
> My question is:
> Since the result in step 5 is the same as in step 2, and rsc_lcm_0:0 is still running on node host_192.168.2.220, why does heartbeat stop the resource and restart it? Why doesn't heartbeat leave rsc_lcm_0:0 running untouched, and how can I avoid this scenario?
> My resource configuration is:
> <clone id="rsc_lcm" ordered="false" interleave="false" notify="true">
>   <instance_attributes id="ow_lcm_instance_attributes">
>     <attributes>
>       <nvpair id="ow_lcm_clone_max" name="clone_max" value="2"/>
>       <nvpair id="ow_lcm_clone_node_max" name="clone_node_max" value="1"/>
>     </attributes>
>   </instance_attributes>
>   <primitive class="ocf" type="owlcm-ra" provider="heartbeat" id="rsc_lcm_0" is_managed="true">
>     <operations>
>       <op id="rsc_lcm_op0" name="start" timeout="50s" disabled="false"/>
>       <op id="rsc_lcm_op2" name="monitor" interval="5s" timeout="4s" disabled="false" role="Started"/>
>     </operations>
>     <instance_attributes id="ow_lcm_ra_instance_attributes">
>       <attributes>
>         <nvpair id="rsc_lcm_vip" name="vip" value="192.168.2.231 192.168.2.232"/>
>       </attributes>
>     </instance_attributes>
>   </primitive>
> </clone>
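
Since a timeout is mentioned above, it may be worth checking which operations
in this configuration carry an explicit timeout. A minimal sketch, assuming the
<op> elements are exactly those quoted (the other elements are left out for
brevity); op_timeouts is a made-up helper name, and the list of operations to
check is my own choice, not something the cluster mandates:

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the quoted configuration; only the <op> elements matter here.
CLONE_XML = """
<clone id="rsc_lcm" ordered="false" interleave="false" notify="true">
  <primitive class="ocf" type="owlcm-ra" provider="heartbeat" id="rsc_lcm_0" is_managed="true">
    <operations>
      <op id="rsc_lcm_op0" name="start" timeout="50s" disabled="false"/>
      <op id="rsc_lcm_op2" name="monitor" interval="5s" timeout="4s" disabled="false" role="Started"/>
    </operations>
  </primitive>
</clone>
"""


def op_timeouts(xml_text):
    """Map each configured operation name to its timeout attribute."""
    root = ET.fromstring(xml_text)
    return {op.get("name"): op.get("timeout") for op in root.iter("op")}


timeouts = op_timeouts(CLONE_XML)

# Operations seen in the quoted trace that have no explicit <op> entry.
missing = [
    name
    for name in ("start", "stop", "monitor", "notify")
    if name not in timeouts
]

print(timeouts)
print(missing)
```

With the configuration as posted, only start and monitor have explicit
timeouts. Which timeout applies to stop and notify in that case depends on the
cluster defaults, which I have not checked for this heartbeat version.
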
> Thanks a lot.
>
> On 2007-09-26, "Joseph Lamoree" <jlamoree at gmail.com> wrote:
> Take a look at the crm_resource page:
> http://www.linux-ha.org/v2/AdminTools/crm_resource
> There is an example of stopping/starting resources there. Also, most
> of the executables in heartbeat support a --help option to display a
> lot of very useful information.
> --
> Joseph Lamoree
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems