[Linux-HA] About time-out of STONITH.
Andrew Beekhof
beekhof at gmail.com
Tue Jun 17 00:49:52 MDT 2008
On Tue, Jun 17, 2008 at 08:14, HIDEO YAMAUCHI
<renayama19661014 at ybb.ne.jp> wrote:
> Hi,
>
>> Which operation did you cause to fail? The monitor or the fencing operation?
> When I triggered fencing, I forced a time-out in the STONITH RA.
>
> Is this behavior possibly improved in Pacemaker?
By design the crm has no idea which node or plugin is used (or which
were tried and failed).
So it would be impossible for it to recover the RA on its own.
It _may_ make sense for stonithd to unload any plugin that reports
a failed stonith action (thus causing the monitor to eventually
fail), but I'll leave that for Dejan to think about :-)
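
For readers hitting the same symptom: one general way to avoid the
orphaned plugin processes shown in the quoted ps output is to run the
external command in its own process group and kill the whole group when
the time-out expires. The following is a minimal Python sketch of that
idea only. It is not how stonithd or Heartbeat 2.1.3 actually invokes
plugins, and the function name run_with_timeout is hypothetical.

```python
import os
import signal
import subprocess


def run_with_timeout(argv, timeout_s):
    """Run argv and return (exit_code, timed_out).

    Hypothetical helper for illustration; POSIX only.
    """
    # start_new_session=True makes the child a session and process-group
    # leader, so its process-group id equals its pid.
    proc = subprocess.Popen(argv, start_new_session=True)
    try:
        return proc.wait(timeout=timeout_s), False
    except subprocess.TimeoutExpired:
        try:
            # Kill the child and anything it spawned in the same group,
            # so nothing is left behind to be re-parented to init.
            os.killpg(proc.pid, signal.SIGKILL)
        except ProcessLookupError:
            pass  # the group exited between the time-out and the kill
        # Reap the child so it does not linger as a zombie.
        return proc.wait(), True
```

With this pattern a hung command is killed and reaped before the caller
retries, so repeated attempts do not accumulate stale processes.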
>
> Regards,
> Hideo Yamauchi.
>
> --- Andrew Beekhof <beekhof at gmail.com> wrote:
>
>> On Tue, Jun 17, 2008 at 04:16, HIDEO YAMAUCHI
>> <renayama19661014 at ybb.ne.jp> wrote:
>> > Hi,
>> >
>> > I checked how a time-out during STONITH execution behaves (Heartbeat 2.1.3 and ibmrsa-telnet).
>> >
>> > I confirmed it with the following sequence.
>> >
>> > 1) Start Heartbeat on two nodes.
>> > 2) Hang one node.
>> > 3) Force a time-out in STONITH (insert a sleep into the plugin, or cut all power to the node).
>> >
>> > But, unlike a normal RA, multiple STONITH RA processes are started.
>> >
>> > I think that the STONITH RA should be started again only after it has been killed properly, like a normal RA.
>>
>> Which operation did you cause to fail? The monitor or the fencing operation?
>>
>> >
>> > //-------The state of the ps command
>> > Last login: Tue Jun 17 10:00:54 2008 from 172.30.96.92
>> > [root at x3650b ~]# ps -ef |grep ibm
>> > root 4562 1 0 Jun12 ? 00:00:00 /sbin/ibmasm
>> > root 4823 4562 0 Jun12 ? 00:00:00 /sbin/ibmasm
>> > root 11913 11912 0 10:23 ? 00:00:00 /usr/bin/python /usr/lib64/stonith/plugins/external/ibmrsa-telnet reset x3650a
>> > root 11947 11917 0 10:23 pts/1 00:00:00 grep ibm
>> > [root at x3650b ~]# ps -ef |grep ibm
>> > root 4562 1 0 Jun12 ? 00:00:00 /sbin/ibmasm
>> > root 4823 4562 0 Jun12 ? 00:00:00 /sbin/ibmasm
>> > root 11913 1 0 10:23 ? 00:00:00 /usr/bin/python /usr/lib64/stonith/plugins/external/ibmrsa-telnet reset x3650a
>> > root 11962 1 0 10:26 ? 00:00:00 /usr/bin/python /usr/lib64/stonith/plugins/external/ibmrsa-telnet reset x3650a
>> > root 11977 1 0 10:29 ? 00:00:00 /usr/bin/python /usr/lib64/stonith/plugins/external/ibmrsa-telnet reset x3650a
>> > root 11994 11993 0 10:32 ? 00:00:00 /usr/bin/python /usr/lib64/stonith/plugins/external/ibmrsa-telnet reset x3650a
>> > root 12004 11917 0 10:34 pts/1 00:00:00 grep ibm
>> > //--------
>> >
>> > Best Regards,
>> >
>> > Hideo Yamauchi.
>> >
>> >
>> > _______________________________________________
>> > Linux-HA mailing list
>> > Linux-HA at lists.linux-ha.org
>> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> > See also: http://linux-ha.org/ReportingProblems
>> >