[Linux-HA] Heartbeat took 20 min. to stop
Andrew Beekhof
beekhof at gmail.com
Fri Sep 21 05:34:08 MDT 2007
On 9/21/07, Jose Jerez <tale.toul at gmail.com> wrote:
> Hello,
>
> Today I had a problem stopping heartbeat that I thought had been
> solved in one of the latest releases:
>
> First I set one node in standby mode with:
>
> #crm_standby -U justiciamacjh3 -v on
>
> Then I stop heartbeat:
>
> #/etc/init.d/heartbeat stop
>
> And it takes 20 minutes to stop. In the log file (attached) I see:
>
> ERROR: crm_timer_popped: Shutdown Escalation (I_STOP) just popped!
> ERROR: do_log: [[FSA]] Input I_STOP from crm_timer_popped() received in state (S_TERMINATE)
> info: verify_stopped: Checking for active resources before exit
> ERROR: verify_stopped: 1 pending LRM operations at shutdown
> ERROR: ghash_print_pending: Pending action: IP_db1:137
> ERROR: do_exit: Performing A_EXIT_1 - forcefully exiting the CRMd
>
> I had found this problem in 2.0.7, but I thought it had been solved by now.
>
> System is SLES9 SP3 with heartbeat 2.1.2 compiled from source.

Thanks for reporting this!

From the logs, it looks like we think some actions we initiated
haven't come back from the LRM, which may or may not be true.

I've put a significant amount of work into shutdowns since 2.1.2
came out (a new test I wrote displayed similar symptoms). You might
want to try updating to the "obs-2.1.2-2" tag, which includes the
changes and is quite well tested.
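
To see which operations the CRMd still considered pending at shutdown,
something like the following may help. It is only a sketch: the
function name pending_lrm_ops is made up for illustration, and the
default log path /var/log/ha-log is an assumption (use whatever
logfile your ha.cf points at). It relies only on the "Pending action:"
message format shown in the log excerpt above.

```shell
# Hypothetical helper: list the LRM operations reported as pending at shutdown.
# The default log path is an assumption; pass the real one as the first argument.
pending_lrm_ops() {
    log="${1:-/var/log/ha-log}"
    # Matches lines such as:
    #   ERROR: ghash_print_pending: Pending action: IP_db1:137
    # and prints only the part after "Pending action:" (e.g. IP_db1:137),
    # with duplicates removed.
    grep 'ghash_print_pending: Pending action:' "$log" \
        | sed 's/.*Pending action: *//' \
        | sort -u
}
```

Running "pending_lrm_ops /path/to/your/logfile" prints one
resource:call-id pair per line, which shows which resource the
shutdown was waiting on.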