[Linux-HA] Node failure causes peer host to reboot?!?

Andrew Beekhof beekhof at gmail.com
Thu Apr 17 04:56:28 MDT 2008


On Thu, Apr 17, 2008 at 12:35 PM, Luis Motta Campos
<luismottacampos at yahoo.co.uk> wrote:
> Dejan Muhamedagic wrote:
>  > Hi,
>
> >> respawn hacluster /usr/lib64/heartbeat/ipfail
>  >
>  > ipfail doesn't work with crm. You should use pingd instead.
>
>  Well, I don't think this helps. :( I'm using the suggested (reasonable
>  for me) defaults:
>
>  respawn root /usr/lib64/heartbeat/pingd -m 100 -d 5s
>
>  (yes, I'm running CentOS x86_64).
>
>  I still have problems, and they seem to be worse now. Before, if I
>  restarted heartbeat (/etc/init.d/heartbeat restart), any service running
>  on the machine moved away before the restart, and heartbeat was able to
>  restart OK.
>
>  With pingd instead of ipfail, even this is broken: heartbeat reboots
>  the peer host (the one supposed to keep services running) if I try to
>  restart the heartbeat service on one of the machines.
>
>  I presume I'm doing something really stupid, but I can't understand it.
>  Please help me out. I used hb_report to fetch all I know about my
>  system, please find the report attached.
>

Random question: did you install from source or from packages? Where
did you get them from?

