[Linux-HA] more on problems with resouce and stonith

Rob Aronson riaronson at gmail.com
Sun Sep 30 14:37:42 MDT 2007


Thanks. I'll have to check the version. It's whatever comes with SLES 10
sp1. The address will migrate if I put the node it's on to standby. I had
something similar happen when I tried to fail over the other resource group,
that time it was the filesystem resource. The gui showed it as failed, it
wouldn't move until the failed node came back on line to move it.

On 9/30/07, Andrew Beekhof <beekhof at gmail.com> wrote:
>
> On 9/29/07, Rob Aronson <riaronson at gmail.com> wrote:
> > Thsi si another section of log I thought I lost. I kept getting this all
> the
> > time my node was off line. The gui showed the resource failed, the node
> off
> > line.
> >
> >
> > Sep 29 09:41:45 gwcluster2 crmd: [4823]: info: do_lrm_rsc_op: Performing
> > op=resource_domain_ip_stop_0
> key=4:278:91f2d010-9418-4554-9e05-3c95650fd13e)
> > Sep 29 09:41:45 gwcluster2 cib: [4819]: info: cib_diff_notify: Update
> > (client: 4823, call:276): 0.149.6783 -> 0.149.6784 (ok)
> > Sep 29 09:41:45 gwcluster2 lrmd: [4820]: info: RA output:
> > (resource_domain_ip:stop:stderr) No route to 172.16.0.125
> > Sep 29 09:41:45 gwcluster2 crmd: [4823]: ERROR: process_lrm_event: LRM
> > operation resource_domain_ip_stop_0 (call=262, rc=2) Error invalid
> parameter
> >
> > Sep 29 09:41:45 gwcluster2 cib: [8540]: info: write_cib_contents: Wrote
> > version 0.149.6784 of the CIB to disk (digest:
> > 00a3da66e2b80266956102a304ae1ca6)
> > Sep 29 09:41:46 gwcluster2 crmd: [4823]: info: do_lrm_rsc_op: Performing
> > op=resource_domain_ip_stop_0
> key=4:279:91f2d010-9418-4554-9e05-3c95650fd13e)
> >
> > Sep 29 09:41:46 gwcluster2 cib: [4819]: info: cib_diff_notify: Update
> > (client: 4823, call:277): 0.149.6784 -> 0.149.6785 (ok)
> > Sep 29 09:41:46 gwcluster2 lrmd: [4820]: info: RA output:
> > (resource_domain_ip:stop:stderr) No route to 172.16.0.125
> >
> > The failed node was gwcluster2, the log was from the surviving node,
> > gwcluster1. I'm sure it has to do with my resource parameters but I
> don't
> > know what they should be.
>
> looks like none of the NICs want to have that IP
>
> what version of heartbeat are you running?  maybe the IP addr script is
> broken.
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>



-- 
Rob Aronson
Storage, Virtualization and Orchestration Practice Manager, Novacoast
USA


More information about the Linux-HA mailing list