[Linux-HA] crm, pingd, bond - cause with migrate resources
Andrew Beekhof
beekhof at gmail.com
Wed Sep 6 03:05:50 MDT 2006
On 9/5/06, Pavel Sebek <sebek at via.cz> wrote:
> The moderator did not approve my earlier mail because it was larger than
> 100k, so I am sending it again with a gzipped attachment.
>
> Hi,
>
> I could only try this situation today because the cluster is already
> running in production.
>
> Attached are the output of cibadmin -Q before losing the connection to
> the ping node (cib.xml) and the output after the ping node's state changed
> to dead (cib_ping_node_dead.xml). I have also included the logs. I cut the
> connection to the ping node with iptables (iptables.txt).
> I expected that after node1 (cn42x1) lost its connection to the ping node,
> all resources would move to the second node (cn42x2).
> But the resources stayed on the node that had lost its connection to the
> ping node.
The problem is in the nodes section. Specifically these lines:
<nvpair name="pingd"
id="pingd-926192c4-8483-4327-9e5f-2768983d945a" value="1"/>
<nvpair name="pingd"
id="pingd-6ca0fb50-0fe4-453d-b193-cc83308e4bda" value="1"/>
I'm guessing they were left over from a prior attempt which placed the
values in the "permanent" section of the CIB. The "temporary" values
then had no effect.
If you remove those lines it should work fine.
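To check for such leftovers before editing the CIB, a small script can list every "pingd" nvpair stored under the nodes section. This is only a rough sketch: the function name, the sample XML and its ids are made up for illustration, and the element layout (nodes/node/.../nvpair for permanent values, status/node_state for temporary ones) is assumed from the cibadmin -Q output discussed here, so adjust it to your real CIB.

```python
import xml.etree.ElementTree as ET


def find_permanent_pingd(cib_xml, attr_name="pingd"):
    """Return (uname, nvpair id, value) for each matching nvpair under <nodes>.

    Assumption: permanent node attributes live below <nodes>/<node>, while
    the values written at runtime live below <status> and are ignored here.
    """
    root = ET.fromstring(cib_xml)
    hits = []
    for nodes in root.iter("nodes"):
        for node in nodes.iter("node"):
            for nvpair in node.iter("nvpair"):
                if nvpair.get("name") == attr_name:
                    hits.append(
                        (node.get("uname"), nvpair.get("id"), nvpair.get("value"))
                    )
    return hits


# Hypothetical, heavily trimmed CIB; the ids are placeholders, not real ones.
SAMPLE_CIB = """
<cib>
  <configuration>
    <nodes>
      <node id="node-1" uname="cn42x1" type="normal">
        <instance_attributes id="ia-1">
          <attributes>
            <nvpair name="pingd" id="pingd-example-1" value="1"/>
          </attributes>
        </instance_attributes>
      </node>
      <node id="node-2" uname="cn42x2" type="normal"/>
    </nodes>
  </configuration>
  <status>
    <node_state id="node-1" uname="cn42x1">
      <transient_attributes id="ta-1">
        <instance_attributes id="sa-1">
          <attributes>
            <nvpair name="pingd" id="status-pingd-1" value="0"/>
          </attributes>
        </instance_attributes>
      </transient_attributes>
    </node_state>
  </status>
</cib>
"""

if __name__ == "__main__":
    for uname, nvpair_id, value in find_permanent_pingd(SAMPLE_CIB):
        print(f"permanent pingd on {uname}: id={nvpair_id} value={value}")
```

To run it against a live cluster, feed it the text saved from "cibadmin -Q" instead of SAMPLE_CIB. Anything it reports is a candidate for removal; the pingd value in the status section is deliberately left out of the result.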
> PS: Is there a way to find out a node's score?
Only by looking at the CIB.
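For reference, the score arithmetic from the quoted 8/28 message further down can be reproduced in a few lines. This is a sketch of the poster's model only (base score + pingd multiplier * reachable ping nodes + stickiness on the node currently running the resource); the function and parameter names are invented for illustration, and nothing here claims that the CRM computes scores exactly this way.

```python
def expected_score(base, pingd_multiplier, reachable_ping_nodes,
                   stickiness=0, runs_here=False):
    """Score as modelled in the quoted message, not the CRM's own algorithm."""
    score = base + pingd_multiplier * reachable_ping_nodes
    if runs_here:
        # Stickiness only counts for the node that currently hosts the group.
        score += stickiness
    return score


# Both nodes can reach the single ping node; resources run on node1.
node1_before = expected_score(200, 180, 1, stickiness=155, runs_here=True)
node2_before = expected_score(199, 180, 1)

# node1 loses its link to the ping node; node2 still reaches it.
node1_after = expected_score(200, 180, 0, stickiness=155, runs_here=True)
node2_after = expected_score(199, 180, 1)

if __name__ == "__main__":
    print("before:", node1_before, node2_before)
    print("after: ", node1_after, node2_after)
    print("move to node2 expected:", node2_after > node1_after)
```

Under this model node2 outscores node1 once the ping node is unreachable from node1, which is why a move was expected. Stale permanent pingd values like the ones above would keep the pingd term from dropping on node1.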
>
> Andrew Beekhof wrote:
>
> > On 8/28/06, Pavel Sebek <sebek at via.cz> wrote:
> >
> >> Hi,
> >>
> >> I tried '<nvpair id="pingd-multiplier" name="multiplier"
> >> value="INFINITY"/>' without success.
> >>
> >> Now I tried another:
> >> score for group resource on node1 is 200
> >> score for group resource on node2 is 199
> >> resource_stickiness is 155
> >> multiplier for pingd is 180
> >> one ping node
> >>
> >> I expected that after both nodes start, all resources would run on
> >> node1, and they do. IMHO node1's score is (200 + 180 + 155) = 535 (score
> >> of node + pingd + resource_stickiness)
> >> and node2's score is (199 + 180) = 379.
> >> I think that after the link on node1's interface is lost, node1's score
> >> would be (200 + 155) = 355 and node2's score would be (199 + 180) = 379,
> >> and therefore all resources would move to node2.
> >>
> >> But it doesn't work ;-(
> >>
> >> Why?
> >
> >
> > If you get it into this situation again (where it should be on the
> > "other" node because of pingd), can you run "cibadmin -Q" and attach
> > the result here?
> >
>
>
>
>
> [root at xxxx ha]# iptables -vnxL
> Chain INPUT (policy ACCEPT 5120674 packets, 1039892387 bytes)
> pkts bytes target prot opt in out source destination
>
> Chain FORWARD (policy ACCEPT 0 packets, 0 bytes)
> pkts bytes target prot opt in out source destination
>
> Chain OUTPUT (policy ACCEPT 2989188 packets, 446932188 bytes)
> pkts bytes target prot opt in out source destination
> [root at xxxx ha]# /usr/local/bin/stop_ping-node
> [root at xxxx ha]# iptables -vnxL
> Chain INPUT (policy ACCEPT 5120843 packets, 1039919976 bytes)
> pkts bytes target prot opt in out source destination
>
> Chain FORWARD (policy ACCEPT 0 packets, 0 bytes)
> pkts bytes target prot opt in out source destination
>
> Chain OUTPUT (policy ACCEPT 2989282 packets, 446945668 bytes)
> pkts bytes target prot opt in out source destination
> 1 137 DROP icmp -- * * 0.0.0.0/0 10.6.35.1
> [root at xxxx ha]
>
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>
>