[Linux-HA] questions on resources, iptables, ipaddr2, ocfs2

Andrew Beekhof beekhof at gmail.com
Tue Mar 20 08:38:42 MDT 2007


On 3/14/07, Florian Heigl <florian.heigl at gmail.com> wrote:
> Hi,
>
> sorry for the late reply, I was catching up on sleep.
> I've put the logs online now, they hold all messages from starting on
> the first node to the second node successdully joining.
>
> i have debug 1 on my ha.cf, if this is too verbose, I will regenerate the logs.

thanks to grep, logs can never be too long :-)

>
> http://wartungsfenster.dyndns.org/outbox/messages-domU-bacula1
>
> other than that I have not yet done any changes so that the log is in
> sync with what you read in the last email.
>
> what I know I need to change so far:
> don't use ucast as I want to have >2 nodes, so it will be
> mcast(encrypted) on eth0 and mcast (crc) on eth1.
> I'll remove the ocfs2 bit and make the filesystem RA mount appropriately
> I'll rework my application start scripts to include the 'monitor'
> action and be more compliant for  OCF standards.
>
> Also thank You a lot for explaining the conditions that make something
> unmanaged, I don't know why it happens, but at least I know what it
> means now.

this seems to be an lrm bug.

the crm can handle it when an RA isnt installed on all nodes, but
apparently the lrm never bothers to tell us this is the case and just
returns "unknown error"

which means:
* monitor actions for that resource on that node will fail - making it
look active but failed
* stop actions for that resource on that node will fail - making it unmanaged

> 2007/3/12, Andrew Beekhof <beekhof at gmail.com>:
> > On 3/12/07, Florian Heigl <florian.heigl at gmail.com> wrote:
> > more to the point it becomes unmanaged when it fails badly.
> > again, logs?
>
> i just didn't want to throw them at the list with my  first email :)
>
> florian
>
>
> --
> 'Sie brauchen sich um Ihre Zukunft keine Gedanken zu machen'
>
> [ side note: overload an emc^2 clariions storage processor number one,
> and it will crash, then dump. thus the sudden double load will
> overload processor number two. once 1 comes up and asks for a
> giveback, it will feel very alone and lock up, with a 30% chance of
> losing it's lun allocations; I'm glad heartbeat is far more robust
> even when misconfigured by me and I really love it already.
> besides, I didn't say any of the above.]
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>


More information about the Linux-HA mailing list