[Linux-HA] Failover not happening.
nasar sherieff
shenasar at yahoo.com
Sat Oct 6 04:37:05 MDT 2007
Thank you Andrew.
Failover happens now after configuring ipfail. But the
problem is with the drbd now. When I connect back the
master node, It's ending up in split brain. I have to
solve this manually. Read about stonith, but I don't
have any stonith devices. Is there any other way to
avoid split brain. ie, when master comes online, it'll
get all the updates from slave1?. Please help.
Thanks
Nasar
--- Andrew Beekhof <beekhof at gmail.com> wrote:
> On 10/4/07, nasar sherieff <shenasar at yahoo.com>
> wrote:
> > I have two node cluster as active/passive setup.
> >
> > two nodes are connected through Lan. There is a
> serial
> > connection between these two nodes.
> >
> > OS: Fedora core 7
> > s/w: heartbeat-2.0.8-1.fc7, drbd-8.0.6-3(RPMS)
> > Stonith not configured.
>
> bad idea
>
> > Cluster nodes: master and slave1
> >
> >
> > Problem:
> > When I disconnect network cable of master node,
> > failover is not happening.
>
> because heartbeat can still see the other side via
> the serial cable
>
> you need to read up on ipfail
>
> > heartbeat still running
> > with the resources up. drbd show primary/unknown.
> > slave1 node shows secodary/unknown.
> >
> > Failover happening only when master node properly
> > shutting down or heartbeat service stopped.
> >
> >
> > Please let me know how to solve this failover
> issues
> > when network not available, but machine is up.
> >
> >
> > My configuration:
> >
> > /etc/ha.d/ha.cf
> > --------------------
> > logfile /var/log/ha-log
> > logfacility local0
> > keepalive 1
> > deadtime 30
> > warntime 10
> > initdead 60
> > serial /dev/ttyS0
> > bcast eth0
> > ping 192.168.2.2
> > auto_failback yes
> > node master slave1
> > crm no
> >
> >
> >
> > /etc/ha.d/haresources
> > ----------------------
> > master drbddisk::drbd0
> > Filesystem::/dev/drbd0::/data::ext3 192.168.2.3
> smb
> >
> >
> >
> > /etc/drbd.conf
> > ---------------------
> > global {
> > usage-count yes;
> > }
> >
> > resource drbd0 {
> > protocol C;
> > handlers {
> > pri-on-incon-degr "echo o >
> /proc/sysrq-trigger ;
> > /sbin/shutdown -h";
> > pri-lost-after-sb "echo o >
> /proc/sysrq-trigger ;
> > /sbin/shutdown -h";
> > local-io-error "echo o > /proc/sysrq-trigger ;
> > /sbin/shutdown -h";
> > outdate-peer
> > "/usr/lib/heartbeat/drbd-peer-outdater";
> > }
> > startup {
> > degr-wfc-timeout 120; # 2 minutes.
> > }
> > disk {
> > on-io-error detach;
> > }
> > net {
> > timeout 120;
> > connect-int 20;
> > ping-int 20;
> > max-buffers 2048;
> > max-epoch-size 2048;
> > ko-count 30;
> > cram-hmac-alg "sha1";
> > shared-secret "FooFunFactory";
> > }
> > syncer {
> > rate 10M;
> > al-extents 257;
> > }
> >
> > on master {
> > device /dev/drbd0;
> > disk /dev/sda2;
> > address 192.168.2.9:7788;
> > meta-disk internal;
> > }
> > on slave1 {
> > device /dev/drbd0;
> > disk /dev/sda2;
> > address 192.168.2.8:7788;
> > meta-disk internal;
> > }
> > }
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
>
____________________________________________________________________________________
> > Be a better Heartthrob. Get better relationship
> answers from someone who knows. Yahoo! Answers -
> Check it out.
> >
>
http://answers.yahoo.com/dir/?link=list&sid=396545433
> > _______________________________________________
> > Linux-HA mailing list
> > Linux-HA at lists.linux-ha.org
> >
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
____________________________________________________________________________________
Shape Yahoo! in your own image. Join our Network Research Panel today! http://surveylink.yahoo.com/gmrs/yahoo_panel_invite.asp?a=7
More information about the Linux-HA
mailing list