[Linux-HA] NewToHA2
Andrew Beekhof
beekhof at gmail.com
Tue May 15 03:44:20 MDT 2007
its almost always a firewall.
try stopping the firewall completely and see if the problem persists.
On 5/8/07, Eric Marcus <Eric.Marcus at kentcounty.org> wrote:
>
> Hello, I am new to HA2 and am having some configuration issues. I installed HA2 (2.0.8-1) on two Suse 10 (SLES10) machines using Alan's Education Project Screencast (http://www.linux-ha.org/Education/Newbie/InstallHeartbeatScreencast)
>
> I think I have a node configuration issue even though it is in ha.cf. I am very familiar with Novell Cluster Services. The problem I outline below makes me think that both of the nodes are trying to be the "Master" but I don't how to fix this. I've spent a week on this and am feeling very stupid! Here goes.....
>
> My ha.cf file for the 2 servers shows
>
> use_logd yes
> bcast eth1
> node it-mgatedom it-mgatedomc
> crm on
>
>
> The logd.cf shows
>
> logfacility daemon
>
>
> The authkeys show
>
> auth 1
> 1 sha1 cluster1
>
>
> Now, when I start it up on IT-MGATEDOM, it shows "done"
>
> crm_mon shows only 1 node configured and after a couple minutes the "Current DC: NONE" becomes "Current DC: it-mgatedom" with 0 resources configured. It still shows 1 node, not 2.
>
> Then I go to IT-MGATEDOMC to start it up...... It says "done" and when I do a tail /var/log/message I see this
>
>
>
> it-mgatedomc:~ # /etc/init.d/heartbeat start
> Starting High-Availability services:
> done
>
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hi seq=103
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedo m
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hi seq=104
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedo m
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:
>
>
>
> The line that says "expecting from it-mgatedom" confuses me.
>
> crm_mon shows "Not Connected".
>
> netstat -n -l | grep 694 shows that udp 694 is there.
>
> The strange thing is if I stop both of them and start it on IT-MGATEDOMC first, then it will come up just fine and then when I start it on IT-MGATEDOM, it has the above issue.
>
> Any ideas?
>
> Thank you,
> Eric...
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
More information about the Linux-HA
mailing list