[Linux-HA] heartbeat 2.0.8: still not working

Serge Dubrouski sergeyfd at gmail.com
Mon Feb 5 13:21:50 MST 2007


Your nodes do not see each other. Is there any kind of firewall
between them? Do you run iptablses?

On 2/5/07, greno at verizon.net <greno at verizon.net> wrote:
> heartbeat is wearing me down...
>
> I tried again on two different servers and still cannot get heartbeat working.  Both are running FC6, and hb 2.0.8.  Can someone please check my configs and see if you can find what is wrong.
>
> Right now, if I start heartbeat then both nodes get the IP (splitbrain).  If I change symmetrical to false then neither nodes get the IP.  I also tried configuring this in the gui which was even worse.  It would not even show you the correct state of things.  When there was a splitbrain it would happily show you that one of the nodes had the IP and the other didn't when in fact both nodes had the IP.  That and other numerous problems.
>
> details:
>
> ha.cf:
> ========================
> logfacility     daemon
> keepalive 1
> deadtime 10
> warntime 5
> initdead 20
> udpport 694
> ping 192.168.1.1
> bcast eth0 eth1
> auto_failback off
> respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s -a pingd_score
> apiauth cibmon   uid=hacluster
> respawn hacluster /usr/lib/heartbeat/cibmon -d
>
> node    grp-01-30-01
> node    grp-01-30-02
> use_logd yes
> compression     bz2
> compression_threshold 2
> crm yes
>
> ========================
>
> cib.xml:
> ========================
>  <cib have_quorum="true" generated="true" admin_epoch="0" epoch="8" num_updates="96" ignore_dtd="false" num_peers="1" cib_feature_revision="1.3" cib-last-written="Mon Feb  5 14:40:31 2007" ccm_transition="1" dc_uuid="67b0bfa7-0165-4a8c-9c0f-ec82e0ae2c91">
>    <configuration>
>      <crm_config>
>        <cluster_property_set id="cib-bootstrap-options">
>          <attributes>
>            <nvpair id="cib-bootstrap-options-symmetric_cluster" name="symmetric_cluster" value="True"/>
>            <nvpair id="cib-bootstrap-options-default_resource_stickiness" name="default_resource_stickiness" value="100"/>
>          </attributes>
>        </cluster_property_set>
>      </crm_config>
>      <nodes>
>        <node id="67b0bfa7-0165-4a8c-9c0f-ec82e0ae2c91" uname="grp-01-30-01" type="normal"/>
>        <node id="282ffe4f-a18b-4450-9495-0a42afb6ef18" uname="grp-01-30-02" type="normal"/>
>      </nodes>
>      <resources>
>        <group id="GRP_webserver_ip_RG">
>          <primitive id="GRP_webserver_ip_R" class="ocf" type="IPaddr" provider="heartbeat">
>            <instance_attributes id="GRP_webserver_ip_R_instance_attrs">
>              <attributes>
>                <nvpair id="941b5590-c6b8-4465-882d-ce52ec4f63e8" name="ip" value="192.168.1.215"/>
>              </attributes>
>            </instance_attributes>
>          </primitive>
>        </group>
>      </resources>
>      <constraints>
>        <rsc_location id="GRP_webserver_ip_RG:not_connected" rsc="GRP_webserver_ip_RG">
>          <rule id="GRP_webserver_ip_RG:not_connected:rule" score="-INFINITY">
>            <expression id="GRP_webserver_ip_RG:not_connected:expr" attribute="pingd_score" operation="not_defined"/>
>          </rule>
>        </rsc_location>
>      </constraints>
>    </configuration>
>  </cib>
>
> ========================
>
> log: grp-01-30-01
> ========================
> log: grp-01-30-01
> ========================
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6506]: info: Enabling logging daemon
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6506]: info: logfile and debug file are those specified in logd config file (default /etc/logd.cf)
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6506]: WARN: logd is enabled but logfile/debugfile/logfacility is still configured in ha.cf
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6506]: info: **************************
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6506]: info: Configuration validated. Starting heartbeat 2.0.8
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: heartbeat: version 2.0.8
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: Heartbeat generation: 48
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: G_main_add_TriggerHandler: Added signal manual handler
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: G_main_add_TriggerHandler: Added signal manual handler
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: Removing /var/run/heartbeat/rsctmp failed, recreating.
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: glib: ping heartbeat started.
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> Feb  5 14:39:48 grp-01-30-01 heartbeat: [6507]: info: Local status now set to: 'up'
> Feb  5 14:39:49 grp-01-30-01 heartbeat: [6507]: info: Link 192.168.1.1:192.168.1.1 up.
> Feb  5 14:39:49 grp-01-30-01 heartbeat: [6507]: info: Status update for node 192.168.1.1: status ping
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: WARN: node grp-01-30-02: is dead
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Comm_now_up(): updating status to active
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Local status now set to: 'active'
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/pingd -m 100 -d 5s -a pingd_score" (0,0)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/cibmon -d" (100,101)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/ccm" (100,101)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/cib" (100,101)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/lrmd -r" (0,0)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/stonithd" (0,0)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/attrd" (100,101)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/crmd" (100,101)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6507]: info: Starting child client "/usr/lib/heartbeat/mgmtd -v" (0,0)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6525]: info: Starting "/usr/lib/heartbeat/pingd -m 100 -d 5s -a pingd_score" as uid 0  gid 0 (pid 6525)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6526]: info: Starting "/usr/lib/heartbeat/cibmon -d" as uid 100  gid 101 (pid 6526)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6527]: info: Starting "/usr/lib/heartbeat/ccm" as uid 100  gid 101 (pid 6527)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6528]: info: Starting "/usr/lib/heartbeat/cib" as uid 100  gid 101 (pid 6528)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6529]: info: Starting "/usr/lib/heartbeat/lrmd -r" as uid 0  gid 0 (pid 6529)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6530]: info: Starting "/usr/lib/heartbeat/stonithd" as uid 0  gid 0 (pid 6530)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6531]: info: Starting "/usr/lib/heartbeat/attrd" as uid 100  gid 101 (pid 6531)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6532]: info: Starting "/usr/lib/heartbeat/crmd" as uid 100  gid 101 (pid 6532)
> Feb  5 14:40:09 grp-01-30-01 heartbeat: [6533]: info: Starting "/usr/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 6533)
> Feb  5 14:40:09 grp-01-30-01 stonithd: [6530]: info: Signing in with heartbeat.
> Feb  5 14:40:09 grp-01-30-01 cib: [6528]: WARN: crm_is_writable: /var/lib/heartbeat/crm/cib.xml should be owned and r/w by group haclient
> Feb  5 14:40:09 grp-01-30-01 cib: [6528]: info: readCibXmlFile: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.xml
> Feb  5 14:40:09 grp-01-30-01 cib: [6528]: info: log_data_element: readCibXmlFile: [on-disk]         <primitive id="GRP_webserver_ip_R" class="ocf" type="IPaddr" provider="heartbeat">
> Feb  5 14:40:09 grp-01-30-01 stonithd: [6530]: notice: /usr/lib/heartbeat/stonithd start up successfully.
> Feb  5 14:40:30 grp-01-30-01 pengine: [6539]: info: native_print:     GRP_webserver_ip_R        (heartbeat::ocf:IPaddr):        Stopped
> Feb  5 14:40:30 grp-01-30-01 pengine: [6539]: info: process_pe_message: Transition 0: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-27.bz2
> Feb  5 14:40:30 grp-01-30-01 cibmon: [6526]: info: log_data_element: cib_update: +           <lrm_resource id="GRP_webserver_ip_R" type="IPaddr" class="ocf" provider="heartbeat">
> Feb  5 14:40:31 grp-01-30-01 pengine: [6539]: info: native_print:     GRP_webserver_ip_R        (heartbeat::ocf:IPaddr):        Started grp-01-30-01
> Feb  5 14:40:31 grp-01-30-01 pengine: [6539]: info: process_pe_message: Transition 1: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-28.bz2
>
> ========================
>
> log: grp-01-30-02
> ========================
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5347]: info: Enabling logging daemon
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5347]: info: logfile and debug file are those specified in logd config file (default /etc/logd.cf)
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5347]: WARN: logd is enabled but logfile/debugfile/logfacility is still configured in ha.cf
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5347]: info: **************************
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5347]: info: Configuration validated. Starting heartbeat 2.0.8
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: heartbeat: version 2.0.8
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: Heartbeat generation: 44
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: G_main_add_TriggerHandler: Added signal manual handler
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: G_main_add_TriggerHandler: Added signal manual handler
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: Removing /var/run/heartbeat/rsctmp failed, recreating.
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: glib: ping heartbeat started.
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> Feb  5 14:39:55 grp-01-30-02 heartbeat: [5348]: info: Local status now set to: 'up'
> Feb  5 14:39:56 grp-01-30-02 heartbeat: [5348]: info: Link 192.168.1.1:192.168.1.1 up.
> Feb  5 14:39:56 grp-01-30-02 heartbeat: [5348]: info: Status update for node 192.168.1.1: status ping
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: WARN: node grp-01-30-01: is dead
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Comm_now_up(): updating status to active
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Local status now set to: 'active'
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/pingd -m 100 -d 5s -a pingd_score" (0,0)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/cibmon -d" (100,101)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/ccm" (100,101)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/cib" (100,101)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/lrmd -r" (0,0)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/stonithd" (0,0)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/attrd" (100,101)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/crmd" (100,101)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5348]: info: Starting child client "/usr/lib/heartbeat/mgmtd -v" (0,0)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5366]: info: Starting "/usr/lib/heartbeat/pingd -m 100 -d 5s -a pingd_score" as uid 0  gid 0 (pid 5366)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5367]: info: Starting "/usr/lib/heartbeat/cibmon -d" as uid 100  gid 101 (pid 5367)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5368]: info: Starting "/usr/lib/heartbeat/ccm" as uid 100  gid 101 (pid 5368)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5369]: info: Starting "/usr/lib/heartbeat/cib" as uid 100  gid 101 (pid 5369)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5370]: info: Starting "/usr/lib/heartbeat/lrmd -r" as uid 0  gid 0 (pid 5370)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5371]: info: Starting "/usr/lib/heartbeat/stonithd" as uid 0  gid 0 (pid 5371)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5372]: info: Starting "/usr/lib/heartbeat/attrd" as uid 100  gid 101 (pid 5372)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5373]: info: Starting "/usr/lib/heartbeat/crmd" as uid 100  gid 101 (pid 5373)
> Feb  5 14:40:16 grp-01-30-02 heartbeat: [5374]: info: Starting "/usr/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 5374)
> Feb  5 14:40:16 grp-01-30-02 cib: [5369]: WARN: crm_is_writable: /var/lib/heartbeat/crm/cib.xml should be owned and r/w by group haclient
> Feb  5 14:40:16 grp-01-30-02 cib: [5369]: info: readCibXmlFile: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.xml
> Feb  5 14:40:16 grp-01-30-02 cib: [5369]: info: log_data_element: readCibXmlFile: [on-disk]         <primitive id="GRP_webserver_ip_R" class="ocf" type="IPaddr" provider="heartbeat">
> Feb  5 14:40:16 grp-01-30-02 stonithd: [5371]: info: Signing in with heartbeat.
> Feb  5 14:40:16 grp-01-30-02 stonithd: [5371]: notice: /usr/lib/heartbeat/stonithd start up successfully.
> Feb  5 14:40:34 grp-01-30-02 pengine: [5380]: info: native_print:     GRP_webserver_ip_R        (heartbeat::ocf:IPaddr):        Stopped
> Feb  5 14:40:34 grp-01-30-02 pengine: [5380]: info: process_pe_message: Transition 0: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-89.bz2
> Feb  5 14:40:35 grp-01-30-02 cibmon: [5367]: info: log_data_element: cib_update: +           <lrm_resource id="GRP_webserver_ip_R" type="IPaddr" class="ocf" provider="heartbeat">
> Feb  5 14:40:35 grp-01-30-02 pengine: [5380]: info: native_print:     GRP_webserver_ip_R        (heartbeat::ocf:IPaddr):        Started grp-01-30-02
> Feb  5 14:40:35 grp-01-30-02 pengine: [5380]: info: process_pe_message: Transition 1: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-90.bz2
>
> ========================
>
>
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>


More information about the Linux-HA mailing list