[Linux-HA] cl_status hb_api_signon error

aichains aichains at nonstophate.com
Fri Oct 19 06:12:56 MDT 2007


On Fri, 2007-10-19 at 12:31 +0200, Dejan Muhamedagic wrote:
> Hi,
> 
> On Thu, Oct 18, 2007 at 08:03:20PM -0400, aichains wrote:
> > please help me to understand why this cl_status error would occur:
> > 
> > 
> > 
> > root@master: ~ # cl_status listnodes
> > cl_status[4521]: 2007/10/18_19:53:31 ERROR: Cannot signon with heartbeat
> > cl_status[4521]: 2007/10/18_19:53:31 ERROR: REASON: hb_api_signon: Can't initiate connection  to heartbeat
> > 
> > 
> > 
> > and here is my config:
> > 
> > 
> > 
> > root@master: ~ # cat /etc/redhat-release
> > Red Hat Enterprise Linux ES release 4 (Nahant Update 5)
> > 
> > root@master: ~ # uname -a
> > Linux master.example.com 2.6.9-42.0.3.ELsmp #1 SMP Mon Sep 25 17:28:02 EDT 2006 i686 i686 i386 GNU/Linux
> > root@master: ~ # 
> > 
> > root@master: ~ # grep hac /etc/{passwd,group}
> > /etc/passwd:hacluster:x:501:501::/home/hacluster:/bin/bash
> > /etc/group:haclient:x:90:
> > /etc/group:hacluster:x:501:
> > 
> > root@master: ~ # rpm -qa | grep heartbeat
> > heartbeat-pils-2.0.4-1
> > heartbeat-2.0.4-1
> > heartbeat-stonith-2.0.4-1
> > root@master: ~ # 
> 
> This is too old. Please upgrade.
> 
> Thanks,
> 
> Dejan
> 
> > root@master: ~ # ps aux | grep heartbeat
> > root      4396  0.0  0.5 12220 12220 ?       SLs  Jan26   7:43 heartbeat: master control process
> > nobody    4408  0.0  0.2  5612 5612 ?        SL   Jan26   0:01 heartbeat: FIFO reader
> > nobody    4409  0.0  0.2  5608 5608 ?        SL   Jan26   1:38 heartbeat: write: bcast bond0
> > nobody    4410  0.0  0.2  5608 5608 ?        SL   Jan26   4:55 heartbeat: read: bcast bond0
> > root@master: ~ # 
> > 
> > root@master: ~ # service heartbeat status
> > heartbeat OK [pid 4396 et al] is running on master.example.com [master.example.com]...
> > root@master: ~ # 
> > 
> > root@master: ~ # cat /etc/ha.d/ha.cf
> > bcast           bond0
> > auto_failback   off
> > keepalive       5
> > deadtime        30
> > node            master.example.com
> > node            slave.example.com
> > respawn         hacluster /usr/local/lib/heartbeat/ipfail
> > apiauth         ipfail gid=haclient uid=hacluster
> > traditional_compression false
> > use_logd yes
> > root@master: ~ # 
> > 
> > 
> > -- 
> > aichains <aichains at nonstophate.com>
> > 
> > _______________________________________________
> > Linux-HA mailing list
> > Linux-HA at lists.linux-ha.org
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems

hello, thank you for the reply - while i do intend to upgrade, i would
like to solve this issue first.

i did find one other clue using strace cl_status listnodes:

connect(3, {sa_family=AF_FILE, path="/usr/local/var/run/heartbeat/register"}, 110) = -1 ECONNREFUSED (Connection refused)

this is the wrong path: cl_status SHOULD be trying to connect
to /var/run/heartbeat/register instead.

is there a way to change the path that cl_status tries to connect
to? do you think a restart of heartbeat would help?
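
one way to compare the two paths is to check what actually sits at each
of them. on Linux, ECONNREFUSED on a unix socket connect generally means
the path exists but nothing is listening on it, while a missing path
would give ENOENT. the script below is only a minimal sketch: check_sock
is a made-up helper name, and the two paths are the ones from the strace
output and from the expected location above.

```shell
#!/bin/sh
# Report what exists at each candidate heartbeat API socket path.
# check_sock is a hypothetical helper, not part of heartbeat.
check_sock() {
    if [ -S "$1" ]; then
        echo "socket"          # a unix domain socket file exists here
    elif [ -e "$1" ]; then
        echo "not-a-socket"    # something exists, but it is not a socket
    else
        echo "missing"         # nothing at this path
    fi
}

# Paths taken from the strace output and the expected location.
for p in /usr/local/var/run/heartbeat/register /var/run/heartbeat/register; do
    printf '%s: %s\n' "$p" "$(check_sock "$p")"
done
```

a "socket" result only shows that the file is there; it does not prove
that a running heartbeat process is listening on it.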




-- 
aichains <aichains at nonstophate.com>


