[Linux-HA] How to check/monitor OpenSIPS status in linuxHA

Ahmed Munir ahmedmunir007 at gmail.com
Mon Jul 20 06:02:28 MDT 2009


Hi,
I've made a resource file for OpenSIPs. The problem I'm facing is when I use
*sipsak -s sip:2001 at 192.168.0.184 <sip%3A2001 at 192.168.0.184> -H 127.0.0.1 *in
OpenSips_Status() function, OpenSIPs resource begin to start, failed and
later stop. Sample listed below;


OpenSips_Status( )
{
  sipsak -s sip:2001@@$OCF_RESKEY_ip -H 127.0.0.1 2> /dev/null > /dev/null
rc=$?

if [ $rc -ne 0 ]; then
       return $OCF_NOT_RUNNING
else
        return $OCF_SUCCESS

fi

}

The settings which works is;

OpenSips_Status( )
{
/etc/init.d/opensips status > /dev/null

rc=$?

if [ $rc -ne 0 ]; then
       return $OCF_NOT_RUNNING
else
       return $OCF_SUCCESS
fi
}

When I use second configuration no errors are found on log file. When I use
first configuration it gives me errors of rc=7(Even I've increased the
timeout values and you can also check the logs).

Using second configuration, the major problem arrives when one node goes
down and its virtual IP is taken by another machine, the OpenSIPs doesn't
update its arp table, so the calls on that virtual IP are not get forward
and gives timeout error.

I'm attaching my resource file and my ha configuration, please have a look.
And also kindly give me solution of "how I can monitor OpenSIPs status?"

-- 
Regards,

Ahmed Munir
-------------- next part --------------
Jul 20 18:51:43 ha1 tengine: [26695]: info: status_from_rc: Re-mapping op status to LRM_OP_ERROR for rc=7
Jul 20 18:51:43 ha1 crmd: [13703]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jul 20 18:51:43 ha1 tengine: [26695]: WARN: status_from_rc: Action monitor on ha1 failed (target: <null> vs. rc: 7): Error
Jul 20 18:51:43 ha1 tengine: [26695]: WARN: update_failcount: Updating failcount for OpenSips_1 on e651c120-b9a1-489a-baf7-caf0028ad540 after failed monitor: rc=7
Jul 20 18:51:43 ha1 tengine: [26695]: info: update_abort_priority: Abort priority upgraded to 1
Jul 20 18:51:43 ha1 tengine: [26695]: info: update_abort_priority: Abort action 0 superceeded by 2
Jul 20 18:51:43 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_monitor_10000 (1) confirmed on ha1 (rc=4)
Jul 20 18:51:43 ha1 tengine: [26695]: info: run_graph: Transition 23: (Complete=4, Pending=0, Fired=0, Skipped=0, Incomplete=0)
Jul 20 18:51:43 ha1 pengine: [26696]: info: determine_online_status: Node ha1 is online
Jul 20 18:51:43 ha1 pengine: [26696]: WARN: unpack_rsc_op: Processing failed op OpenSips_1_monitor_10000 on ha1: Error
Jul 20 18:51:43 ha1 pengine: [26696]: info: determine_online_status: Node ha2 is online
Jul 20 18:51:43 ha1 pengine: [26696]: notice: native_print: IPaddr_1    (heartbeat::ocf:IPaddr):        Started ha1
Jul 20 18:51:43 ha1 pengine: [26696]: notice: native_print: IPaddr_2    (heartbeat::ocf:IPaddr):        Started ha2
Jul 20 18:51:43 ha1 pengine: [26696]: notice: native_print: OpenSips_1  (heartbeat::ocf:testsip):       Started ha1 FAILED
Jul 20 18:51:43 ha1 pengine: [26696]: notice: native_print: OpenSips_2  (heartbeat::ocf:testsip):       Started ha2
Jul 20 18:51:43 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_1     (ha1)
Jul 20 18:51:43 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_2     (ha2)
Jul 20 18:51:43 ha1 pengine: [26696]: notice: NoRoleChange: Recover resource OpenSips_1 (ha1)
Jul 20 18:51:43 ha1 pengine: [26696]: notice: StopRsc:   ha1    Stop OpenSips_1
Jul 20 18:51:43 ha1 pengine: [26696]: notice: StartRsc:  ha1    Start OpenSips_1
Jul 20 18:51:43 ha1 pengine: [26696]: notice: RecurringOp: ha1     OpenSips_1_monitor_10000
Jul 20 18:51:43 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource OpenSips_2   (ha2)
Jul 20 18:51:43 ha1 crmd: [13703]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ]
Jul 20 18:51:43 ha1 tengine: [26695]: info: unpack_graph: Unpacked transition 24: 4 actions in 4 synapses
Jul 20 18:51:43 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 2: OpenSips_1_stop_0 on ha1
Jul 20 18:51:43 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_stop_0 key=2:24:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:43 ha1 lrmd: [13700]: info: rsc:OpenSips_1: stop
Jul 20 18:51:43 ha1 pengine: [26696]: info: process_pe_message: Transition 24: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-14038.bz2
Jul 20 18:51:43 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_monitor_10000 (call=156, rc=-2) Cancelled
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:tm:mod_init: TM - initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:maxfwd:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:usrloc:ul_init_locks: locks array size 512
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:registrar:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:textops:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:xlog:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:acc:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:auth:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:auth_db:mod_init: initializing...
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31153]: INFO:core:probe_max_receive_buffer: using a UDP receive buffer of 255 kb
Jul 20 18:51:43 ha1 last message repeated 2 times
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31179]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31207]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31209]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31203]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31196]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31198]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31210]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31184]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31194]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31202]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31212]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31208]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31182]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31190]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31200]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31211]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31192]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:43 ha1 /usr/local/sbin/opensips[31205]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_stop_0 (call=157, rc=0) complete
Jul 20 18:51:44 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_stop_0 (2) confirmed on ha1 (rc=0)
Jul 20 18:51:44 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 14: OpenSips_1_start_0 on ha1
Jul 20 18:51:44 ha1 tengine: [26695]: info: te_pseudo_action: Pseudo action 9 fired and confirmed
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_start_0 key=14:24:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:44 ha1 lrmd: [13700]: info: rsc:OpenSips_1: start
Jul 20 18:51:44 ha1 opensips: WARNING:core:fix_socket_list: could not rev. resolve 192.168.0.184
Jul 20 18:51:44 ha1 opensips: WARNING:core:fix_socket_list: could not rev. resolve 192.168.0.184
Jul 20 18:51:44 ha1 opensips: INFO:core:init_tcp: using epoll_lt as the TCP io watch method (auto detected)
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: NOTICE:core:main: version: opensips 1.5.1-notls (i386/linux)
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:core:main: using 32 Mb shared memory
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:core:main: using 1 Mb private memory per process
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: NOTICE:signaling:mod_init: initializing module ...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:sl:mod_init: Initializing StateLess engine
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:tm:mod_init: TM - initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:maxfwd:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:usrloc:ul_init_locks: locks array size 512
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:registrar:mod_init: initializing...
Jul 20 18:51:44 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_start_0 (call=158, rc=0) complete
Jul 20 18:51:44 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_start_0 (14) confirmed on ha1 (rc=0)
Jul 20 18:51:44 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 1: OpenSips_1_monitor_10000 on ha1
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_monitor_10000 key=1:24:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:44 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_monitor_10000 (call=159, rc=7) complete
Jul 20 18:51:44 ha1 tengine: [26695]: info: status_from_rc: Re-mapping op status to LRM_OP_ERROR for rc=7
Jul 20 18:51:44 ha1 tengine: [26695]: WARN: status_from_rc: Action monitor on ha1 failed (target: <null> vs. rc: 7): Error
Jul 20 18:51:44 ha1 tengine: [26695]: WARN: update_failcount: Updating failcount for OpenSips_1 on e651c120-b9a1-489a-baf7-caf0028ad540 after failed monitor: rc=7
Jul 20 18:51:44 ha1 tengine: [26695]: info: update_abort_priority: Abort priority upgraded to 1
Jul 20 18:51:44 ha1 tengine: [26695]: info: update_abort_priority: Abort action 0 superceeded by 2
Jul 20 18:51:44 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_monitor_10000 (1) confirmed on ha1 (rc=4)
Jul 20 18:51:44 ha1 tengine: [26695]: info: run_graph: Transition 24: (Complete=4, Pending=0, Fired=0, Skipped=0, Incomplete=0)
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_IPC_MESSAGE origin=route_message ]
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ]
Jul 20 18:51:44 ha1 tengine: [26695]: info: unpack_graph: Unpacked transition 25: 4 actions in 4 synapses
Jul 20 18:51:44 ha1 pengine: [26696]: info: determine_online_status: Node ha1 is online
Jul 20 18:51:44 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 2: OpenSips_1_stop_0 on ha1
Jul 20 18:51:44 ha1 pengine: [26696]: WARN: unpack_rsc_op: Processing failed op OpenSips_1_monitor_10000 on ha1: Error
Jul 20 18:51:44 ha1 pengine: [26696]: info: determine_online_status: Node ha2 is online
Jul 20 18:51:44 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 2: OpenSips_1_stop_0 on ha1
Jul 20 18:51:44 ha1 pengine: [26696]: WARN: unpack_rsc_op: Processing failed op OpenSips_1_monitor_10000 on ha1: Error
Jul 20 18:51:44 ha1 pengine: [26696]: info: determine_online_status: Node ha2 is online
Jul 20 18:51:44 ha1 pengine: [26696]: notice: native_print: IPaddr_1    (heartbeat::ocf:IPaddr):        Started ha1
Jul 20 18:51:44 ha1 pengine: [26696]: notice: native_print: IPaddr_2    (heartbeat::ocf:IPaddr):        Started ha2
Jul 20 18:51:44 ha1 pengine: [26696]: notice: native_print: OpenSips_1  (heartbeat::ocf:testsip):       Started ha1 FAILED
Jul 20 18:51:44 ha1 pengine: [26696]: notice: native_print: OpenSips_2  (heartbeat::ocf:testsip):       Started ha2
Jul 20 18:51:44 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_1     (ha1)
Jul 20 18:51:44 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_2     (ha2)
Jul 20 18:51:44 ha1 pengine: [26696]: notice: NoRoleChange: Recover resource OpenSips_1 (ha1)
Jul 20 18:51:44 ha1 pengine: [26696]: notice: StopRsc:   ha1    Stop OpenSips_1
Jul 20 18:51:44 ha1 pengine: [26696]: notice: StartRsc:  ha1    Start OpenSips_1
Jul 20 18:51:44 ha1 pengine: [26696]: notice: RecurringOp: ha1     OpenSips_1_monitor_10000
Jul 20 18:51:44 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource OpenSips_2   (ha2)
Jul 20 18:51:44 ha1 pengine: [26696]: info: process_pe_message: Transition 25: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-14039.bz2
Jul 20 18:51:44 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_stop_0 key=2:25:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:44 ha1 lrmd: [13700]: info: rsc:OpenSips_1: stop
Jul 20 18:51:44 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_monitor_10000 (call=159, rc=-2) Cancelled
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:textops:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:xlog:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:acc:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:auth:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:auth_db:mod_init: initializing...
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31241]: INFO:core:probe_max_receive_buffer: using a UDP receive buffer of 255 kb
Jul 20 18:51:44 ha1 last message repeated 2 times
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31266]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31300]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31294]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31292]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 cib: [13699]: info: sync_our_cib: Syncing CIB to ha2
Jul 20 18:51:44 ha1 cib: [13699]: info: sync_our_cib: Syncing CIB to ha2
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31291]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31289]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31277]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31287]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31298]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31273]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31283]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31296]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31269]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31279]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31285]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31281]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31275]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31271]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31299]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 /usr/local/sbin/opensips[31297]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:44 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_stop_0 (call=160, rc=0) complete
Jul 20 18:51:45 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_stop_0 (2) confirmed on ha1 (rc=0)
Jul 20 18:51:45 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 14: OpenSips_1_start_0 on ha1
Jul 20 18:51:45 ha1 tengine: [26695]: info: te_pseudo_action: Pseudo action 9 fired and confirmed
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_start_0 key=14:25:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:45 ha1 lrmd: [13700]: info: rsc:OpenSips_1: start
Jul 20 18:51:45 ha1 opensips: WARNING:core:fix_socket_list: could not rev. resolve 192.168.0.184
Jul 20 18:51:45 ha1 opensips: WARNING:core:fix_socket_list: could not rev. resolve 192.168.0.184
Jul 20 18:51:45 ha1 opensips: INFO:core:init_tcp: using epoll_lt as the TCP io watch method (auto detected)
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: NOTICE:core:main: version: opensips 1.5.1-notls (i386/linux)
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:core:main: using 32 Mb shared memory
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:core:main: using 1 Mb private memory per process
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: NOTICE:signaling:mod_init: initializing module ...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:sl:mod_init: Initializing StateLess engine
Jul 20 18:51:45 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_start_0 (call=161, rc=0) complete
Jul 20 18:51:45 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_start_0 (14) confirmed on ha1 (rc=0)
Jul 20 18:51:45 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 1: OpenSips_1_monitor_10000 on ha1
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_monitor_10000 key=1:25:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:45 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_monitor_10000 (call=162, rc=7) complete
Jul 20 18:51:45 ha1 tengine: [26695]: info: status_from_rc: Re-mapping op status to LRM_OP_ERROR for rc=7
Jul 20 18:51:45 ha1 tengine: [26695]: WARN: status_from_rc: Action monitor on ha1 failed (target: <null> vs. rc: 7): Error
Jul 20 18:51:45 ha1 tengine: [26695]: WARN: update_failcount: Updating failcount for OpenSips_1 on e651c120-b9a1-489a-baf7-caf0028ad540 after failed monitor: rc=7
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_IPC_MESSAGE origin=route_message ]
Jul 20 18:51:45 ha1 tengine: [26695]: info: update_abort_priority: Abort priority upgraded to 1
Jul 20 18:51:45 ha1 pengine: [26696]: info: determine_online_status: Node ha1 is online
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jul 20 18:51:45 ha1 tengine: [26695]: info: update_abort_priority: Abort action 0 superceeded by 2
Jul 20 18:51:45 ha1 pengine: [26696]: WARN: unpack_rsc_op: Processing failed op OpenSips_1_monitor_10000 on ha1: Error
Jul 20 18:51:45 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_monitor_10000 (1) confirmed on ha1 (rc=4)
Jul 20 18:51:45 ha1 pengine: [26696]: info: determine_online_status: Node ha2 is online
Jul 20 18:51:45 ha1 tengine: [26695]: info: run_graph: Transition 25: (Complete=4, Pending=0, Fired=0, Skipped=0, Incomplete=0)
Jul 20 18:51:45 ha1 pengine: [26696]: notice: native_print: IPaddr_1    (heartbeat::ocf:IPaddr):        Started ha1
Jul 20 18:51:45 ha1 pengine: [26696]: notice: native_print: IPaddr_2    (heartbeat::ocf:IPaddr):        Started ha2
Jul 20 18:51:45 ha1 pengine: [26696]: notice: native_print: OpenSips_1  (heartbeat::ocf:testsip):       Started ha1 FAILED
Jul 20 18:51:45 ha1 pengine: [26696]: notice: native_print: OpenSips_2  (heartbeat::ocf:testsip):       Started ha2
Jul 20 18:51:45 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_1     (ha1)
Jul 20 18:51:45 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource IPaddr_2     (ha2)
Jul 20 18:51:45 ha1 pengine: [26696]: notice: NoRoleChange: Recover resource OpenSips_1 (ha1)
Jul 20 18:51:45 ha1 pengine: [26696]: notice: StopRsc:   ha1    Stop OpenSips_1
Jul 20 18:51:45 ha1 pengine: [26696]: notice: StartRsc:  ha1    Start OpenSips_1
Jul 20 18:51:45 ha1 pengine: [26696]: notice: RecurringOp: ha1     OpenSips_1_monitor_10000
Jul 20 18:51:45 ha1 pengine: [26696]: notice: NoRoleChange: Leave resource OpenSips_2   (ha2)
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ]
Jul 20 18:51:45 ha1 tengine: [26695]: info: unpack_graph: Unpacked transition 26: 4 actions in 4 synapses
Jul 20 18:51:45 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 2: OpenSips_1_stop_0 on ha1
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_stop_0 key=2:26:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:45 ha1 lrmd: [13700]: info: rsc:OpenSips_1: stop
Jul 20 18:51:45 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_monitor_10000 (call=162, rc=-2) Cancelled
Jul 20 18:51:45 ha1 pengine: [26696]: info: process_pe_message: Transition 26: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-14040.bz2
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:tm:mod_init: TM - initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:maxfwd:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:usrloc:ul_init_locks: locks array size 512
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:registrar:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:textops:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:xlog:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:acc:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:auth:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:auth_db:mod_init: initializing...
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31330]: INFO:core:probe_max_receive_buffer: using a UDP receive buffer of 255 kb
Jul 20 18:51:45 ha1 last message repeated 2 times
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31356]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31389]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31376]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31375]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31367]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31373]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31372]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31371]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31384]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31374]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31370]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31365]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31363]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31361]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31359]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31387]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31386]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 /usr/local/sbin/opensips[31385]: INFO:core:sig_usr: signal 15 received
Jul 20 18:51:45 ha1 cib: [13699]: info: sync_our_cib: Syncing CIB to ha2
Jul 20 18:51:45 ha1 crmd: [13703]: info: process_lrm_event: LRM operation OpenSips_1_stop_0 (call=163, rc=0) complete
Jul 20 18:51:45 ha1 crmd: [13703]: info: do_lrm_rsc_op: Performing op=OpenSips_1_start_0 key=14:26:26908b7e-e1bf-4a4a-a0af-3628b94b972d)
Jul 20 18:51:45 ha1 lrmd: [13700]: info: rsc:OpenSips_1: start
Jul 20 18:51:45 ha1 tengine: [26695]: info: match_graph_event: Action OpenSips_1_stop_0 (2) confirmed on ha1 (rc=0)
Jul 20 18:51:45 ha1 tengine: [26695]: info: send_rsc_command: Initiating action 14: OpenSips_1_start_0 on ha1
Jul 20 18:51:45 ha1 tengine: [26695]: info: te_pseudo_action: Pseudo action 9 fired and confirmed
Jul 20 18:51:46 ha1 opensips: WARNING:core:fix_socket_list: could not rev. resolve 192.168.0.184
Jul 20 18:51:46 ha1 cib: [13699]: info: sync_our_cib: Syncing CIB to ha2
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cib.xml
Type: text/xml
Size: 7181 bytes
Desc: not available
URL: <http://lists.linux-ha.org/pipermail/linux-ha/attachments/20090720/14268508/attachment.xml>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ha.cf
Type: application/octet-stream
Size: 11230 bytes
Desc: not available
URL: <http://lists.linux-ha.org/pipermail/linux-ha/attachments/20090720/14268508/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: testsip
Type: application/octet-stream
Size: 2971 bytes
Desc: not available
URL: <http://lists.linux-ha.org/pipermail/linux-ha/attachments/20090720/14268508/attachment-0001.obj>


More information about the Linux-HA mailing list