[Linux-HA] Resource not monitored?
Andreas Kurz
andreas.kurz at gmail.com
Wed Feb 7 06:43:03 MST 2007
On 2/7/07, Jim Wong <jwong at sharpcast.com> wrote:
> Folks,
>
> We may have done something goofy in our configuration here, but we're
> stumped. We've got a resource, managed by a custom OCF resource agent, that
> is listed as running in the output of crm_mon:
>
> xeroxd (sharpcast::ocf:xeroxd): Started abg01
>
> However, the associated process is definitely not running (due to a
> configuration error), and the monitor operation seems to indicate as such:
>
> bash-3.00# /usr/lib/ocf/resource.d/sharpcast/xeroxd monitor ; echo "Result: $?"
> xeroxd dead but pid file exists
> Result: 1
>
> It's been in this state for an hour or so, but heartbeat doesn't seem to
> have figured it out. Is there a particular reason heartbeat wouldn't be
> monitoring this resource? There doesn't seem to be anything relevant in the
> logs during that time...
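Do you have an explicit monitor operation configured for your
resource? Without one, heartbeat will not monitor the resource
regularly. The xeroxd_monitor_0 entry in your logs (interval="0") is
only the one-time probe run before the start, not a recurring check.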
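
As a rough sketch, a recurring monitor in the CIB would look something
like the fragment below. The primitive attributes are taken from your
crm_mon output; the op id and the interval and timeout values are only
assumptions for illustration, so pick ones that suit your daemon.

```xml
<!-- Sketch only: op id, interval and timeout are illustrative assumptions -->
<primitive id="xeroxd" class="ocf" provider="sharpcast" type="xeroxd">
  <operations>
    <!-- A non-zero interval makes this a recurring check -->
    <op id="xeroxd_monitor" name="monitor" interval="30s" timeout="20s"/>
  </operations>
</primitive>
```

With an op like this in place, a failing monitor (such as the exit
code 1 you get when running it by hand) should be noticed by heartbeat
instead of the resource staying listed as Started.
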
>
> Feb 7 02:18:48 abg01 crmd: [16643]: info: do_lrm_rsc_op:lrm.c Performing op
> start on xeroxd (interval=0ms, key=112:3c6c0960-29d1-4f5b-9497-da5324195891)
> Feb 7 02:18:48 abg01 tengine: [18920]: info: match_graph_event:events.c
> Action xeroxd_monitor_0 (3) confirmed
> Feb 7 02:18:48 abg01 tengine: [18920]: info: send_rsc_command:actions.c
> Initiating action 15: xeroxd_start_0 on abg01
> Feb 7 02:18:48 abg01 cibmon: [16645]: info: cib_update: +
> <lrm_resource id="xeroxd" type="xeroxd" class="ocf" provider="sharpcast"
> __crm_diff_marker__="added:top">
> Feb 7 02:18:48 abg01 cibmon: [16645]: info: cib_update: +
> <lrm_rsc_op id="xeroxd_monitor_0" operation="monitor"
> crm-debug-origin="do_update_resource"
> transition_key="112:3c6c0960-29d1-4f5b-9497-da5324195891"
> transition_magic="4:7;112:3c6c0960-29d1-4f5b-9497-da5324195891" call_id="42"
> crm_feature_set="1.0.6" rc_code="7" op_status="4" interval="0"
> op_digest="f2317cad3d54cec5d7d7aa7d0bf35cf8"/>
> Feb 7 02:18:48 abg01 lrmd: [16640]: info: RA output: (xeroxd:start:stdout)
> Starting xeroxd:
> Feb 7 02:18:48 abg01 lrmd: [16640]: info: RA output: (xeroxd:start:stdout)
> [
> Feb 7 02:18:48 abg01 lrmd: [16640]: info: RA output: (xeroxd:start:stdout)
> OK ]
> Feb 7 02:18:48 abg01 lrmd: [16640]: info: RA output: (xeroxd:start:stdout)
> Feb 7 02:18:48 abg01 crmd: [16643]: info: process_lrm_event:lrm.c LRM
> operation (40) start_0 on xeroxd complete
> Feb 7 02:18:48 abg01 tengine: [18920]: info: match_graph_event:events.c
> Action xeroxd_start_0 (15) confirmed
> Feb 7 02:18:48 abg01 cibmon: [16645]: info: cib_update: +
> <lrm_resource id="xeroxd">
> Feb 7 02:18:48 abg01 cibmon: [16645]: info: cib_update: +
> <lrm_rsc_op id="xeroxd_start_0" operation="start"
> crm-debug-origin="do_update_resource"
> transition_key="112:3c6c0960-29d1-4f5b-9497-da5324195891"
> transition_magic="0:0;112:3c6c0960-29d1-4f5b-9497-da5324195891" call_id="40"
> crm_feature_set="1.0.6" rc_code="0" op_status="0" interval="0"
> op_digest="f2317cad3d54cec5d7d7aa7d0bf35cf8"
> __crm_diff_marker__="added:top"/>
> Feb 7 02:35:09 abg01 pengine: [18921]: info: xeroxd
> (sharpcast::ocf:xeroxd): Started abg01
> Feb 7 02:35:09 abg01 pengine: [18921]: notice: NoRoleChange:native.c Leave
> resource xeroxd (abg01)
> Feb 7 02:40:29 abg01 pengine: [18921]: info: xeroxd
> (sharpcast::ocf:xeroxd): Started abg01
> Feb 7 02:40:29 abg01 pengine: [18921]: notice: NoRoleChange:native.c Leave
> resource xeroxd (abg01)
> Feb 7 02:40:30 abg01 pengine: [18921]: info: xeroxd
> (sharpcast::ocf:xeroxd): Started abg01
> Feb 7 02:40:31 abg01 pengine: [18921]: notice: NoRoleChange:native.c Leave
> resource xeroxd (abg01)
>
> --
> Jim Wong (jwong at sharpcast.com)