[Linux-HA] Help understand an incident

Peter Kruse pk at q-leap.com
Tue Jul 3 05:26:41 MDT 2007


Hello list!

today a failover occurred in one of our clusters.  Good news: it
succeeded.  But while looking through the logs we found
that messages are missing on one node, so we cannot say exactly
what happened.  Attached is the syslog from node-2 covering the
period for which there are no messages on node-1.  Is it possible
to tell from that log what happened on node-1?
In particular, there are messages like this:

Jul  3 11:22:59 beosrv-c-2 cibmon: [16501]: info: mask(cib_apply_diff): 
+             <lrm_rsc_op id="nfs:maillastnfs_stop_0" operation="stop" 
crm-debug-origin="do_update_resource" 
transition_key="6:ad6f57b8-295b-4c20-8e0f-e01494577dfb" 
transition_magic="2:152;6:ad6f57b8-295b-4c20-8e0f-e01494577dfb" 
call_id="45" rc_code="152" op_status="2" interval="0" 
__crm_diff_marker__="added:top"/>

Does that mean the action maillastnfs_stop_0 was run but returned
status 2?  Or is it possible that the action was never run
on node-1?
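
To make the fields in question easier to compare, here is a minimal
sketch that pulls the attributes out of the entry quoted above.  It
assumes the wrapped log line has been rejoined into a single XML element
(cibmon prefix, leading "+" and the __crm_diff_marker__ attribute
dropped).  The function name summarize() is made up for illustration,
and the layout "<op_status>:<rc_code>;<transition_key>" for
transition_magic is only inferred from this one entry, not taken from
the documentation.  The sketch does not say what the values mean.

```python
import xml.etree.ElementTree as ET

# The lrm_rsc_op element from the log above, rejoined onto one line.
ENTRY = (
    '<lrm_rsc_op id="nfs:maillastnfs_stop_0" operation="stop" '
    'crm-debug-origin="do_update_resource" '
    'transition_key="6:ad6f57b8-295b-4c20-8e0f-e01494577dfb" '
    'transition_magic="2:152;6:ad6f57b8-295b-4c20-8e0f-e01494577dfb" '
    'call_id="45" rc_code="152" op_status="2" interval="0"/>'
)


def summarize(entry):
    """Return the status-related fields of one lrm_rsc_op element (hypothetical helper)."""
    attrs = ET.fromstring(entry).attrib

    # Assumption: transition_magic is "<op_status>:<rc_code>;<transition_key>".
    status_and_rc, _, magic_key = attrs["transition_magic"].partition(";")
    magic_status, _, magic_rc = status_and_rc.partition(":")

    return {
        "id": attrs["id"],
        "operation": attrs["operation"],
        "call_id": int(attrs["call_id"]),
        "op_status": int(attrs["op_status"]),
        "rc_code": int(attrs["rc_code"]),
        # True when the magic string repeats op_status, rc_code and the key.
        "magic_consistent": (
            magic_status == attrs["op_status"]
            and magic_rc == attrs["rc_code"]
            and magic_key == attrs["transition_key"]
        ),
    }


if __name__ == "__main__":
    for name, value in summarize(ENTRY).items():
        print(f"{name}: {value}")
```

For this entry the sketch reports op_status and rc_code as separate
fields (2 and 152), so the "2" I am asking about is op_status, not the
return code.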

Thanks in advance.

	Peter

