[Linux-HA] confused about setting drac5 stonith
Frank
frank at si.ct.upc.edu
Tue Nov 20 03:48:58 MST 2007
Hi,
I've seen some discussions about this, but I'm still confused.
I'm using the drac5 stonith plugin from Thomas Paschy (thanks, Thomas),
but I don't know how to configure it so that it works correctly.
We have a cluster with 2 nodes, each one with its public address, and
each one with a drac5 device with a private address;
let's call them node1, node2, node1_drac and node2_drac. So node1 can
reset node2 by connecting to node2_drac,
and node2 can reset node1 by connecting to node1_drac.
So we have created two stonith resources called stonith_node1_drac (with
the node1_drac address) and stonith_node2_drac (with
the node2_drac address); stonith_node1_drac needs to run on node2
because it is node2 which can reboot node1, and stonith_node2_drac
needs to run on node1 because it is node1 which can reboot node2.
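To make the intended placement explicit, here is a minimal Python sketch of the rule described above: a stonith resource may run on any node except the one it fences. It only models the rule; it is not heartbeat configuration, and the helper names (allowed_hosts, misplaced) are hypothetical.

```python
# Hypothetical model of the placement described in the post.
# Keys are the stonith resources, values are the nodes they fence.
FENCES = {
    "stonith_node1_drac": "node1",  # talks to node1_drac, resets node1
    "stonith_node2_drac": "node2",  # talks to node2_drac, resets node2
}
NODES = ["node1", "node2"]


def allowed_hosts(resource, fences=FENCES, nodes=NODES):
    """Return the nodes a stonith resource may run on: all but its target."""
    target = fences[resource]
    return [node for node in nodes if node != target]


def misplaced(placement, fences=FENCES):
    """Return the resources that are running on the node they should fence."""
    return [res for res, host in placement.items() if host == fences[res]]


# The placement described in the post.
intended = {"stonith_node1_drac": "node2", "stonith_node2_drac": "node1"}
print(allowed_hosts("stonith_node1_drac"))
print(misplaced(intended))
```

With the placement from the post, misplaced() returns an empty list, so under this model the resources are where they should be and the problem would have to lie elsewhere.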
If we start them this way, they start OK. But when we forced a stonith
condition on node2 (by killing heartbeat), node1 was not able to reset
node2; we got
this in the logs:
pengine[8955]: 2007/11/20_11:05:05 WARN: stage6: Scheduling Node kripton
for STONITH
pengine[8955]: 2007/11/20_11:05:05 info: native_stop_constraints:
drac_argon_stop_0 is implicit after kripton is fenced
pengine[8955]: 2007/11/20_11:05:05 WARN: process_pe_message: Transition
80: WARNINGs found during PE processing. PEngine Input stored in:
/var/lib/heartbeat/pengine/pe-warn-75.bz2
pengine[8955]: 2007/11/20_11:05:05 info: process_pe_message:
Configuration WARNINGs found during PE processing. Please run
"crm_verify -L" to identify issues.
stonithd[3978]: 2007/11/20_11:05:35 ERROR: Failed to STONITH the node
kripton: optype=RESET, op_result=TIMEOUT
tengine[8954]: 2007/11/20_11:05:35 info: tengine_stonith_callback:
call=-43, optype=1, node_name=kripton, result=2, node_list=,
action=8:80:84a50e41-ffda-4c9a-959a-76a61919413a
tengine[8954]: 2007/11/20_11:05:35 ERROR: tengine_stonith_callback:
Stonith of kripton failed (2)... aborting transition.
(kripton is node2 and argon is node1)
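For reading the excerpt above, a small Python sketch that pulls the target node, operation and result out of the stonithd error line and measures the gap from the moment the node was scheduled for STONITH. The timestamp format and the regular expression are assumptions based only on the lines shown here, and the wrapped log line is joined back into one string first.

```python
import re
from datetime import datetime

# Timestamp layout as it appears in the excerpt (assumed from these lines only).
STAMP = "%Y/%m/%d_%H:%M:%S"

# Matches the stonithd failure line once the mail wrapping has been undone.
FAIL_RE = re.compile(
    r"(\S+) ERROR: Failed to STONITH the node (\S+): "
    r"optype=(\w+), op_result=(\w+)"
)


def parse_failure(line):
    """Return time, node, optype and result from a failure line, or None."""
    match = FAIL_RE.search(line)
    if match is None:
        return None
    stamp, node, optype, result = match.groups()
    return {
        "time": datetime.strptime(stamp, STAMP),
        "node": node,
        "optype": optype,
        "result": result,
    }


# Time of the "Scheduling Node kripton for STONITH" warning in the excerpt.
scheduled = datetime.strptime("2007/11/20_11:05:05", STAMP)
failure = parse_failure(
    "stonithd[3978]: 2007/11/20_11:05:35 ERROR: Failed to STONITH the node "
    "kripton: optype=RESET, op_result=TIMEOUT"
)
elapsed = (failure["time"] - scheduled).total_seconds()
print(failure["node"], failure["optype"], failure["result"], elapsed)
```

In this excerpt the failure is logged 30 seconds after the node is scheduled for fencing, and the result is a timeout rather than an explicit error from the device.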
It seems that something is mixed up with the addresses.
Can anyone help?
Thanks.
Frank
UPC - Barcelona - Spain