[Linux-HA] pingd failover in active/standby cluster - SOLVED
Matt Zagrabelny
mzagrabe at d.umn.edu
Thu Oct 4 08:16:10 MDT 2007
On Wed, 2007-10-03 at 14:37 -0500, Matt Zagrabelny wrote:
> Update:
>
> I ran ptest on 'cody', the primary node. It appears to be computing the
> pingd values incorrectly: the multiplier seems to be only "1", when in
> the cib.xml file it is "100". As a result, I am getting the following
> comparison in the output of the ptest command:
>
> debug: native_assign_node: Color external_VIP, Node[0] cody: 51
> debug: native_assign_node: Color external_VIP, Node[1] tim: 2
>
> which is wrong; it should be:
>
> debug: native_assign_node: Color external_VIP, Node[0] cody: 150
> debug: native_assign_node: Color external_VIP, Node[1] tim: 200
The problem was that I had put the pingd multiplier in the wrong
attributes section: it was in the clone's attributes section rather than
in the pingd primitive's attributes section.
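
To make the arithmetic concrete, here is a minimal Python sketch of how
the scores quoted above appear to be composed. It assumes, based only on
the numbers in the ptest output, that cody has a location preference of
50 and tim of 0, and that each node's pingd value is the number of ping
nodes it can reach (1 for cody, 2 for tim) times the multiplier. The
function name and parameters are hypothetical; this is not the policy
engine's actual code.

```python
def node_score(location_preference, reachable_ping_nodes, multiplier):
    """Hypothetical model: static preference plus the pingd attribute value."""
    pingd_value = reachable_ping_nodes * multiplier
    return location_preference + pingd_value


# Assumed inputs, inferred from the ptest output (not read from cib.xml):
# (location preference, reachable ping nodes)
nodes = {"cody": (50, 1), "tim": (0, 2)}

for multiplier in (1, 100):
    scores = {
        name: node_score(pref, reachable, multiplier)
        for name, (pref, reachable) in nodes.items()
    }
    winner = max(scores, key=scores.get)
    print(f"multiplier={multiplier}: {scores} -> {winner}")
```

With a multiplier of 1 this gives cody 51 and tim 2, so the resources stay
on cody; with a multiplier of 100 it gives cody 150 and tim 200, so tim
would win, which matches the values I expected.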
> and 'tim' should then get the resources. Also, shouldn't it be computing
> the values for the group, which is "monolith_resources", not
> "external_VIP"?
I still don't understand the following output, though. Any explanation
would be great.
> With regard to the last question, I do not understand why the following
> resource (internal_VIP) is getting different values from those above (51
> and 2), even though it is in the same resource group
> (monolith_resources) as external_VIP:
>
> debug: native_assign_node: Color internal_VIP, Node[0] cody: 1000000
> debug: native_assign_node: Color internal_VIP, Node[1] tim: -1000000
>
>
> Here is the output I am referencing for the ptest command.
>
> # ptest -VVVVVV -L
> ptest[3876]: 2007/10/03_14:05:59 info: main: =#=#=#=#= Getting XML
> =#=#=#=#=
> ptest[3876]: 2007/10/03_14:05:59 info: main: Reading XML from: live
> cluster
> ptest[3876]: 2007/10/03_14:05:59 notice: main: Required feature set:
> 1.1
> ptest[3876]: 2007/10/03_14:05:59 notice: cluster_option: Using default
> value '60s' for cluster option 'cluster-delay'
> ptest[3876]: 2007/10/03_14:05:59 notice: cluster_option: Using default
> value '-1' for cluster option 'pe-error-series-max'
> ptest[3876]: 2007/10/03_14:05:59 notice: cluster_option: Using default
> value '-1' for cluster option 'pe-warn-series-max'
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: Default action
> timeout: 5s
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: Default
> stickiness: 0
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: Default failure
> stickiness: 0
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: STONITH of failed
> nodes is disabled
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: Cluster is
> symmetric - resources can run anywhere by default
> ptest[3876]: 2007/10/03_14:05:59 debug: unpack_config: On loss of CCM
> Quorum: Stop ALL resources
> ptest[3876]: 2007/10/03_14:05:59 info: determine_online_status: Node tim
> is online
> ptest[3876]: 2007/10/03_14:05:59 info: determine_online_status: Node
> cody is online
> ptest[3876]: 2007/10/03_14:05:59 info: unpack_find_resource: Internally
> renamed pingd-child:0 on cody to pingd-child:1
> ptest[3876]: 2007/10/03_14:05:59 debug: get_node_score: Rule
> monolith_resources_connected_rule: node tim had value 2 for pingd
> ptest[3876]: 2007/10/03_14:05:59 debug: get_node_score: Rule
> monolith_resources_connected_rule: node cody had value 1 for pingd
> ptest[3876]: 2007/10/03_14:05:59 debug: get_node_score: Rule
> monolith_resources_connected_rule: node tim had value 2 for pingd
> ptest[3876]: 2007/10/03_14:05:59 debug: get_node_score: Rule
> monolith_resources_connected_rule: node cody had value 1 for pingd
> ptest[3876]: 2007/10/03_14:05:59 info: group_print: Resource Group:
> monolith_resources
> ptest[3876]: 2007/10/03_14:05:59 info: native_print: external_VIP
> (heartbeat::ocf:IPaddr2): Started cody
> ptest[3876]: 2007/10/03_14:05:59 info: native_print: internal_VIP
> (heartbeat::ocf:IPaddr2): Started cody
> ptest[3876]: 2007/10/03_14:05:59 info: clone_print: Clone Set: pingd
> ptest[3876]: 2007/10/03_14:05:59 info: native_print: pingd-child:0
> (heartbeat::ocf:pingd): Started tim
> ptest[3876]: 2007/10/03_14:05:59 info: native_print: pingd-child:1
> (heartbeat::ocf:pingd): Started cody
> ptest[3876]: 2007/10/03_14:05:59 debug: group_rsc_location: Processing
> rsc_location prefered_location_monolith_resources for monolith_resources
> ptest[3876]: 2007/10/03_14:05:59 debug: group_rsc_location: Processing
> rsc_location monolith_resources_connected_rule for monolith_resources
> ptest[3876]: 2007/10/03_14:05:59 debug: native_print: Allocating:
> external_VIP (heartbeat::ocf:IPaddr2): Started cody
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> external_VIP, Node[0] cody: 51
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> external_VIP, Node[1] tim: 2
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Assigning
> cody to external_VIP
> ptest[3876]: 2007/10/03_14:05:59 debug: native_print: Allocating:
> internal_VIP (heartbeat::ocf:IPaddr2): Started cody
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> internal_VIP, Node[0] cody: 1000000
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> internal_VIP, Node[1] tim: -1000000
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Assigning
> cody to internal_VIP
> ptest[3876]: 2007/10/03_14:05:59 notice: NoRoleChange: Leave resource
> external_VIP (cody)
> ptest[3876]: 2007/10/03_14:05:59 notice: NoRoleChange: Leave resource
> internal_VIP (cody)
> ptest[3876]: 2007/10/03_14:05:59 debug: native_print: Allocating:
> pingd-child:0 (heartbeat::ocf:pingd): Started tim
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> pingd-child:0, Node[0] tim: 1
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> pingd-child:0, Node[1] cody: 0
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Assigning
> tim to pingd-child:0
> ptest[3876]: 2007/10/03_14:05:59 debug: native_print: Allocating:
> pingd-child:1 (heartbeat::ocf:pingd): Started cody
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> pingd-child:1, Node[0] cody: 1
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Color
> pingd-child:1, Node[1] tim: -1000000
> ptest[3876]: 2007/10/03_14:05:59 debug: native_assign_node: Assigning
> cody to pingd-child:1
> ptest[3876]: 2007/10/03_14:05:59 debug: clone_color: Allocated 2 pingd
> instances of a possible 2
> ptest[3876]: 2007/10/03_14:05:59 notice: NoRoleChange: Leave resource
> pingd-child:0 (tim)
> ptest[3876]: 2007/10/03_14:05:59 notice: NoRoleChange: Leave resource
> pingd-child:1 (cody)
> ptest[3876]: 2007/10/03_14:05:59 debug: init_dotfile: PE_DOT: digraph
> "g" {
> ptest[3876]: 2007/10/03_14:05:59 debug: main: PE_DOT: }
> ptest[3876]: 2007/10/03_14:05:59 info: unpack_graph: Unpacked transition
> 0: 0 actions in 0 synapses
> ptest[3876]: 2007/10/03_14:05:59 info: set_default_graph_functions:
> Setting default graph functions
> ptest[3876]: 2007/10/03_14:05:59 debug: run_graph:
> ====================================================
> ptest[3876]: 2007/10/03_14:05:59 info: run_graph: Transition 0:
> (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0)
--
Matt Zagrabelny - mzagrabe at d.umn.edu - (218) 726 8844
University of Minnesota Duluth
Information Technology Systems & Services
PGP key 1024D/84E22DA2 2005-11-07
Fingerprint: 78F9 18B3 EF58 56F5 FC85 C5CA 53E7 887F 84E2 2DA2
He is not a fool who gives up what he cannot keep to gain what he cannot
lose.
-Jim Elliot