[Linux-HA] (drbd) master/slave monitoring operations

Andrew Beekhof beekhof at gmail.com
Wed Oct 10 01:37:17 MDT 2007


On 10/9/07, Raoul Bhatia [IPAX] <r.bhatia at ipax.at> wrote:
> hi,
>
> as discussed a couple of days ago, monitoring actions do not "happen"
> by themselves. moreover, i learned that one has to specify separate
> monitoring actions for different roles. now my questions are:
>
>
> 1) What is the difference between role="Slave" and role="Started"?

not much.  it just means that the resource was a master at some point
and got demoted.

>
> 2) Why does my heartbeat setup run amok, when i replace
>    <op name="monitor" id="drbd_mysql_mon_normal" interval="5s"
>        timeout="10s" role="Started" />
>
> with
>    <op name="monitor" id="drbd_mysql_mon_normal" interval="5s"
>        timeout="10s" role="Slave" />
>    <op name="monitor" id="drbd_mysql_mon_master" interval="5s"
>        timeout="11s" role="Master" />

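because you can't have two operations with the same name+interval.
both of your ops are name="monitor" with interval="5s", so they clash
even though the ids and roles differ.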
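
as a rough sketch of one way around it: give each role's monitor its own
interval. the 6s value below is only an assumption for illustration and is
not taken from the attached cib.xml; any interval that differs from the
master's should avoid the name+interval collision.

```xml
<!-- sketch only: interval="6s" is an illustrative value, not from the original config -->
<op name="monitor" id="drbd_mysql_mon_normal" interval="6s" timeout="10s" role="Slave" />
<op name="monitor" id="drbd_mysql_mon_master" interval="5s" timeout="11s" role="Master" />
```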

>     or when i simply add the master monitor operation?
>
> my logfiles show:
> > pengine[27942]: 2007/10/09_10:59:01 WARN: process_pe_message: Transition 840: WARNINGs found during PE processing. PEngine Input stored in: /var/lib/heartbeat/pengine/pe-warn-1886.bz2
> > pengine[27942]: 2007/10/09_10:59:01 WARN: native_color: Resource drbd_mysql:1 cannot run anywhere
> > pengine[27942]: 2007/10/09_10:59:01 WARN: native_color: Resource mysql-server cannot run anywhere
> > crmd[27938]: 2007/10/09_10:59:01 ERROR: cancel_op_key: No known drbd_mysql:0_monitor_5000 operation to cancel
> > tengine[27941]: 2007/10/09_10:59:01 WARN: status_from_rc: Action cancel on webcluster01 failed (target: 8 vs. rc: 0): Error
> > tengine[27941]: 2007/10/09_10:59:01 WARN: update_failcount: Updating failcount for drbd_mysql:0 on 49e81295-8e2f-4aeb-98f3-a14de6f62298 after failed monitor: rc=0
> > pengine[27942]: 2007/10/09_10:59:01 WARN: process_pe_message: Transition 841: WARNINGs found during PE processing. PEngine Input stored in: /var/lib/heartbeat/pengine/pe-warn-1887.bz2
>
> i attached the cib.xml file
>
> 3) and last but not least, why are the roles for <op .. /> written with
> a capital first letter (Started instead of started)? as far as i can see,
> this is one of the few cases where case matters. :)

i think i felt it looked better when printed by crm_mon etc.
perhaps it wasn't a great idea


