[Linux-HA] HA2 OCF CRM: Manage multiple DRBD Resources

Dominik Klein dk at in-telegence.net
Thu Jul 5 00:19:01 MDT 2007


Good morning.

> Sure you didn't forget a --meta here?

Running it with "--meta" produced an error:

1: crm_resource -r ms-r0 -v '#default' --meta -p target_role
crm_resource: unrecognized option `--meta'

>> <crm_mon shows r0 "started" for both nodes -> not good>
>>
>> 1: drbdadm state r0
>> Unknown/TOO_LARGE
>> <OCF script needs to be changed to recognize this (maybe new drbd8) 
>> state after just the module being loaded>
>> <done>
> 
> Probably this is screwing up the initial start up probe we do. It
> appears drbd8 doesn't quite work, which doesn't come as a surprise. You
> will need to make a few more changes.

I think you have read this later on - I did change a few more things to 
address this issue. And as I get the same behaviour with drbd7 and the 
original drbd script from heartbeat, I don't think this is an issue of 
my OCF script.

>> So except for changing and copying the script, I started over from 
>> reboot up to target_role=started for fs0
>> <now crm_mon show r0:0 on acd-xen03 is master>
>> <fs0 is mounted on acd-xen03>
>> <2 online nodes, *4* resources>
> 
> 4 resources? Weird. 

Well, 2x master/slave plus 2x filesystem. To me, this looked rather ok. 
What would you expect?

> It might be a bit late in the game to ask this, but
> which heartbeat version, exactly, are you running?

2.0.8 compiled from source on openSuSE 10.2
Which version do you recommend?

> You're drbd RA seems to still put the master_slave preference into the
> configuration section instead of a transient node attribute, which
> indicates you're not running our latest code?

Latest "stable" I think.
# /usr/lib/heartbeat/heartbeat -V
2.0.8

> That, too, reminds me of a bug which has been fixed in the past ...

Ok.

> 20s is low, you should increase it.

I'll figure out how to do this. If you want to drop me a line, that 
might save some time :)

>> Jul  4 14:45:59 ACD-xen03 drbd_master_slave[7406]: [7435]: DEBUG: DK 
>> before crm_master -v 75
>> Jul  4 14:45:59 ACD-xen03 drbd_master_slave[7406]: [7436]: DEBUG: r1: 
>> Calling /usr/sbin/crm_master -v 75
>> ########### notice: +20s
> 
> But true, this is weird, it should take so long.

Did you forget a "not" in this sentence?

> Hm, I don't have a good idea of the top of my head. I'd need to try and
> reproduce on my own cluster.

Ok. That would be very nice.

Regards
Dominik


More information about the Linux-HA mailing list