[Linux-HA] HA2 OCF CRM: Manage multiple DRBD Resources
Dominik Klein
dk at in-telegence.net
Thu Jul 5 00:19:01 MDT 2007
Good morning.
> Sure you didn't forget a --meta here?
Running it with "--meta" produced an error:
1: crm_resource -r ms-r0 -v '#default' --meta -p target_role
crm_resource: unrecognized option `--meta'
>> <crm_mon shows r0 "started" for both nodes -> not good>
>>
>> 1: drbdadm state r0
>> Unknown/TOO_LARGE
>> <OCF script needs to be changed to recognize this (maybe new drbd8)
>> state after just the module being loaded>
>> <done>
>
> Probably this is screwing up the initial start up probe we do. It
> appears drbd8 doesn't quite work, which doesn't come as a surprise. You
> will need to make a few more changes.
As you may have read further down, I did change a few more things to
address this issue. And since I get the same behaviour with drbd7 and the
original drbd script from heartbeat, I don't think my OCF script is the
cause.
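To illustrate the kind of state check discussed above, here is a minimal sketch of mapping the output of "drbdadm state" to a role. The function name classify_drbd_state and the returned words are hypothetical and not taken from the heartbeat drbd RA. Treating any unrecognized local role (such as the "Unknown" in "Unknown/TOO_LARGE") as "stopped" is an assumption about what the probe should report when only the module is loaded.

```shell
# Hypothetical helper, not part of the shipped drbd RA.
# $1 is the state string, e.g. "Primary/Secondary" or "Unknown/TOO_LARGE".
classify_drbd_state() {
    # Keep only the part before the slash: the local node's role.
    local_role=${1%%/*}
    case "$local_role" in
        Primary)   echo master ;;
        Secondary) echo slave ;;
        # Assumption: anything else (Unknown, empty output) counts as not running.
        *)         echo stopped ;;
    esac
}
```

On a node with drbd installed it could be called as: classify_drbd_state "$(drbdadm state r0 2>/dev/null)"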
>> So except for changing and copying the script, I started over from
>> reboot up to target_role=started for fs0
>> <now crm_mon shows r0:0 on acd-xen03 is master>
>> <fs0 is mounted on acd-xen03>
>> <2 online nodes, *4* resources>
>
> 4 resources? Weird.
Well, 2x master/slave plus 2x filesystem. To me, this looked rather ok.
What would you expect?
> It might be a bit late in the game to ask this, but
> which heartbeat version, exactly, are you running?
2.0.8 compiled from source on openSuSE 10.2
Which version do you recommend?
> Your drbd RA seems to still put the master_slave preference into the
> configuration section instead of a transient node attribute, which
> indicates you're not running our latest code?
Latest "stable" I think.
# /usr/lib/heartbeat/heartbeat -V
2.0.8
> That, too, reminds me of a bug which has been fixed in the past ...
Ok.
> 20s is low, you should increase it.
I'll figure out how to do this. If you want to drop me a line, that
might save some time :)
>> Jul 4 14:45:59 ACD-xen03 drbd_master_slave[7406]: [7435]: DEBUG: DK
>> before crm_master -v 75
>> Jul 4 14:45:59 ACD-xen03 drbd_master_slave[7406]: [7436]: DEBUG: r1:
>> Calling /usr/sbin/crm_master -v 75
>> ########### notice: +20s
>
> But true, this is weird, it should take so long.
Did you forget a "not" in this sentence?
> Hm, I don't have a good idea off the top of my head. I'd need to try and
> reproduce on my own cluster.
Ok. That would be very nice.
Regards
Dominik