[Linux-HA] crm_failcount queries quite slow?

Andrew Beekhof beekhof at gmail.com
Mon Jun 30 02:04:30 MDT 2008


On Fri, Apr 4, 2008 at 08:25, Dominik Klein <dk at in-telegence.net> wrote:
> Lars Marowsky-Bree wrote:
>>
>> On 2008-04-03T13:59:36, Dejan Muhamedagic <dejanmm at fastmail.fm> wrote:
>>
>>> Any crm* program is significantly slower on a non-DC node
>>> regardless of whether something's happening in the cluster. It's
>>> always been like that.
>
> I can confirm that. It's been for me ever since I started using heartbeat.
>
>> Hm, I've not personally observed that in my test cluster, or at least
>> not noticed anything out of line.
>>
>> "Significantly" slower is bad; we mandate that "DC or not DC" is _not_
>> the question, and that users shouldn't care about this designation.
>>
>> Could anyone who reproduces this report a few more details? Is it the
>> local node, the time it takes to process on the DC, or the network
>> roundtrip? (Should be observable using tcpdump/wireshark)
>
> Just 2 measurements:
>
> dktest2sles10:~# time crmadmin -D
> Designated Controller is: dktest2sles10
>
> real    0m0.005s
> user    0m0.004s
> sys     0m0.000s
>
> dktest1sles10:~/cib# time crmadmin -D
> Designated Controller is: dktest2sles10
>
> real    0m1.014s
> user    0m0.000s
> sys     0m0.004s
>
> dktest2sles10:~# time cibadmin -Q &> /dev/null
>
> real    0m0.009s
> user    0m0.004s
> sys     0m0.004s
>
> dktest1sles10:~/cib# time cibadmin -Q &> /dev/null
>
> real    0m1.713s
> user    0m0.004s
> sys     0m0.004s

I've just been testing this with the latest 0.7 code running on
openais and a 50k cib (so not exactly small).

c001n02:~ # time cibadmin -Ql > /dev/null
real	0m0.098s
user	0m0.060s
sys	0m0.008s

c001n02:~ # time cibadmin -Q > /dev/null
real	0m0.178s
user	0m0.052s
sys	0m0.008s

So while the local call is about twice as fast, the difference is less
than 1ms :-)
I'd call that a significant improvement over previous results.


More information about the Linux-HA mailing list