[Linux-HA] timeouts revisited

Dejan Muhamedagic dejanmm at fastmail.fm
Tue Aug 8 19:51:26 MDT 2006




Hi,

CTS has been running for about half an hour and so far no
serious problems have been encountered.

There's only this timeout issue which has been bothering me
recently, as some of you may know.

Though it happens almost exclusively with network
interfaces,* I now think that it's some kind of lrm problem,
or perhaps an lrm/crm interaction. It looks as if crm expected
an operation to have been started by lrm and it hasn't been,
or as if lrm meant to start it but then somehow forgot to,
or something in between. I hope that somebody can unravel
this from the stuff I'll attach.

Setup: HB CVS as of Aug  8 16:33. SLES9: two nodes i386 and
one node (sapcl03) x86_64.

BTW, logging is now handled by syslog-ng, but I see some
strange output (such as some chars missing at the end of
the line). ha_logd is still used in between. Should ha_logd
be removed from the equation?

Cheers,

Dejan

*) Network interfaces are monitored far more often than
other resources in this configuration (default: 5s), hence
the higher probability of a timeout showing up there.
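
To put a rough number on the footnote, here is a minimal sketch of
the arithmetic. It assumes every monitor operation has the same small
chance of timing out regardless of resource type. The 0.001
probability and the 120s interval for "other" resources are made-up
values for illustration only; only the 5s interval and the half-hour
run come from this setup.

```python
def monitor_ops(run_seconds, interval_seconds):
    """Number of monitor operations issued during a run."""
    return run_seconds // interval_seconds


def expected_timeouts(run_seconds, interval_seconds, p_timeout):
    """Expected timeouts if each operation times out with probability p_timeout."""
    return monitor_ops(run_seconds, interval_seconds) * p_timeout


RUN_SECONDS = 30 * 60      # about half an hour of CTS
NIC_INTERVAL = 5           # network interface monitor interval (from this setup)
OTHER_INTERVAL = 120       # hypothetical interval for other resources
P_TIMEOUT = 0.001          # hypothetical per-operation timeout probability

nic_ops = monitor_ops(RUN_SECONDS, NIC_INTERVAL)
other_ops = monitor_ops(RUN_SECONDS, OTHER_INTERVAL)

print("nic monitor ops:  ", nic_ops)
print("other monitor ops:", other_ops)
print("nic expected timeouts:  ", expected_timeouts(RUN_SECONDS, NIC_INTERVAL, P_TIMEOUT))
print("other expected timeouts:", expected_timeouts(RUN_SECONDS, OTHER_INTERVAL, P_TIMEOUT))
```

Under these assumptions the interfaces simply get many more chances
to hit the problem, which fits with the cause being in lrm or crm
rather than in the interfaces themselves.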

-------------- next part --------------
A non-text attachment was scrubbed...
Name: ha-debug.bz2
Type: application/octet-stream
Size: 52028 bytes
Desc: not available
Url : http://lists.community.tummy.com/pipermail/linux-ha/attachments/20060809/0bdda5c1/ha-debug-0001.obj
-------------- next part --------------
traditional_compression false
coredumps true
use_logd yes
keepalive 2
deadtime 6
initdead 10
deadping 6
mcast   eth0 225.0.0.1 694 1 0
mcast   eth1 225.0.0.2 694 1 0
auto_failback	legacy
node sapcl01
node sapcl02
node sapcl03
#node lingws
ping 9.158.3.144 9.158.29.46 9.158.30.40
crm	on
respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s
debug 1
-------------- next part --------------


============
Last updated: Tue Aug  8 23:49:18 2006
Current DC: sapcl03 (f20a2804-c822-4978-8d7f-96ebcd5db7be)
3 Nodes configured.
6 Resources configured.
============

Node: sapcl02 (bdcbaad6-5fdc-4880-a309-ccaaf70db357): online
Node: sapcl01 (cf68c349-b7ca-4495-9aaf-5e158969efef): online
Node: sapcl03 (f20a2804-c822-4978-8d7f-96ebcd5db7be): online

Resource Group: a1
    IPaddr_10_1_1_22	(heartbeat::ocf:IPaddr):	Started sapcl03
    IPaddr_192_168_1_22	(heartbeat::ocf:IPaddr):	Started sapcl03
    apache_a1	(heartbeat::ocf:apache):	Started sapcl03
Resource Group: a2
    IPaddr_10_1_1_23	(heartbeat::ocf:IPaddr):	Started sapcl01
    IPaddr_192_168_1_23	(heartbeat::ocf:IPaddr):	Started sapcl01
    apache_a2	(heartbeat::ocf:apache):	Started sapcl01
Resource Group: a3
    IPaddr_10_1_1_24	(heartbeat::ocf:IPaddr):	Started sapcl02
    IPaddr_192_168_1_24	(heartbeat::ocf:IPaddr):	Started sapcl02
    apache_a3	(heartbeat::ocf:apache):	Started sapcl02
Resource Group: a4
    IPaddr_10_1_1_25	(heartbeat::ocf:IPaddr):	Started sapcl03
    IPaddr_192_168_1_25	(heartbeat::ocf:IPaddr):	Started sapcl03
    apache_a4	(heartbeat::ocf:apache):	Started sapcl03
Resource Group: a5
    IPaddr_10_1_1_26	(heartbeat::ocf:IPaddr):	Started sapcl01
    IPaddr_192_168_1_26	(heartbeat::ocf:IPaddr):	Started sapcl01
    apache_a5	(heartbeat::ocf:apache):	Started sapcl01
Resource Group: a6
    IPaddr_10_1_1_13	(heartbeat::ocf:IPaddr):	Started sapcl02
    IPaddr_192_168_1_13	(heartbeat::ocf:IPaddr):	Started sapcl02
    apache_a6	(heartbeat::ocf:apache):	Started sapcl02
-------------- next part --------------
[bzip2-compressed attachment; binary data not shown]

