[Linux-HA] Newbie struggling to get 1.99.5 to work right. Would like help, please!

Guochun Shi gshi at ncsa.uiuc.edu
Fri Jul 1 12:46:35 MDT 2005


At 06:06 PM 6/30/2005 -0700, you wrote:
>Hi Guochun,
>Thank you very much. I have compiled and tried the cvs version and I  
>don't see that error message in the log file now. I guess the bug is  
>fixed!

Thanks, I can close the bug now

>My test setup, however, is pretty simple. All I have is just IP  
>failover; no other services yet. My goal for the next day or two is  
>to create a cluster with DRBD, Postfix, DoveCot, and STONITH using  
>IPMI over LAN (direct cross connect between the two nodes). I'll  
>confirm the bug fix status again after I have that setup going.
>
>btw, does anyone know what the best way to use USB for heartbeat. I  
>can't quite cross connect the USB ports of two hosts like you can  
>with serial ports. I have one serial port that is already taken by  
>the console. But I have a couple of USB ports on these hosts that I  
>would like to use for heartbeat. What's my option? Do I have to get  
>couple of those dongles that convert USB to serial? I'd like to avoid  
>all extra stuff hanging in the heartbeat path. Is there any cabling  
>magic that can be done to directly connect the one host's USB port to  
>another and use the connection like a cross cabled serial connection?

Heartbeat does not have a communication module for USB now, but you are welcome
to write one :)

-Guochun


>Thanks
>Srinisan
>
>
>On Jun 30, 2005, at 11:49 AM, Guochun Shi wrote:
>
>>Hi, can you try the CVS version to see if the problem is still  
>>there? Thanks
>>
>>FYI we have a bug for the memroy leak problem (http://www.osdl.org/ developer_bugzilla/show_bug.cgi?id=660)
>>
>>-Guochun
>>
>>At 05:58 PM 6/21/2005 -0700, you wrote:
>>
>>>Thanks for the help.
>>>
>>>On Jun 21, 2005, at 3:44 PM, Alan Robertson wrote:
>>>
>>>>You can't use ipfail and crm both.
>>>
>>>I took crm out
>>>
>>>
>>>>/var/lib/heartbeat/ccm needs to be owned by user id hacluster, and
>>>>at least 700 (probably ought to be 755).
>>>
>>>That was it! I fixed the permissions and everything (well... almost
>>>everything) was peachy after that.
>>>
>>>I can't get my cib.xml to work right, but I just used to haresources
>>>file to keep things going.  I will worry about cib.xml later.
>>>
>>>But the bigger reason for this e-mail is, I am seeing a bunch of
>>>warnings which, per some prior posts, are not a good thing! Any ideas
>>>what may be wrong? Here are the warnings:
>>>
>>>Jun 21 17:35:45 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:35:45 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 946176
>>>Jun 21 17:37:05 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:37:05 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1081344
>>>Jun 21 17:38:23 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:38:23 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1216512
>>>Jun 21 17:39:40 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:39:40 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1351680
>>>Jun 21 17:40:59 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:40:59 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1486848
>>>Jun 21 17:42:17 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:42:17 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1622016
>>>Jun 21 17:43:34 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:43:34 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1757184
>>>Jun 21 17:44:53 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:44:53 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 1892352
>>>Jun 21 17:46:11 pe-1850-2 heartbeat: [16460]: WARN: Performed 1 more
>>>non-realtime malloc calls.
>>>Jun 21 17:46:11 pe-1850-2 heartbeat: [16460]: info: Total non-  
>>>realtime malloc bytes: 2027520
>>>
>>><They repeat roughly every 75 seconds>
>>>
>>>Thanks
>>>
>>>_______________________________________________
>>>Linux-HA mailing list
>>>Linux-HA at lists.linux-ha.org
>>>http://lists.linux-ha.org/mailman/listinfo/linux-ha
>>



More information about the Linux-HA mailing list