[Linux-HA] pingd, quorum, split-brain... should I give up?

Riccardo Perni riccardo.perni at aslromab.it
Fri Oct 26 07:07:59 MDT 2007


Riccardo Perni <riccardo.perni at aslromab.it> ha scritto:

> maloja01 at arcor.de ha scritto:
>
>> Riccardo Perni schrieb:
>>>
>>>
>>> Andrew Beekhof <beekhof at gmail.com> ha scritto:
>>>
>>>> On 10/22/07, Riccardo Perni <riccardo.perni at aslromab.it> wrote:
>>>>>>> Is it possible
>>>>>>> to handle this situation?
>>>>>>
>>>>>> You may try quorumd. See
>>>>>>
>>>>>> http://www.linux-ha.org/QuorumServerGuide
>>>>>
>>>>> I'm going to look at it, but is'n it another SPOF?
>>>>
>>>> by definition, no.
>>>> because you've already had at least one failure before quorumd
>>>> becomes relevant
>>>
>>> Do you mean that the cluster will continue to work even if I have a
>>> failure on the quorum server?
>>
>> yes exactly, the cluster continues to work, if (in your case) both nodes
>> are up and running. A double failer like quorum server down and one node
>> down will result in a "no service" state.
>>
>> But thats even better than running a service twice, if it not designed
>> to run on more than one node.
>>
>
> Well this seems the way to go then! thanks to you, Fabian and Andreas
>

Ok I've setup a quorum server, but now no one get the quorum and the  
resource is stopped in both nodes..
The connection between the nodes ad the quormund seems to work as you  
can see from this log session:

Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: version:2_0_8(6)
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: create new client 4
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: receive  
from client 4:
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug:  
cl_name:clusterpaghe, CN:cluesterpaghe02.aslromab.net,EMAIL=uoia at a
slromab.it
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: send to client 4:
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: receive 0  
byte or error from client 4
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: client 4  
disconnected
Oct 26 14:43:18 clusterpaghequorum quorumd: [6864]: debug: delete client 4
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: The  
certificate cn:cluesterpaghe02.aslromab.net,EMAIL=uoia at aslroma
b.it
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: version:2_0_8(6)
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: create new client 5
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: receive  
from client 5:
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug:  
cl_name:clusterpaghe, CN:cluesterpaghe02.aslromab.net,EMAIL=uoia at a
slromab.it
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: send to client 5:
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: receive 0  
byte or error from client 5
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: client 5  
disconnected
Oct 26 14:43:22 clusterpaghequorum quorumd: [6864]: debug: delete client 5

I wont  to cry!!
Any idea??

Thankyou Riccardo


----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.




More information about the Linux-HA mailing list