[Freeipa-users] replication again :-(

thierry bordaz tbordaz at redhat.com
Wed May 20 14:17:18 UTC 2015


On 05/20/2015 03:46 PM, Janelle wrote:
> On 5/20/15 6:01 AM, thierry bordaz wrote:
>> On 05/20/2015 02:57 AM, Janelle wrote:
>>> On 5/19/15 12:04 AM, thierry bordaz wrote:
>>>> On 05/19/2015 03:42 AM, Janelle wrote:
>>>>> On 5/18/15 6:23 PM, Janelle wrote:
>>>>>> Once again, replication/sync has been lost. I really wish the 
>>>>>> product was more stable, it is so much potential and yet.
>>>>>>
>>>>>> Servers running for 6 days no issues. No new accounts or changes 
>>>>>> (maybe a few users changing passwords) and again, 5 out of 16 
>>>>>> servers are no longer in sync.
>>>>>>
>>>>>> I can test it easily by adding an account and then waiting a few 
>>>>>> minutes, then run "ipa  user-show --all username" on all the 
>>>>>> servers, and only a few of them have the account.  I have now 
>>>>>> waited 15 minutes, still no luck.
>>>>>>
>>>>>> Oh well.. I guess I will go look at alternatives. I had such high 
>>>>>> hopes for this tool. Thanks so much everyone for all your help in 
>>>>>> trying to get things stable, but for whatever reason, there is a 
>>>>>> random loss of sync among the servers and obviously this is not 
>>>>>> acceptable.
>>>>>>
>>>>>> regards
>>>>>> ~J
>>>>>
>>>
>>> All the replicas are happy again. I found these again:
>>>
>>> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
>>> unable to decode  {replica 23} 5553e3a3000000170000 55543240000300170000
>>> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
>>>
>>> What I also found to be interesting is that I have not deleted any 
>>> masters at all, so this was quite perplexing where the orphaned 
>>> entries came from.  However I did find 3 of the replicas did not 
>>> show complete RUV lists... While most of the replicas had a list of 
>>> all 16 servers, a couple of them listed only 4 or 5. (using 
>>> ipa-replica-manage list-ruv)
>> I don't know about the orphaned entries. Did you get entries below 
>> deleted parents ?
>>
>> AFAIK all replicas are master and so have an entry {replica <rid>} in 
>> the RUV. We should expect all servers having the same number of 
>> RUVelements (16, 4 or 5). The servers with 4 or 5 may be isolated so 
>> that they did not received updates from those with 16 RUVelements.
>> would you copy/paste an example of RUV with 16 and with 4-5 ?
>
> Now, the steps to clear this were:
>
> Removed the "unable to decode" with the direct ldapmodify's. This 
> worked across all replicas, which was nice and did not have to be 
> repeated in each one. In other words, entered on a single server, and 
> it was removed on all.
Hello,

Did you do direct ldapmodify onto the RUV entry 
(nsuniqueid=ffffffff-ffffffff-ffffffff-ffffffff,SUFFIX) , clean RUV ?

dc1-ipa1 and dc1-ipa2 are missing some RUVelement. If you do  an update 
on dc3-ipa1, is it replicated to dc1-ipa[12] ?

Also there are duplicated RID (9, 25) for dc1-ipa2.example.com:389. You 
may see some messages like 'attrlist_replace' in some error logs.
25 seems to be the new RID.

thanks
thierry

>
> re-initialized --from=good server on the ones with the short list.
>
> Waited 5 minutes to let everything settle, then started running tests 
> of adds/deletes which seemed to be just fine.
>
> Here are 2 of the DCs
>
> -------------------------------------
> Node dc1-ipa1
> -------------------------------------
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa4.example.com 389  4
> -------------------------------------
> Node dc1-ipa2
> -------------------------------------
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> -------------------------------------
> Node dc1-ipa3
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
> -------------------------------------
> Node dc1-ipa4
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
> -------------------------------------
> Node dc2-ipa1
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 23} 5553e3a3000000170000 55543240000300170000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
> -------------------------------------
> Node dc2-ipa2
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
> -------------------------------------
> Node dc2-ipa3
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
> -------------------------------------
> Node dc2-ipa4
> -------------------------------------
> dc3-ipa1.example.com 389  14
> dc3-ipa2.example.com 389  13
> dc3-ipa3.example.com 389  12
> dc3-ipa4.example.com 389  11
> dc2-ipa1.example.com 389  7
> dc2-ipa2.example.com 389  6
> dc2-ipa3.example.com 389  5
> dc2-ipa4.example.com 389  3
> dc4-ipa1.example.com 389  18
> dc4-ipa2.example.com 389  19
> dc4-ipa3.example.com 389  20
> dc4-ipa4.example.com 389  21
> dc1-ipa1.example.com 389  10
> dc1-ipa2.example.com 389  25
> dc1-ipa2.example.com 389  9
> dc1-ipa3.example.com 389  8
> dc1-ipa4.example.com 389  4
> unable to decode  {replica 16} 55356472000300100000 55356472000300100000
> unable to decode  {replica 24} 554d53d3000000180000 554d54a4000200180000
> dc5-ipa1.example.com 389  26
> dc5-ipa2.example.com 389  15
> dc5-ipa3.example.com 389  17
>
>
> Happy Wednesday
> ~Janelle

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/freeipa-users/attachments/20150520/f593cc5b/attachment.htm>


More information about the Freeipa-users mailing list