<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 12/12/2014 02:00 PM, Martin Kosek
wrote:<br>
</div>
<blockquote cite="mid:548AE6EA.4030609@redhat.com" type="cite">On
12/11/2014 06:19 PM, Matt Chesler wrote:
<br>
<blockquote type="cite">I have a cluster of four IPA masters that
should be performing fully meshed
<br>
replication. I discovered yesterday that a recently created
user only existed
<br>
on a single master. After looking through all four masters, it
appears that
<br>
several recent updates only exist on one of the masters. I do
not see any
<br>
replication errors in any of the logs, but I'm not 100% sure how
far back this
<br>
issue goes.
<br>
</blockquote>
<br>
That's really strange, because AFAIK, DS replication module yells
periodically if it cannot replicate so you should see it on the
last errors log page.
<br>
</blockquote>
<br>
That should not occur. I remember a test case
(<a class="moz-txt-link-freetext" href="https://fedorahosted.org/389/ticket/47788">https://fedorahosted.org/389/ticket/47788</a>) where a transient error
could conduct to an update being skipped.<br>
Do you have access/errors logs since the missing entry was added. <br>
Also would you dump the RUV on each of the masters (ldapsearch -D
"cn=directory manager" -w xxx -b "<your suffix>"<font
size="-1">(&(objectclass=nstombstone)(nsUniqueId=ffffffff-ffffffff-ffffffff-ffffffff))"
<br>
<font size="-1"><br>
<font size="-1"><font size="-1"><font size="-1"><font size="-1"><font
size="-1">A<font size="-1">re you able to reproduce
this problem ?</font></font></font></font></font></font></font></font><br>
<blockquote cite="mid:548AE6EA.4030609@redhat.com" type="cite">
<br>
<blockquote type="cite">I do believe the one master with
up-to-date data is a reliable
<br>
representation of what the LDAP directory should look like. I
ran a
<br>
reinitialize command (ipa-replica-manage re-initialize --from
<br>
reliable-server.fqdn) on two of the out-of-date masters
yesterday around 4pm
<br>
EST. It's now a little after 12pm EST and the "Update in
progress" message is
<br>
still scrolling by once a second on both terminals. I'd greatly
appreciate
<br>
suggestions about a) how to determine the status of the
reinitialize command
<br>
and b) any other ideas about how to resolve this issue and
monitor for it
<br>
better in the future. Thanks in advance for your help!
<br>
</blockquote>
<br>
Thierry or Ludwig, any idea?
<br>
</blockquote>
<br>
The replica agreement on the master should say when the total update
is completed. But after 12h it looks very long.<br>
You may monitor the number of sent entries (grep -c
'2.16.840.1.113730.3.5.6' <replica log>/access) to see if it
progressing.<br>
If it is not progressing for several minutes, would you get a pstack
of the master .<br>
<br>
</body>
</html>