<html> <head> <meta content="text/html; charset=UTF-8" http-equiv="Content-Type"> </head> <body bgcolor="#FFFFFF" text="#000000"> <div class="moz-cite-prefix">On 06/12/2015 03:27 PM, Simo Sorce wrote:<br> </div> <blockquote cite="mid:214960863.1166110.1434115672513.JavaMail.zimbra@redhat.com" type="cite"> <pre wrap="">----- Original Message ----- </pre> <blockquote type="cite"> <pre wrap="">From: "Petr Spacek" <a class="moz-txt-link-rfc2396E" href="mailto:pspacek@redhat.com"><pspacek@redhat.com></a> To: "Simo Sorce" <a class="moz-txt-link-rfc2396E" href="mailto:simo@redhat.com"><simo@redhat.com></a> Cc: "freeipa-devel" <a class="moz-txt-link-rfc2396E" href="mailto:freeipa-devel@redhat.com"><freeipa-devel@redhat.com></a>, "Tomas Capek" <a class="moz-txt-link-rfc2396E" href="mailto:tcapek@redhat.com"><tcapek@redhat.com></a>, "Ludwig Krispenz" <a class="moz-txt-link-rfc2396E" href="mailto:lkrispen@redhat.com"><lkrispen@redhat.com></a>, "Thierry Bordaz" <a class="moz-txt-link-rfc2396E" href="mailto:tbordaz@redhat.com"><tbordaz@redhat.com></a> Sent: Friday, June 12, 2015 5:09:08 AM Subject: Re: [Freeipa-devel] DNA range distribution to replicas by default On 11.6.2015 16:11, Simo Sorce wrote: </pre> <blockquote type="cite"> <pre wrap="">On Thu, 2015-06-11 at 12:38 +0200, Petr Spacek wrote: </pre> <blockquote type="cite"> <pre wrap="">On 9.6.2015 15:06, Simo Sorce wrote: </pre> <blockquote type="cite"> <pre wrap="">On Tue, 2015-06-09 at 10:30 +0200, Petr Spacek wrote: </pre> <blockquote type="cite"> <pre wrap="">Hello, I would like to discuss <a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=1211366">https://bugzilla.redhat.com/show_bug.cgi?id=1211366</a> "Error creating a user when jumping from an original server to replica". Currently the DNA ranges are distributed from master to other replicas on first attempt to get a number from particular range. This works well as long as the original master is reachable but fails miserably when the master is not reachable for any reason. It is apparently confusing to users [1][2] because it is counter-intuitive. They have created a replica to be sure that everything will work when the first server is down, right? Remediation is technically simple [3] (just assign a range to the new replica) but it is confusing to the users, error-prone, and personally I feel that this is an unnecessary obstacle. It seems to me that the original motivation for this behavior was that the masters were not able to request range back from other replicas when a local range was depleted. This deficiency is tracked as <a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=1029640">https://bugzilla.redhat.com/show_bug.cgi?id=1029640</a> and it is slated for fix in 4.2.x time frame. Can we distribute ranges to the replicas during ipa-replica-install when we fix bug 1029640? </pre> </blockquote> <pre wrap=""> That was not the only reason, another reason is that you do not want to distribute and fragment ranges to replicas that will never be used to create users. What we should do perhaps, is to automatically give a range to CA enabled masters so that at least those servers have a range. If all your CAs are unavailable you have major issues anyway. Though it is a bit bad to have magic behaviors, maybe we should have a "main DNA range holder" role that can be assigned to arbitrary servers (maybe the first replica gets it by default), and when done the server acquire part of the range if it has none. </pre> </blockquote> <pre wrap=""> This concept sounds good to me! I would only reverse the default, i.e. distribute ranges by default to all replicas and let admin to toggle a knob if he feels that his case really needs to limit range distribution. </pre> </blockquote> <pre wrap=""> By the time you *feel* that it may be too late. </pre> <blockquote type="cite"> <blockquote type="cite"> <pre wrap="">Another option is that a replica can instantiate a whole new range if all the range bearing servers are not around, but that also comes with its own issues. In general I wouldn't want to split by default, because in domains with *many* replicas most of them are used for load balancing and will never be used to create users, so the range would be wasted. </pre> </blockquote> <pre wrap=""> This should not be an issue when <a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=1029640">https://bugzilla.redhat.com/show_bug.cgi?id=1029640</a> is fixed because replicas will be able to request range back if the local chunk is depleted. Is that correct? </pre> </blockquote> <pre wrap=""> To some degree, the main issue is when replicas get removed abruptly and are not around to "give back" anything. We would need to start working on a range-scavenging tool to reclaim "lost" ranges if you go and automatically distribute ranges to every replica that ever pops up. </pre> </blockquote> <pre wrap=""> Okay, I understand that. I can't help myself but it seems to me that this problem is inherent to current design and can always happen because the range information is local to the replica. As a result, if the replica with a range disappears we always need to do some sort of manual recovery to get the free numbers back. Consequently, lowering number of replicas with ranges just makes the problem less common but does not eliminate it. Let's look at: cn=posix-ids,cn=dna,cn=ipa,cn=etc,dc=ipa,dc=example It seems that we already have information which replicas have free values in the shared tree - this is good, but not sufficient to eliminate the problem. The information about range start/end and the next free value is missing in the shared tree and is stored only in cn=config on particular replica. It seems to me that adding this range start/end values to the shared tree would help because the information about the range would be preserved even if the replica was deleted/lost. Apparently the attribute dnaRemainingValues in the shared tree is updated after each number allocation so adding the next free value (to a new attribute) to the shared tree would not add any significant replication churn because the object needs to be updated anyway. What did I miss? </pre> </blockquote> <pre wrap=""> We could publish the range there I guess. But I'd rather keep the counters local and update the "available" values only every 100 or so. This is to reduce the number of replication messages going out. Even if you do not know the exact starting point that is not a huge deal as DNA checks that an ID is free before assigning it anyways. Simo. </pre> </blockquote> <font face="Times New Roman, Times, serif">About the ranges, each replica has a unique replicaID, the selection of the ranges could use this replicaID for most significant digit.<br> Publishing the ranges to the shared tree looks good but what is benefit of publishing dnaRemainingValues (either the exact value or sample) ?<br> Who is consuming it ?<br> <br> thanks<br> thierry<br> </font> </body> </html>