[Freeipa-users] General status of my FreeIPA servers - is there a method for cleaning them?

Rich Megginson rmeggins at redhat.com
Fri Apr 13 18:38:02 UTC 2012


On 04/13/2012 12:22 PM, Dan Scott wrote:
> On Fri, Apr 13, 2012 at 13:43, Rich Megginson<rmeggins at redhat.com>  wrote:
>> On 04/13/2012 11:39 AM, Dan Scott wrote:
>>> I'm convinced that my LDAP directories contain lots of cruft which has
>>> built up and is causing problems on my system. There may even be some
>>> corruption since there's an entry which I'm unable to remove - this
>>> entry does not get replicated to the other servers.
>>
>> What version of 389-ds-base is this?  Do you get any errors?  It just
>> silently fails to delete this particular entry?
> [root at fileserver1 ~]# rpm -qa|grep 389
> 389-ds-base-libs-1.2.10.4-2.fc16.x86_64
> 389-ds-base-1.2.10.4-2.fc16.x86_64
> [root at fileserver1 ~]#ldapmodify -f rmfileserver5.ldif -D 'cn=directory
> manager' -W
> Enter LDAP Password:
> deleting entry "cn=fileserver5.ecg.mit.edu,cn=masters,cn=ipa,cn=etc,dc=ecg,dc=mit,dc=edu"
> ldap_delete: Operation not allowed on non-leaf (66)
>
> [root at fileserver1 ~]# ldapsearch -D 'cn=directory manager' -W -v -b
> 'cn=fileserver5.ecg.mit.edu,cn=masters,cn=ipa,cn=etc,dc=ecg,dc=mit,dc=edu'
> '(objectclass=*)'
> ldap_initialize(<DEFAULT>  )
> Enter LDAP Password:
> filter: (objectclass=*)
> requesting: All userApplication attributes
> # extended LDIF
> #
> # LDAPv3
> # base<cn=fileserver5.ecg.mit.edu,cn=masters,cn=ipa,cn=etc,dc=ecg,dc=mit,dc=edu>
> with scope subtree
> # filter: (objectclass=*)
> # requesting: ALL
> #
>
> # fileserver5.ecg.mit.edu, masters, ipa, etc, ecg.mit.edu
> dn: cn=fileserver5.ecg.mit.edu,cn=masters,cn=ipa,cn=etc,dc=ecg,dc=mit,dc=edu
> cn: fileserver5.ecg.mit.edu
> objectClass: top
> objectClass: nsContainer
>
> # search result
> search: 2
> result: 0 Success
>
> # numResponses: 2
> # numEntries: 1
> [root at fileserver1 ~]#
>
> If I'm interpreting this correctly, it can't be deleted because it's
> not a leaf node, but it doesn't have any sub-entries that I can delete
> first.

You are correct.  Try this:
ldapsearch -D 'cn=directory manager' -W -v -b
'cn=fileserver5.ecg.mit.edu,cn=masters,cn=ipa,cn=etc,dc=ecg,dc=mit,dc=edu'
'(|(objectclass=nstombstone)(objectclass=*))'


>
>>> I also see
>>> inconsistent replication states on the servers. i.e. server1 shows
>>> that it's replicating with server2 but server2 does not show that it's
>>> replicating with server1.
>>
>> Do you have errors in the server2 log showing that it is attempting to
>> replicate with server1 but failing with some error?
> [root at fileserver1 ~]# ipa-csreplica-manage list -v fileserver1.ecg.mit.edu
> Directory Manager password:
>
> fileserver2.ecg.mit.edu
>    last init status: None
>    last init ended: None
>    last update status: 0 Replica acquired successfully: Incremental
> update succeeded
>    last update ended: 2012-04-13 17:57:39+00:00
> [root at fileserver1 ~]# ipa-csreplica-manage list -v fileserver2.ecg.mit.edu
> Directory Manager password:
>
> fileserver1.ecg.mit.edu
>    last init status: None
>    last init ended: None
>    last update status: 0 Replica acquired successfully: Incremental
> update succeeded
>    last update ended: 2012-04-13 17:57:41+00:00
> fileserver3.ecg.mit.edu
>    last init status: None
>    last init ended: None
>    last update status: 0 Replica acquired successfully: Incremental
> update succeeded
>    last update ended: 2012-04-13 17:57:41+00:00
> [root at fileserver1 ~]# ipa-csreplica-manage list -v fileserver3.ecg.mit.edu
> Directory Manager password:
>
> fileserver2.ecg.mit.edu
>    last init status: None
>    last init ended: None
>    last update status: 0 Replica acquired successfully: Incremental
> update succeeded
>    last update ended: 2012-04-13 17:57:44+00:00
> fileserver1.ecg.mit.edu
>    last init status: None
>    last init ended: None
>    last update status: 0 Replica acquired successfully: Incremental
> update succeeded
>    last update ended: 2012-04-13 17:57:43+00:00
> [root at fileserver1 ~]#
>
> fileserver1's (and fileserver2s) /var/log/dirsrv/slapd-PKI-IPA/errors
> contains lots of:
> [13/Apr/2012:13:57:43 -0400] NSMMReplicationPlugin -
> repl_set_mtn_referrals: could not set referrals for replica o=ipaca:
> 20

This error usually means a replica was deleted and the RUV needs to be 
cleaned.
see http://port389.org/wiki/Howto:CLEANRUV
and
https://fedorahosted.org/freeipa/ticket/2303
and
https://fedorahosted.org/389/ticket/337

>
> fileserver3's /var/log/dirsrv/slapd-PKI-IPA/errors contains lots of:
> [13/Apr/2012:13:52:50 -0400] slapi_ldap_bind - Error: could not send
> startTLS request: error -1 (Can't contact LDAP server) errno 107
> (Transport endpoint is not connected)

This is a real connection error - could be cert or hostname lookup related.

> [13/Apr/2012:13:57:39 -0400] NSMMReplicationPlugin -
> repl_set_mtn_referrals: could not set referrals for replica o=ipaca:
> 20
>
> fileserver2's non-PKI replication agreements to both fileserver1 and 3
> are in place, but both say: Incremental update has failed and requires
> administrator actionSystem error.


> When I try to re-initialize:
>
> [root at fileserver2 ~]# ipa-replica-manage re-initialize --from
> fileserver3.ecg.mit.edu
> Directory Manager password:
>
> [fileserver3.ecg.mit.edu] reports: Replica Busy! Status: [1
> Replication error acquiring replica: replica busy]

This is a transient condition.

>
> this command has been running for 1/2hr and produced no more output
> (fileserver2 is the remaining server running Fedora 15, the others are
> Fedora 16 with latest updates).

Not sure how ipa-replica-manage handles busy - does it keep trying until 
it is not busy?

>
>>> Is there some way that I can refresh/clean my LDAP directories and
>>> ensure that everything's running correctly.
>> We first need to find out what's going on and why you are seeing these
>> failures before we can recommend a particular course of action.  There is
>> currently no "find all of my problems and fix them" command.
> :) Wish there was. It's just that I've been having lots of problems
> recently and I was thinking that there is something fundamentally
> wrong with my installation. I keep having to ask you guys for help.

I think some of these problems were due to the fact that an alpha 
version of 389 got pushed to the Stable repo in F-16, and in between 
that alpha version and the real "Stable" version we were forced to 
change the database format to fix a serious issue, and that introduced 
some inconsistencies into the database upon upgrade.

>
> An additional problem, which Rob Crittenden is helping with is that
> I'm trying to install another replica (fileserver4) which fails when
> setting up the CA:
>
> 2012-04-11 11:30:47,289 CRITICAL failed to configure ca instance
> Command '/usr/bin/perl /usr/bin/pkisilent 'ConfigureCA' '-cs_hostname'
> 'fileserver4.ecg.mit.edu' '-cs_port' '9445' '-client_certdb_dir'
> '/tmp/tmp-JJIkrk' '-client_certdb_pwd' XXXXXXXX '-preop_pin'
> 'LI1En8UwjZ2BYDcnu8nJ' '-domain_name' 'IPA' '-admin_user' 'admin'
> '-admin_email' 'root at localhost' '-admin_password' XXXXXXXX
> '-agent_name' 'ipa-ca-agent' '-agent_key_size' '2048'
> '-agent_key_type' 'rsa' '-agent_cert_subject'
> 'CN=ipa-ca-agent,O=ECG.MIT.EDU' '-ldap_host' 'fileserver4.ecg.mit.edu'
> '-ldap_port' '7389' '-bind_dn' 'cn=Directory Manager' '-bind_password'
> XXXXXXXX '-base_dn' 'o=ipaca' '-db_name' 'ipaca' '-key_size' '2048'
> '-key_type' 'rsa' '-key_algorithm' 'SHA256withRSA' '-save_p12' 'true'
> '-backup_pwd' XXXXXXXX '-subsystem_name' 'pki-cad' '-token_name'
> 'internal' '-ca_subsystem_cert_subject_name' 'CN=CA
> Subsystem,O=ECG.MIT.EDU' '-ca_ocsp_cert_subject_name' 'CN=OCSP
> Subsystem,O=ECG.MIT.EDU' '-ca_server_cert_subject_name'
> 'CN=fileserver4.ecg.mit.edu,O=ECG.MIT.EDU'
> '-ca_audit_signing_cert_subject_name' 'CN=CA Audit,O=ECG.MIT.EDU'
> '-ca_sign_cert_subject_name' 'CN=Certificate Authority,O=ECG.MIT.EDU'
> '-external' 'false' '-clone' 'true' '-clone_p12_file' 'ca.p12'
> '-clone_p12_password' XXXXXXXX '-sd_hostname'
> 'fileserver3.ecg.mit.edu' '-sd_admin_port' '443' '-sd_admin_name'
> 'admin' '-sd_admin_password' XXXXXXXX '-clone_start_tls' 'true'
> '-clone_uri' 'https://fileserver3.ecg.mit.edu:443'' returned non-zero
> exit status 255
>
> Sorry to dump a tonne of problems in one go, but you can see why I
> think there's something (probably several things) badly wrong with my
> installation. I guess I was looking for a few very basic things to
> check to ensure that the servers are fundamentally configured
> properly.

Unfortunately, it appears that some of your problems are unexpected 
and/or have not been seen before.

>
> Thanks,
>
> Dan Scott




More information about the Freeipa-users mailing list