<font face="arial" size="2"><p style="margin:0;padding:0;font-family: arial; font-size: 10pt;">The only thing I see that could be related is:</p>
<p style="margin:0;padding:0;"><span style="font-family: arial; font-size: 10pt;">Jan 21 10:31:05 freeipa2 named[20660]: LDAP query timed out. Try to adjust "timeout" parameter</span></p>
<div style="font-family: arial; font-size: 10pt;">and then the message:</div>
<div><span style="font-family: arial; font-size: 10pt;">Jan 21 10:31:05 freeipa2 named[20660]:update_zone (psearch) failed for 'idnsname=example.com,cn=dns,dc=example,dc=com'. Zones can be outdated, run `rndc reload`: timed out</span></div>
<div></div>
<div><span style="font-family: arial; font-size: 10pt;">However in errors/access log for that 389 instance, I do not see anything around that time.</span></div>
<div></div>
<div><span style="font-family: arial; font-size: 10pt;">When this happens again I will do what you suggested below (already have the debug packages installed) and will email you. Thanks a TON for your help on this!</span></div>
<div><span style="font-family: arial; font-size: 10pt;"><br /></span></div>
<div></div>
<p style="margin:0;padding:0;font-family: arial; font-size: 10pt;"> </p>
<p style="margin:0;padding:0;font-family: arial; font-size: 10pt;">-----Original Message-----<br />From: "Petr Spacek" <pspacek@redhat.com><br />Sent: Tuesday, January 21, 2014 10:29am<br />To: andrew.tranquada@mailtrust.com, freeipa-users@redhat.com<br />Subject: Re: [Freeipa-users] named unresponsive at seemingly random times<br /><br /></p>
<div id="SafeStyles1390324304" style="font-family: arial; font-size: 10pt;">
<p style="margin:0;padding:0;">On 19.1.2014 03:38, andrew.tranquada@mailtrust.com wrote:<br />> It seems to be at random and on different servers, but I will see the following in named.run:<br />><br />> update_zone (psearch) failed for 'idnsname=example.com,cn=dns,dc=example,dc=com'. Zones can be outdated, run `rndc reload`: bad zone<br />This typically mean that your zone is missing NS or glue records. Did you do <br />some changes in the zone at time when the message appeared?<br /><br />Do you see any errors related to connection between LDAP server and named? <br />Look carefully to /var/log/messages for any other messages from named.<br /><br />> When I see this, I cannot do any dns lookup for records in example.com. In addition, named will not restart, I have to manually kill it and then start it again. Once it is restarted, everything is fine, I can lookup records again.<br />This is really weird. Could you capture stacks at the time when the problem <br />manifests?<br /><br />You can use following commands:<br />$ yum install gdb<br />$ debuginfo-install bind bind-dyndb-ldap<br />$ gdb -ex 'set confirm off' -ex 'set pagination off' -ex 'thread apply all bt <br />full' -ex 'quit' `which named` `pgrep named` > stacktrace.`date +%s`.log 2>&1<br /><br />Please send the stracktrace file to this list of privately to me and I will <br />look into it.<br /><br />Have a nice day!<br /><br />Petr^2 Spacek<br /><br />> I am looking for suggestions on troubleshooting or if anyone has seen this before and found a resolution.<br />><br />> I am running Centos 6.5:<br />> 389-ds-base-1.2.11.15-30<br />> bind-dyndb-ldap-2.3-5<br />> bind-libs-9.8.2-0.17.rc1<br />> bind-utils-9.8.2-0.17.rc1<br />><br />> bind-9.8.2-0.17.rc1<br /><br /></p>
</div></font>