389-Directory/1.2.11.15 B2013.238.2155 2 MMR master servers (this hang happened on one of the masters) along with 3 read-only replicas. Linux XXXX 2.6.32-358.18.1.el6.x86_64 #1 SMP Fri Aug 2 17:04:38 EDT 2013 x86_64 x86_64 x86_64 GNU/Linux 389-admin.x86_64 1.1.29-1.el6 @epel-x86_64-server-6 389-admin-console.noarch 1.1.8-1.el6 @epel-x86_64-server-6 389-admin-console-doc.noarch 1.1.8-1.el6 @epel-x86_64-server-6 389-adminutil.x86_64 1.1.15-1.el6 installed 389-console.noarch 1.1.7-3.el5 installed 389-ds.noarch 1.2.2-1.el6 @epel-x86_64-server-6 389-ds-base.x86_64 1.2.11.15-22.el6_4 389-ds-base-debuginfo.x86_64 1.2.11.15-22.el6_4 389-ds-base-libs.x86_64 1.2.11.15-22.el6_4 389-ds-console.noarch 1.2.6-1.el6 @epel-x86_64-server-6 389-ds-console-doc.noarch 1.2.6-1.el6 @epel-x86_64-server-6 389-dsgw.x86_64 1.1.10-1.el6 @epel-x86_64-server-6 389-admin.i686 1.1.29-1.el6 epel-x86_64-server-6 389-adminutil.i686 1.1.15-1.el6 epel-x86_64-server-6 389-adminutil-devel.i686 1.1.15-1.el6 epel-x86_64-server-6 389-adminutil-devel.x86_64 1.1.15-1.el6 epel-x86_64-server-6 389-ds-base.x86_64 1.2.11.25-1.el6 389_rhel6_x86_64 389-ds-base-debuginfo.i686 1.2.11.15-30.el6_5 389-ds-base-debuginfo.x86_64 1.2.11.25-1.el6 389_rhel6_x86_64 389-ds-base-devel.i686 1.2.11.15-30.el6_5 389-ds-base-devel.x86_64 1.2.11.25-1.el6 389_rhel6_x86_64 389-ds-base-libs.i686 1.2.11.15-30.el6_5 389-ds-base-libs.x86_64 1.2.11.25-1.el6 389_rhel6_x86_64 nothing in the error log. non-buffered access log: last 2 seconds [11/Dec/2013:00:07:07 -0500] conn=104961 fd=84 slot=84 connection from 172.19.224.96 to XX [11/Dec/2013:00:07:07 -0500] conn=104961 op=0 BIND dn="" method=128 version=2 [11/Dec/2013:00:07:07 -0500] conn=104961 op=0 RESULT err=0 tag=97 nentries=0 etime=0 dn="" [11/Dec/2013:00:07:07 -0500] conn=104961 op=1 SRCH base="dc=cmu,dc=edu" scope=0 filter="(objectClass=*)" attrs=ALL [11/Dec/2013:00:07:07 -0500] conn=104961 op=1 RESULT err=0 tag=101 nentries=1 etime=0 [11/Dec/2013:00:07:07 -0500] conn=104961 op=2 UNBIND [11/Dec/2013:00:07:07 -0500] conn=104961 op=2 fd=84 closed - U1 [11/Dec/2013:00:07:07 -0500] conn=104962 fd=85 slot=85 SSL connection from 172.19.224.96 to XX [11/Dec/2013:00:07:07 -0500] conn=104963 fd=84 slot=84 SSL connection from 172.19.224.96 to XX [11/Dec/2013:00:07:07 -0500] conn=104962 SSL 256-bit AES [11/Dec/2013:00:07:07 -0500] conn=104962 op=-1 fd=85 closed - B1 [11/Dec/2013:00:07:07 -0500] conn=104963 SSL 256-bit AES [11/Dec/2013:00:07:07 -0500] conn=104963 op=0 BIND dn="uid=zenoss,ou=Account,dc=andrew,dc=cmu,dc=edu" method=128 version=2 [11/Dec/2013:00:07:08 -0500] conn=104963 op=0 RESULT err=0 tag=97 nentries=0 etime=1 dn="uid=zenoss,ou=Account,dc=andrew,dc=cmu,dc=edu” stack trace of threads apply all bt full attached. I have the core file so am happy to do any additional gdb poking. there is no SASL activity (given our previous reports) so this is something else. We have to kill -9 and then start the server to get it back up and running. Happy to make a bug report if that is more appropriate. I wanted to move to 1.2.11.26 but could not given the unprotected cert db problem. |
Attachment:
core.31571.trace.txt.gz
Description: GNU Zip compressed data
-- 389 users mailing list 389-users@xxxxxxxxxxxxxxxxxxxxxxx https://admin.fedoraproject.org/mailman/listinfo/389-users