Re: 389 crash libdb: PANIC: fatal region error detected

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 08/02/2013 01:30 AM, Manel Gimeno Zaragozá wrote:
Hello,

Yesterday afternoon my LDAP server crashed without doing any modification, just consulting.

My environment is an openvz container:

# cat /etc/issue
CentOS release 6.4 (Final)
Kernel \r on an \m

# uname -a
Linux newldap.test.es 2.6.32-042stab053.5 #1 SMP Tue Mar 27 11:42:17 MSD 2012 x86_64 x86_64 x86_64 GNU/Linux

# rpm -qa | grep 389
389-console-1.1.7-1.el6.noarch
389-admin-1.1.29-1.el6.x86_64
389-admin-console-1.1.8-1.el6.noarch
389-ds-base-1.2.11.15-14.el6_4.x86_64
389-ds-console-1.2.6-1.el6.noarch
389-ds-base-libs-1.2.11.15-14.el6_4.x86_64
389-ds-base-devel-1.2.11.15-14.el6_4.x86_64
389-adminutil-1.1.15-1.el6.x86_64


Please find below the log when it crashes:

errors:10:[01/Aug/2013:14:09:44 +0200] configure_pr_socket - Unable to move socket file descriptor 62 above 64: OS error 24 (Too many open files)
errors:11:[01/Aug/2013:14:09:44 +0200] configure_pr_socket - Unable to move socket file descriptor 63 above 64: OS error 24 (Too many open files)
errors:12:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
errors:13:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
errors:14:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
...
[01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
[01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
[01/Aug/2013:14:10:01 +0200] - libdb: PANIC: fatal region error detected; run recovery
[01/Aug/2013:14:10:01 +0200] - Serious Error---Failed in deadlock detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run database recovery)
[01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
[01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable Runtime error -5971 (Process open FD table is full.)
...
[01/Aug/2013:14:12:27 +0200] - libdb: PANIC: fatal region error detected; run recovery
[01/Aug/2013:14:12:27 +0200] - Serious Error---Failed in deadlock detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run database recovery)
[01/Aug/2013:14:12:28 +0200] - libdb: PANIC: fatal region error detected; run recovery
[01/Aug/2013:14:12:28 +0200] - Serious Error---Failed in deadlock detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run database recovery)
[01/Aug/2013:14:12:28 +0200] - libdb: PANIC: fatal region error detected; run recovery
[01/Aug/2013:14:12:28 +0200] - Serious Error---Failed to checkpoint database, err=-30974 (DB_RUNRECOVERY: Fatal error, run database recovery)

when I noticed that the ldap had crashed, I tried to restart the process and the following errors show up:

[01/Aug/2013:14:20:15 +0200] - 389-Directory/1.2.11.15 B2013.105.2259 starting up
[01/Aug/2013:14:20:15 +0200] - WARNING: userRoot: entry cache size 10485760B is less than db size 14729216B; We recommend to increase the entry cache size nsslapd-cachememsize.
[01/Aug/2013:14:20:15 +0200] - libdb: PANIC: fatal region error detected; run recovery
[01/Aug/2013:14:20:15 +0200] - Opening database environment (/var/lib/dirsrv/slapd-ldap_kolab/db) failed. err=-30974: DB_RUNRECOVERY: Fatal error, run database recovery
[01/Aug/2013:14:20:15 +0200] - start: Failed to init database, err=-30974 DB_RUNRECOVERY: Fatal error, run database recovery
[01/Aug/2013:14:20:15 +0200] - Failed to start database plugin ldbm database
[01/Aug/2013:14:20:15 +0200] - WARNING: ldbm instance userRoot already exists
[01/Aug/2013:14:20:15 +0200] - ldbm_config_read_instance_entries: failed to add instance entry cn=userRoot,cn=ldbm database,cn=plugins,cn=config
[01/Aug/2013:14:20:15 +0200] - ldbm_config_load_dse_info: failed to read instance entries
[01/Aug/2013:14:20:15 +0200] - start: Loading database configuration failed
[01/Aug/2013:14:20:15 +0200] - Failed to start database plugin ldbm database
[01/Aug/2013:14:20:15 +0200] - Error: Failed to resolve plugin dependencies
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin 7-bit check is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Account Usability Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: accesscontrol plugin ACL Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin ACL preoperation is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Auto Membership Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: object plugin Class of Service is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin deref is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin HTTP Client is not started
[01/Aug/2013:14:20:15 +0200] - Error: database plugin ldbm database is not started
[01/Aug/2013:14:20:15 +0200] - Error: object plugin Legacy Replication Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Linked Attributes is not started
[01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Managed Entries is not started
[01/Aug/2013:14:20:15 +0200] - Error: object plugin Multimaster Replication Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: object plugin Roles Plugin is not started
[01/Aug/2013:14:20:15 +0200] - Error: object plugin Views is not started

Can any one help me?

Looks like it did not recovery gracefully from running out of file descriptors.  You should increase the number of file descriptors.
http://port389.org/wiki/Performance_Tuning#Linux


Thanks for your help

Manel


--
389 users mailing list
389-users@xxxxxxxxxxxxxxxxxxxxxxx
https://admin.fedoraproject.org/mailman/listinfo/389-users

--
389 users mailing list
389-users@xxxxxxxxxxxxxxxxxxxxxxx
https://admin.fedoraproject.org/mailman/listinfo/389-users

[Index of Archives]     [Fedora User Discussion]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [Fedora News]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Maintainers]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Legacy]     [Fedora Desktop]     [Fedora Fonts]     [ATA RAID]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Centos]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora QA]     [Fedora Triage]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Tux]     [Yosemite News]     [Yosemite Photos]     [Linux Apps]     [Maemo Users]     [Gnome Users]     [KDE Users]     [Fedora Tools]     [Fedora Art]     [Fedora Docs]     [Maemo Users]     [Asterisk PBX]     [Fedora Sparc]     [Fedora Universal Network Connector]     [Fedora ARM]

  Powered by Linux