Re: largish member changes causing problems

"Michael R. Gettes" <gettes@xxxxxxxxx> · Tue, 27 Mar 2012 21:35:29 -0400

output is below.  I am starting to believe how my systems are administered is getting in the way.  So I don't spend too much time bothering you - is there some place I can find (or can you point me) to what programs should have run to massage the DBs so I might get back in biz and then I am going to have a bit of a chat with my sysadmins regarding our admin practices.  I hate bothering you with this stuff.  Or, if i simply remove 389 and re-install 1.2.9.9 will I be back in biz?  these are not critical systems (test environment), but I need to get things working again so I can have a chat with my admins (show them how things break).  Have I mentioned how grateful I am for your help?

[root@ldap-master-t02 slapd-cmu]# dbscan -f /var/lib/dirsrv/slapd-cmu/db/userRoot/entryrdn.db4 | head
100000:guid=00000000-0000-1000-24ae-0800207f02e6
  ID: 100000; RDN: "guid=00000000-0000-1000-24AE-0800207F02E6"; NRDN: "guid=00000000-0000-1000-24ae-0800207f02e6"
100001:guid=7ff2dbbc-7623-11d7-8000-080020cc75d3
  ID: 100001; RDN: "guid=7FF2DBBC-7623-11D7-8000-080020CC75D3"; NRDN: "guid=7ff2dbbc-7623-11d7-8000-080020cc75d3"
100002:guid=f79ff0d0-6416-11d5-8000-000080020af0
  ID: 100002; RDN: "guid=F79FF0D0-6416-11D5-8000-000080020AF0"; NRDN: "guid=f79ff0d0-6416-11d5-8000-000080020af0"
100003:guid=00000000-0000-1000-3571-0800207f02e6
  ID: 100003; RDN: "guid=00000000-0000-1000-3571-0800207F02E6"; NRDN: "guid=00000000-0000-1000-3571-0800207f02e6"
100004:guid=00000000-0000-1000-6366-0800207f02e6
  ID: 100004; RDN: "guid=00000000-0000-1000-6366-0800207F02E6"; NRDN: "guid=00000000-0000-1000-6366-0800207f02e6"
[root@ldap-master-t02 slapd-cmu]# dbscan -f /var/lib/dirsrv/slapd-cmu/db/userRoot/entryrdn.db4 | tail
P99999:guid=329e90ce-494b-11d5-8000-080020cc75d3
  ID: 21; RDN: "ou=person"; NRDN: "ou=person"
P9999:uid=jw6a
  ID: 36; RDN: "ou=account"; NRDN: "ou=account"
P999:uid=toohey
  ID: 36; RDN: "ou=account"; NRDN: "ou=account"
P99:ou=art
  ID: 63; RDN: "o=CFA - COLLEGE OF FINE ARTS"; NRDN: "o=cfa - college of fine arts"
dc=cmu,dc=edu
  ID: 1; RDN: "dc=cmu,dc=edu"; NRDN: "dc=cmu,dc=edu"

On Mar 27, 2012, at 21:25, Rich Megginson wrote:

> On 03/27/2012 07:14 PM, Michael R. Gettes wrote:
>> earlier in the log (sorry, i didn't look there) I see
>> 
>> [27/Mar/2012:20:19:50 -0400] - check_and_set_import_cache: pagesize: 4096, pages: 256511, procpages: 55220
>> [27/Mar/2012:20:19:50 -0400] - WARNING: After allocating import cache 410416KB, the available memory is 615628KB, which is less than the soft limit 1048576KB. You may want to decrease the import cache size and rerun import.
>> [27/Mar/2012:20:19:50 -0400] - Import allocates 410416KB import cache.
>> [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - changelog: Start upgrade dn format.
>> [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - Instance changelog in /var/lib/dirsrv/slapd-cmu/db/changelog is up-to-date
>> [27/Mar/2012:20:19:50 -0400] - check_and_set_import_cache: pagesize: 4096, pages: 256511, procpages: 55220
>> [27/Mar/2012:20:19:50 -0400] - WARNING: After allocating import cache 410416KB, the available memory is 615628KB, which is less than the soft limit 1048576KB. You may want to decrease the import cache size and rerun import.
>> [27/Mar/2012:20:19:50 -0400] - Import allocates 410416KB import cache.
>> [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - NetscapeRoot: Start upgrade dn format.
>> [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - Instance NetscapeRoot in /var/lib/dirsrv/slapd-cmu/db/NetscapeRoot is up-to-date
>> [27/Mar/2012:20:19:50 -0400] - check_and_set_import_cache: pagesize: 4096, pages: 256511, procpages: 55219[27/Mar/2012:20:19:50 -0400] - WARNING: After allocating import cache 410416KB, the available memory is 615628KB, which is less than the soft limit 1048576KB. You may want to decrease the import cache size and rerun import.
>> [27/Mar/2012:20:19:50 -0400] - Import allocates 410416KB import cache. [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - userRoot: Start upgrade dn format.
>> [27/Mar/2012:20:19:50 -0400] Upgrade DN Format - Instance userRoot in /var/lib/dirsrv/slapd-cmu/db/userRoot is up-to-date
>> 
>> and
>> 
>> find /var/lib/dirsrv -name DBVERSION -exec cat {} \;
>> bdb/4.3/libreplication-plugin
>> bdb/4.3/libback-ldbm/newidl/rdn-format-2/dn-4514
>> bdb/4.3/libback-ldbm/newidl/rdn-format-2/dn-4514
>> bdb/4.3/libback-ldbm/newidl/rdn-format-2/dn-4514
>> bdb/4.3/libback-ldbm/newidl/rdn-format-2/dn-4514
>> 
>> yes, i did upgrade from 1.2.9.9
> hmm - looks like the entryrdn conversion was missed - try this:
> dbscan -f
> 
> /var/lib/dirsrv/slapd-cmu/db/userRoot/entryrdn.db4 | head
> and
> /var/lib/dirsrv/slapd-cmu/db/userRoot/entryrdn.db4 | tail
> 
> 
>> 
>> /mrg
>> 
>> On Mar 27, 2012, at 21:05, Rich Megginson wrote:
>> 
>>> On 03/27/2012 06:58 PM, Michael R. Gettes wrote:
>>>> I have upgraded one of my masters to 1.2.10.3 and i see the following
>>>> 
>>>> [27/Mar/2012:20:25:04 -0400] - 389-Directory/1.2.10.3 B2012.065.2248 starting up
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - libdb: DB handle previously used in transaction, missing transaction handle
>>>> [27/Mar/2012:20:25:04 -0400] - slapd started.  Listening on All Interfaces port 389 for LDAP requests
>>>> [27/Mar/2012:20:25:04 -0400] - Listening on All Interfaces port 636 for LDAPS requests
>>>> [27/Mar/2012:20:30:04 -0400] NSMMReplicationPlugin - changelog program - cl5DBData2Entry: invalid data version
>>>> [27/Mar/2012:20:30:04 -0400] NSMMReplicationPlugin - changelog program - cl5DBData2Entry: invalid data version
>>>> 
>>>> is this serious?
>>>> I had to do an offline 'setup-ds-admin.pl -u' because i have everything as SSL and the online update doesn't seem to handle this case very well (i offer this info as i have no idea if it is relevant to the problem).
>>> What should have happened is that during yum/rpm upgrade of the 389-ds-base package, it should have upgraded the database to the latest version.  Did you upgrade from 1.2.9.9?
>>> 
>>> find /var/lib/dirsrv -name DBVERSION -exec cat {} \;
>>> 
>>> before
>>> 
>>> [27/Mar/2012:20:25:04 -0400] - 389-Directory/1.2.10.3 B2012.065.2248 starting up
>>> 
>>> you should have some messages about the database being upgraded
>>>> the only error in the setup was
>>>> 
>>>> [12/03/27:20:20:09] - [Setup] Warning Error: command 'getsebool httpd_can_connect_ldap' failed - output [getsebool:  SELinux is disabled] error []
>>> If you are really running with SELinux disabled, then this is just telling you that it couldn't perform some SELinux function because it is disabled, which is ok.
>>>> 389-admin.x86_64                    1.1.28-1.el5            installed
>>>> 389-admin-console.noarch            1.1.8-1.el5             installed
>>>> 389-admin-console-doc.noarch        1.1.8-1.el5             installed
>>>> 389-adminutil.x86_64                1.1.15-1.el5            installed
>>>> 389-console.noarch                  1.1.7-3.el5             installed
>>>> 389-ds.noarch                       1.2.1-1.el5             installed
>>>> 389-ds-base.x86_64                  1.2.10.3-1.el5          installed
>>>> 389-ds-base-libs.x86_64             1.2.10.3-1.el5          installed
>>>> 389-ds-console.noarch               1.2.6-1.el5             installed
>>>> 389-ds-console-doc.noarch           1.2.6-1.el5             installed
>>>> 389-dsgw.x86_64                     1.1.9-1.el5             installed
>>>> 
>>>> advice appreciated.
>>>> 
>>>> /mrg
>>>> 
>>>> On Mar 27, 2012, at 10:03, Rich Megginson wrote:
>>>> 
>>>>> On 03/27/2012 08:01 AM, Michael R. Gettes wrote:
>>>>>> I just checked and only 1.2.10.3-1.el5 is in the epel-testing repo
>>>>> 1.2.10.3 should be fine as long as you don't use compare operations on virtual attributes.
>>>>> I just pushed 1.2.10.4 to epel-testing - it should show up in the mirrors in a couple of days.
>>>>>> /mrg
>>>>>> 
>>>>>> On Mar 27, 2012, at 9:50, Michael R. Gettes wrote:
>>>>>> 
>>>>>>> I judge from your questions this is not a known problem.
>>>>>>> 
>>>>>>> /mrg
>>>>>>> 
>>>>>>> On Mar 27, 2012, at 9:17, Rich Megginson wrote:
>>>>>>> 
>>>>>>>> On 03/26/2012 08:25 PM, Michael R. Gettes wrote:
>>>>>>>>> I am a little perplexed.
>>>>>>>>> 
>>>>>>>>> I am making a change to a groupOfNames object having some 16069 member attributes.  I am deleting nearly 16000 members and then adding nearly 16000 members.  CPU goes to 100% and never comes down.  I have plenty of memory allocated (700MB) to nss-slapd and I have made the adjustments to allow for large objects (maxbersize).  I end up having to kill -9 slapd.  the annoying thing is some times it works, some times it doesn't.  I can't seem to find any common conditions of the failures (or successes).
>>>>>>>> Are you using replication?  If so, do you see the high CPU usage on the master or on the replica?
>>>>>>>> Are you able to reproduce with 389-ds-base-1.2.10.4-1 in epel-testing?
>>>>>>>>> ds = 1.2.9.9
>>>>>>>>> RHEL = 5.7
>>>>>>>>> 
>>>>>>>>> Thoughts?
>>>> --
>>>> 389 users mailing list
>>>> 389-users@xxxxxxxxxxxxxxxxxxxxxxx
>>>> https://admin.fedoraproject.org/mailman/listinfo/389-users
> 

--
389 users mailing list
389-users@xxxxxxxxxxxxxxxxxxxxxxx
https://admin.fedoraproject.org/mailman/listinfo/389-users