Re: 1.2.11.15 "rolls up" some objectclasses?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Rich, apologies for the delay in replying, I've been out of the
office for a couple of days.



On 12/04/2013 08:31 AM, Rich Megginson wrote:
> On 12/02/2013 06:42 PM, Colin Panisset wrote:
>> I have a 4-way multi-master replication configuration; the servers are
>> slightly different versions, as below:
>>
>> A - 1.2.9.9-1.el5 (CentOS 5)
>> B - 1.2.9.9-1.el5 (CentOS 5)
>> C - 1.2.10.2-20.el6_3 (CentOS 6)
>> D - 1.2.11.15-22.el6_4 (CentOS 6)
>>
>> D was recently brought into the configuration to replace A (ultimately).
>>
>> I initialized D as a consumer directly from A, and I've confirmed that
>> replication proceeds throughout the mesh without apparent incident --
>> there are no errors in /var/log/dirsrv/slapd*/errors relating to
>> replication.
>>
>> The problem is that *some* objects under ou=people,dc=foo,dc=bar on D do
>> not show some objectclasses, notably "person" and
>> "organizationalPerson". These values don't show up in the output of
>> ldapsearch, via the console, or when used by an internal search process,
>> such as populating an nsFilteredRole.
> 
> Is this the blocker problem, that filtered roles are not working? Just
> trying to gauge the severity.

I was able to work around the problem by changing the filter on those
roles that weren't working -- so it's not a major issue for us now.
However, I'm still trying to work out why it happened.


> What is the full objectclass chain for these entries?  That is, are
> these "inetOrgPerson" entries that are missing the intermediate parent
> object classes "person" and "organizationalPerson"?

Yes. The objectclass chain for one example user looks like this:

nscpEntryWSI:
objectClass;adcsn-5296e4e8000064a40000;vucsn-5296e4e8000064a40000: top
nscpEntryWSI: objectClass;vucsn-5296e4e8000064a40000: inetOrgPerson
nscpEntryWSI: objectClass;vucsn-5296e4e8000064a40000: posixAccount
nscpEntryWSI: objectClass;vucsn-5296e66c000164a40000: ntUser
nscpEntryWSI: objectClass;vucsn-5296e48b000064a40000;deleted:
organizationalPerson
nscpEntryWSI: objectClass;vucsn-5296e48b000064a40000;deleted: person

I note the 'deleted' flag on the 'person' and 'organizationalPerson' values.

I checked the other users in the same boat, and for *some*, it's that
the intermediate parents are marked as 'deleted', and on others they
simply don't exist; for example:

nscpEntryWSI:
objectClass;adcsn-5276ebaf000064a40001;vucsn-5276ebaf000064a40001: top
nscpEntryWSI: objectClass;vucsn-5276ebaf000064a40001: inetOrgPerson
nscpEntryWSI: objectClass;vucsn-5276ebaf000064a40001: posixAccount
nscpEntryWSI: objectClass;vucsn-5276ec06000064a40000: inetUser
nscpEntryWSI: objectClass;vucsn-5276ee0d000064a40000: ntUser

> What you can do is work backwards from these entries that are missing
> these objectclasses.
> 1) do an ldapsearch like this to get the replication state information:
> ldapsearch -xLLL -D "cn=directory manager" -W -b dc=foo,dc=bar
> uid=myuser nscpEntryWSI
> 
> among the data will be the first CSN for the entry.  The CSN looks like
> this
> TTTTTTTTSSSSRRRRUUUU
> Where TTTTTTTT is the 8 hex bytes of the timestamp, SSSS is a sequence
> number, and RRRR is the replica ID of the server on which the entry
> originated.  The RRRR is in hex.

Ok, from the above I get that the replica ID is 64a4, which is 25764 --
I know which replica of ours this is (we have an internally-meaningful
numbering scheme)

> 
> Next, go to the server on which the entry originated.  Do this:
> 
> dbscan -f /var/lib/dirsrv/slapd-*/cldb/*.db4 -k $theoriginalCSN
> 
> The server is supposed to "fill in" the missing parent object classes
> during the original add request, but perhaps not during a replicated add.

Unfortunately when I got back to the originating server, the changelog
holding the create event had expired so I wasn't able to see the
original entry. Instead, I added a user (via ldapadd on the
command-line) and traced the replication via that method, but only
specifying the "leaf" classes; the intermediate classes *were* properly
filled in and were replicated correctly.

At this point, I'm not sure what I can do to re-replicate the issue.
I'll keep an eye out for it again, and I'm very happy to assist in
further triage if you have any ideas, but from where I sit right now it
appears the trail has gone cold.

Thanks for the hints in tracking it further.

  -- C.
-- 
Colin Panisset
Tech Lead - Infrastructure, REA Group
Ph: +61 (0)3 8456 4636 Mb: +61 (0) 457 788 259
--
389 users mailing list
389-users@xxxxxxxxxxxxxxxxxxxxxxx
https://admin.fedoraproject.org/mailman/listinfo/389-users





[Index of Archives]     [Fedora User Discussion]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [Fedora News]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Maintainers]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Legacy]     [Fedora Desktop]     [Fedora Fonts]     [ATA RAID]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Centos]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora QA]     [Fedora Triage]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Tux]     [Yosemite News]     [Yosemite Photos]     [Linux Apps]     [Maemo Users]     [Gnome Users]     [KDE Users]     [Fedora Tools]     [Fedora Art]     [Fedora Docs]     [Maemo Users]     [Asterisk PBX]     [Fedora Sparc]     [Fedora Universal Network Connector]     [Fedora ARM]

  Powered by Linux