So since nobody who's actually working on it has chimed in: While there is some deduplication functionality built into the system, AFAIK it's not something considered for users at this point. It's under ongoing development, doesn't have performance data, and isn't plumbed through into a lot of the system. Last I heard about it was a discussion about how to handle the reference counting used to delete data when it's no longer needed. (See the thread "refcounting chunks vs snapshots" and https://github.com/ceph/ceph/pull/29283) -Greg On Wed, Oct 2, 2019 at 4:48 PM The Zombie Hunter <thezombiehunter@xxxxxxxxx> wrote: > > From my initial testing it looks like 14.2.4 fully supports the deduplication mentioned here: > > https://docs.ceph.com/docs/master/dev/deduplication/ > > However, I'm not sure where the struct object_manifest script goes in relation to foo and foo-chunk, and I'm not aware of what the offsets/caspool should be. > > If this still isn't fully implemented how does the dedup tool work? If I remove a file but it exists elsewhere on the volume, will it be purged or would the tool need to run again to clear the data? > _______________________________________________ > ceph-users mailing list -- ceph-users@xxxxxxx > To unsubscribe send an email to ceph-users-leave@xxxxxxx _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx