Re: Data deduplication for Linux : lessfs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Roy Sigurd Karlsbakk wrote:
On 24. juni. 2009, at 17.12, Mark Ruijter wrote:

For those who need OpenSource data deduplication today instead of
tomorrow one might take a look at lessfs.
http://www.lessfs.com

It's a good idea, but given the current traffic on the lessfs mailing list, I'm not sure if much work is done. I have been a member of that list since June 1 and haven't received more than one message, which was the one I wrote myself.

I am thinking about starting to work on a data deduplicating
blockdevice, a kernel module called blockless.

If done smartly, this may perhaps be possible, but the problem is the filesystem's metadata. Is this going to be dedup'ed? How much will this take? A simple backup will update atime on all the files backed up, and although atime isn't always wanted or needed, the problem occurs elsewhere.

Block level deduplication isn't going to know/care about the difference between file contents and metadata. It is either stored in blocks that match other blocks or not and the difference should not be visible to the filesystem living on top of the block device.

--
  Les Mikesell
   lesmikesell@gmail.com

_______________________________________________
linux-lvm mailing list
linux-lvm@redhat.com
https://www.redhat.com/mailman/listinfo/linux-lvm
read the LVM HOW-TO at http://tldp.org/HOWTO/LVM-HOWTO/

[Index of Archives]     [Gluster Users]     [Kernel Development]     [Linux Clusters]     [Device Mapper]     [Security]     [Bugtraq]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]

  Powered by Linux