Re: Data deduplication in LVM?

On 10 June 2009, at 20:41, Roy Sigurd Karlsbakk wrote:

Hi all

I've been reading up on data deduplication and searching for an open-source filesystem that supports it, without much luck. While testing snapshots and so on in LVM, I started wondering whether dedup would be better off in LVM than in the filesystem. Would it be possible and efficient to add dedup to the LVM layer, or perhaps to a layer above LVM? That could make dedup work for all or most filesystems. Keep a hash table of 4k (or whatever size) blocks, map virtual blocks to physical blocks, and run a remapping/deduplication job at night. When a shared block is written to, copy-on-write could be used to keep writes fast.
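To make the idea concrete, here is a rough sketch of the scheme as I picture it, in Python and entirely in memory. It is not how LVM or device-mapper works; the class `DedupVolume`, its method names, and the choice of SHA-256 are all made up for illustration. It only shows the three pieces mentioned above: a virtual-to-physical block map, a batch job that merges identical blocks via a hash table, and copy-on-write when a shared block is written.

```python
import hashlib

BLOCK_SIZE = 4096  # 4k blocks, as suggested above


class DedupVolume:
    """Toy in-memory model of a deduplicating block layer (hypothetical)."""

    def __init__(self, num_blocks):
        zero = bytes(BLOCK_SIZE)
        # Every virtual block starts out with its own physical block.
        self.physical = {i: zero for i in range(num_blocks)}
        self.refcount = {i: 1 for i in range(num_blocks)}
        self.vmap = list(range(num_blocks))  # virtual -> physical
        self.next_id = num_blocks

    def read(self, vblock):
        return self.physical[self.vmap[vblock]]

    def write(self, vblock, data):
        assert len(data) == BLOCK_SIZE
        pblock = self.vmap[vblock]
        if self.refcount[pblock] > 1:
            # Shared block: copy-on-write into a fresh physical block.
            self.refcount[pblock] -= 1
            pblock = self.next_id
            self.next_id += 1
            self.refcount[pblock] = 1
            self.vmap[vblock] = pblock
        self.physical[pblock] = data

    def dedup_pass(self):
        """The 'nightly job': remap virtual blocks with identical content."""
        seen = {}  # content digest -> physical block that is kept
        for vblock, pblock in enumerate(self.vmap):
            digest = hashlib.sha256(self.physical[pblock]).digest()
            keeper = seen.setdefault(digest, pblock)
            # Compare the bytes too, so a hash collision cannot merge
            # two blocks with different content.
            if keeper != pblock and self.physical[keeper] == self.physical[pblock]:
                self.vmap[vblock] = keeper
                self.refcount[keeper] += 1
                self.refcount[pblock] -= 1
                if self.refcount[pblock] == 0:
                    del self.physical[pblock]
                    del self.refcount[pblock]


vol = DedupVolume(4)
a = b"A" * BLOCK_SIZE
b = b"B" * BLOCK_SIZE
for vblock, data in enumerate([a, a, b, a]):
    vol.write(vblock, data)
vol.dedup_pass()   # three identical blocks collapse into one
vol.write(1, b)    # write to a shared block triggers copy-on-write
```

Note that in this sketch a write to a shared block has to allocate a new physical block and update the map before it can complete, which is where the extra cost on the write path would come from.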


Answering myself: it seems this would be a problem without a rather large change to the APIs. If I understand it correctly, deduplicating metadata may impose a rather large performance penalty on writes, and from the block layer, how do you know what is metadata and what is not?

roy
--
Roy Sigurd Karlsbakk
(+47) 97542685
roy@karlsbakk.net
http://blogg.karlsbakk.net/
--
In all pedagogy, it is essential that the curriculum be presented intelligibly. It is an elementary imperative for all pedagogues to avoid excessive use of idioms of foreign origin. In most cases, adequate and relevant synonyms exist in Norwegian.


_______________________________________________
linux-lvm mailing list
linux-lvm@redhat.com
https://www.redhat.com/mailman/listinfo/linux-lvm
read the LVM HOW-TO at http://tldp.org/HOWTO/LVM-HOWTO/
