Re: Understanding VDO vs ZFS

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]




Il 03/05/20 04:50, david ha scritto:
Folks

I'm looking for a solution for backups because ZFS has failed on me too many times.  In my environment, I have a large amount of data (around 2tb) that I periodically back up.  I keep the last 5 "snapshots".  I use rsync so that when I overwrite the oldest backup, most of the data is already there and the backup completes quickly, because only a small number of files have actually changed.

Because of this low change rate, I have used ZFS with its deduplication feature to store the data.  I started using a Centos-6 installation, and upgraded years ago to Centos7.  Centos 8 is on my agenda.  However, I've had several data-loss events with ZFS where because of a combination of errors and/or mistakes, the entire store was lost.  I've also noticed that ZFS is maintained separately from Centos.  At this moment, the Centos 8 update causes ZFS to fail.  Looking for an alternate, I'm trying VDO.

In the VDO installation, I created a logical volume containing two hard-drives, and defined VDO on top of that logical volume.  It appears to be running, yet I find the deduplication numbers don't pass the smell test.  I would expect that if the logical volume contains three copies of essentially identical data, I should see deduplication numbers close to 3.00, but instead I'm seeing numbers like 1.15.  I compute the compression number as follows:
 Use df and extract the value for "1k blocks used" from the third column
 use vdostats --verbose and extract the number titled "1K-blocks used"

Divide the first by the second.

Can you provide any advice on my use of ZFS or VDO without telling me that I should be doing backups differently?

Thanks

David

Hi David, I'm not an expert about vdo but I will try it for backup purpose with rsync + hardlink. I know that this is not an answer you asked, sorry for this.

Many user said me to use  more specific tool for running backup using deduplication (borg in my case). I'm testing it and I'm not sure if I will adopt it in the long term. As you reported, you are using rsync solution so I would ask: why not use a more specific tool? What are benefits to stay with rsync for you?

Thank you in advance.

_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
https://lists.centos.org/mailman/listinfo/centos




[Index of Archives]     [CentOS]     [CentOS Announce]     [CentOS Development]     [CentOS ARM Devel]     [CentOS Docs]     [CentOS Virtualization]     [Carrier Grade Linux]     [Linux Media]     [Asterisk]     [DCCP]     [Netdev]     [Xorg]     [Linux USB]


  Powered by Linux