On Fri, Feb 17, 2012 at 9:35 AM, Nathan Stratton <nathan at robotics.net> wrote: > Gluster can easily support a large number of nodes, the question is how much > you care about the underlying data. We store 2 copies of the data and our > underlying hardware is RAID6 allowing us to lose 2 disks on each server. As > the number of nodes grows that chance of losing a node becomes higher, but > one of the beautiful things about Gluster is you still have access to all > the other data that is NOT on the lost pair of servers. We have found that > your best bet is to split the data over as many servers as possible provides > the best up time. How are folks backing up large amounts of data in their gluster file systems? Replication? Snapshots and archival? As file systems grow to > 1PB the conventional backup to disk / tape methodology needs to change. We are putting a lot of thought into this subject here. - Ryan -- http://prefetch.net