managing GFS corruption on large FS

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



hi all

We have a large GFS consisting of 4 TB of maildir data. We have a corruption on this GFS which causes nodes to be withdrawn intermittently.

The cause of the fs corruption is due to user error and lack of documentation (initially not having the clustered flag enabled on the VG when growing the LV/GFS). We now know better, and will avoid this particular cause of corruption. However, management wants to know from us how we can prevent corruption, or minimize the downtime incurred if this should happen again.

For the current problem, since a gfs_fsck will take too long (we cannot afford the 1 - 3 days of downtime it will take to complete the fsck), we are planning to migrate the data to a new GFS, and at the same time set up the new environment optimally to cause the minimum of downtime, if a corruption were to happen again.

One option is to split the one big GFS into a number of smaller GFS's. Unfortunately, our environment does not lean itself to being split up in (for example) a number of 200GB GFS's. Also, this negates a lot of the advantages of GFS (e.g. having your storage consolidated onto one big GFS, and scaling it out by growing the GFS and adding nodes).

I would really like to know how others on this list manage the threat/risk of FS corruption, and the corruption itself, if it does happen. Also, w.r.t. data protection, if you do snapshots, SAN-based mirroring, backup to disk/tape, I would really appreciate it if you could give me detail information like
a) mechanism (e.g snaps, backup, etc)
b) type of data (e.g. many small files)
c) size of GFS
d) the time it takes to perform the action

thank you
Riaan
begin:vcard
fn:Riaan van Niekerk
n:van Niekerk;Riaan
org:Obsidian Systems;Obsidian Red Hat Consulting
email;internet:riaan@xxxxxxxxxxxxxx
title:Systems Architect
tel;work:+27 11 792 6500
tel;fax:+27 11 792 6522
tel;cell:+27 82 921 8768
x-mozilla-html:FALSE
url:http://www.obsidian.co.za
version:2.1
end:vcard

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster

[Index of Archives]     [Corosync Cluster Engine]     [GFS]     [Linux Virtualization]     [Centos Virtualization]     [Centos]     [Linux RAID]     [Fedora Users]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite Camping]

  Powered by Linux