Re: [Lsf-pc] [LSF/MM TOPIC] Online filesystem check framework

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Jan 22, 2016 at 10:36:26AM +0100, Jan Kara wrote:
> Hello,
> 
> On Thu 21-01-16 22:45:29, Goldwyn Rodrigues wrote:
> > Topic: Generic Online filesystem Check framework
> > 
> > Motivation:
> >  + Better uptime - Filesystems turn read-only at the first error encountered
> > and it may block critical applications which have not encountered the error.
> >  + Unmountable Filesystems - Some filesystems such as clustered filesystem
> > may not be unmountable because they are used by too many computers to be
> > taken offline.
> >  + Autofix - may sound dangerous as fixing is without user intervention, but
> > an option may help admins which are looking for a good uptime.
> >  + Logic inbuilt - most logic of access to filesystem is already in the
> > filesystem driver. The fix/check would make use of existing functionality.
> > 
> > Framework would be around providing a generic interface framework which
> > would use inode numbers as the basic unit to check or fix. Other metadata
> > may need special parameters. Userspace scripts will issue check/fixes to the
> > system, which may/may not be driven by
> 
> Well, this is a very difficult topic so I think we need some concrete
> proposal and ideally some RFC code about what you think needs to be done to
> make online fsck possible. Without that it will be just a useless
> handwaving since I don't think anybody is able to come up with a decent
> proposal during that half an hour session...

I don't see it being generic, either, because metadata structurei,
dependencies, redundancy and algorithms are highly filesystem
specific. It just doesn't seem feasible to me - if it was then we'd
only need one userspace fsck/repair program....

FWIW, for XFS we already have a plan towards implementing online
repair[*] which we've been working towards for the past 5 years, but
it's another 5 years worth of work before we'll get to the point of
being able to dynamically repair corruptions as they are detected...

[*] I wrote this document back in 2008:

http://xfs.org/index.php/Reliable_Detection_and_Repair_of_Metadata_Corruption

We've got metadata CRCs now, and reverse mapping is just about ready
to be merged. Then we need parent pointers (i.e. directory structure
reverse mappings), transaction rollback on error, etc, before we can
even start to think about online repair algorithms.

Cheers,

Dave.
-- 
Dave Chinner
david@xxxxxxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-block" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux RAID]     [Linux SCSI]     [Linux ATA RAID]     [IDE]     [Linux Wireless]     [Linux Kernel]     [ATH6KL]     [Linux Bluetooth]     [Linux Netdev]     [Kernel Newbies]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Device Mapper]

  Powered by Linux