Re: [PATCHSET #upstream] libata: improve FLUSH error handling

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Alan Cox wrote:
I do agree with the above, we should try to get the FLUSH done according to spec, I meant to argue that we should bound the time spent. If my laptop spends more than 30? 60? 120? seconds trying to flush a write cache, I will probably be looking for a way to force it to power down ;-)

But if your PhD thesis is being written back you'd be different 8). I am
not sure we can exceed 30 seconds, currently although we set 60 second
I/O timeouts we are timing out at 30 seconds in some traces I get sent so
something is messing up our timeout handling back to the default. I've
tried tracing it and so far failed to figure it out.

The challenge is in getting the retry, more than in just the timeout on just one IO. For example, if we have a full 16MB write cache and the disk is really, really toast (i.e., a head failed which means each and every IO in that 16MB will fail), we don't want to do 16MB/512 distinct 30-60 seconds retries....

That is where Mark's idea about capping the whole sequence of retries comes into play - we can use the global timer to prevent this from running into an eternity of retry attempts.


It is also worth noting that most users of ext3 run without barriers enabled (and the drive write cache enabled) which means that we test this corruption path on any non-UPS power failure.

It is most unfortunate that distributions continue to ship that default.

Alan

I have been thinking that running without barriers by default is mostly OK for laptops (which have a fairly usable UPS in a working battery). If we destage the write cache rebustly as this thread is discussing, we should cover almost all normal failure cases.

Desktop and server systems should normally use either barriers or disable the write cache when ever you have data you care about...

ric
--
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Filesystems]     [Linux SCSI]     [Linux RAID]     [Git]     [Kernel Newbies]     [Linux Newbie]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Samba]     [Device Mapper]

  Powered by Linux