-------- Original Message --------
Subject: How to handle remove media (was: Re: What about not warn on
some abort_transaction() case whose reason is known?)
From: Martin Steigerwald <Martin@xxxxxxxxxxxx>
To: Qu Wenruo <quwenruo@xxxxxxxxxxxxxx>
Date: 2014年12月31日 18:24
I am cc´ing this to fsdevel as I think how to handle a disconnected usb device
may be of broader interest. Well free to drop Cc again in case you see it as
only BTRFS specific issue.
Am Mittwoch, 31. Dezember 2014, 09:30:49 schrieb Qu Wenruo:
Hi all,
Hi Qu,
While surfing the Redhat BZ, a lot(at least 5 I found in one month)
users report "bugs" in btrfs about
kernel warning in btrfs_abort_transaction().
And most of them (about 3 or more) are caused by disconnected usb device.
So I'm considering not to warn on some cases if we know its reason, like
the above device disconnected
case, but still warn on other cases.
This should reduce many unneeded bug report for the usb disconnected case.
Any advice is welcomed.
How about warning, but also mentioned the *reason*?
Disconnecting an USB device without unmounting is still not so nice and a
warning, well, any unwritten data has been lost then already, so, but still. I
know with esata disks you have a grace time, if you replug it quickly enough
while libata driver is still retrying it will continue the write.
Unneeded kernel warning always leads to duplicated BZ, but on the other
hand, it's true that is a problem
and need to warn user.
So warning with reason seems good enough.
I for a long time thought about a feature request for the Linux kernel to
handle removable media in the very sane way AmigaOS does. I never did so in
all the years, but heck, why not today?
If you remove it while writing, you get a nice dialog saying
"You MUST insert volume xyz again"
You do it, and it continues writing. Now how cute is that? Its the best way to
handle this kind of situation for the user in my point of view. (On the other
hand, if you didn´t, and it was a floppy disk with original Amiga filesystem,
the disk was broke, so the "MUST" was no joke).
I remember that this has been topic of a summer of code project for NetBSD,
but I don´t know what came out of it.
I know the difficulties with this. The kernel will need to pile up I/O to the
device and at some point halt processes to prevent memory exhaustion.And then
you need to route the request to reinsert the medium to the user, to the
desktop. And what do you do on a server? Where do you ask then? On the command
line? And if so, how do to that in a non annoying way? Maybe that is just
something to opt in for a desktop system.
Other than the method to inform users, more difficulties lie in the
half-written handle things, filesystems may need
to keep all the page caches even after some of them is written to disk,
to ensure they can be re-queued to handle
disk remove.
This may need to a huge change to almost every fs to implement a full
rollback(at least full rollback to previous version)
transaction mechanism(good news is, btrfs may be less impacted), and the
problem you already mentioned, extra
memory usage.
But the idea is still great, I really hope to see it.
Thanks,
Qu
So this would be quite some work, but I always thought: How AmigaOS handles
this is the *only* sane way to do it for any media that you cannot prevent
accidental removal on a hardware level – at least for the desktop case. At
least from a users point of view. Just discarding data on that accident is
just plain unfriendly to the user and an invitation for data loss (if the user
chose to move files instead of copying them).
And I found it that for some users I can tell them to safely remove the USB
stick before unplugging it again and again, but they still won´t do it, it
just doesn´t sink in. Meanwhile I usually say: Wait 30 seconds after last
write and then unplug and then hope for the best.
I still think AmigaOS goes beyond all the other operation systems I know with
this feature. But well, I am not exactly sure how MS-DOS or Windows handle
this. I vaguely remember some retry prompt from MS-DOS, but it may have been
for another case.
But well, so yes, a warning in the log may just be completely useless, cause
its too late then, for the data that was about to be safed. And if there is no
data to be saved anymore, a warning does not make any sense either, cause
there isn´t a problem. Yet, an aborted transaction means there was data to be
saved, so.
So or so, this may be something to handle on the block or VFS layer anyway?
Ciao,
--
To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html