On Mon, Aug 26, 2019 at 11:20:17AM -0700, Matthew Wilcox wrote: > On Mon, Aug 26, 2019 at 05:29:49PM +0100, Al Viro wrote: > > On Mon, Aug 26, 2019 at 03:48:38AM +0100, Al Viro wrote: > > > > > We might be able to paper over that mess by doing what /dev/st does - > > > checking that file_count(file) == 1 in ->flush() instance and doing commit > > > there in such case. It's not entirely reliable, though, and it's definitely > > > not something I'd like to see spreading. > > > > This "not entirely reliable" turns out to be an understatement. > > If you have /proc/*/fdinfo/* being read from at the time of final close(2), > > you'll get file_count(file) > 1 the last time ->flush() is called. In other > > words, we'd get the data not committed at all. > > How about always doing the write in ->flush instead of ->release? > Yes, that means that calling close(dup(fd)) is going to flush the > write, but you shouldn't be doing that. I think there'll also be > extra flushes done if you fork() during one of these writes ... but, > again, don't do that. It's not like these are common things. For configfs bin_attr it won't work, simply because it wants the entire thing to be present - callback parses the data. For SCSI tape... Maybe, but you'll need to take care of the overlaps with ->write(). Right now it can't happen (the last reference, about to be dropped right after st_flush() returns); if we do that on each ->flush(), we will have to cope with that fun and we'll need to keep an error (if any) for the next call of st_flush() to pick and return. I'm not saying it can't be done, but that's really a question for SCSI folks. > Why does the prototype of file_operations::release suggest that it can > return an int? __fput doesn't pay any attention to the return value. > Changing that to return void might help some future programmers avoid > this mistake. Hysterical raisins. It's doable, the main question is how much do we aim for and whether it's worth the amount of churn. It has been discussed (last time about 6 years ago), didn't go anywhere. Boggled down in discussing how much churn which cleanups are worth; I wanted to make them void (*some_sane_name)(struct file *) (except that the name I'd used hadn't been sane). Linus wanted void (*release)(struct file *, struct inode *) and suggested to do a big change replacing int with void, basically, then followups fixing the resulting warnings.