[delayed and partial response because I'm on vacation, still feeling like I should shime in] On Fri, Jul 31, 2020 at 08:08:57AM +1000, Dave Chinner wrote: > In which case, you just identified why the uptodate array is > necessary and can't be removed. If we do a sub-page write() the page > is not fully initialised, and so if we then mmap it readpage needs > to know what part of the page requires initialisation to bring the > page uptodate before it is exposed to userspace. > > But that also means the behaviour of the 4kB write on 64kB page size > benchmark is unexplained, because that should only be marking the > written pages of the page up to date, and so it should be behaving > exactly like ext4 and only writing back individual uptodate chunks > on the dirty page.... We have two different cases here: file read in through read or mmap, or just writing to a not cached file. In the former case redpage reads everything in, and everything will also be written out. If OTOH write only read in parts only those parts will be written out. > > You're clearly talking to different SSD people than I am. > > Perhaps so. > > But it was pretty clear way back in the days of early sandforce SSD > controllers that compression and zero detection at the FTL level > resulted in massive reductions in write amplification right down at > the hardware level. The next generation of controllers all did this > so they could compete on performance. They still do this, which is > why industry benchmarks test performance with incompressible data so > that they expose the flash write perofrmance, not just the rate at > which the drive can detect and elide runs of zeros... I don't know of any modern SSDs doing zeroes detection. > IOWs, showing that even high end devices end up bandwidth limited > under common workloads using default configurations is a much more > convincing argument... Not every SSD is a high end device. If you have an enterprise SSD with a non-volatile write cache and a full blown PCIe interface bandwith is not going to a limitation. If on the other hand you have an el-cheapo ATA SSD or a 2x gen3 PCIe consumer with very few flash channels OTOH..