On Tue, 2010-02-02 at 18:14 -0500, Jeff Garzik wrote: > On 02/02/2010 06:05 PM, Andrew Morton wrote: > > On Tue, 02 Feb 2010 16:58:38 -0600 > > James Bottomley<James.Bottomley@xxxxxxx> wrote: > > > >> On Tue, 2010-02-02 at 14:11 -0800, akpm@xxxxxxxxxxxxxxxxxxxx wrote: > >>> From: Catalin Marinas<catalin.marinas@xxxxxxx> > >>> > >>> Depending on the direction of the transfer, flush_dcache_page() must be > >>> called either before (ATA_TFLAG_WRITE) or after (!ATA_TFLAG_WRITE) the > >>> data copying to avoid D-cache aliasing with user space or I-D cache > >>> coherency issues (when reading data from an ATA device using PIO, the > >>> kernel dirties the D-cache but there is no flush_dcache_page() required on > >>> Harvard architectures). > >>> > >>> This patch allows the ARM boards to use a rootfs on CompactFlash with the > >>> PATA platform driver. > >>> > >>> As Anfei Zhou mentioned in a recent patch ("flush dcache before writing > >>> into page to avoid alias"), on some architectures there may be a > >>> performance benefit in differentiating the flush_dcache_page() calls based > >>> on whether the kernel or the user page needs flushing. > >>> > >>> IMHO, we should differentiate based on the direction (kernel reading or > >>> writing from/to such page). In the ARM case with PIPT Harvard caches > >>> (newer processors), the kernel reading from a page that may be mapped in > >>> user space shouldn't need cache flushing. The kernel writing to such page > >>> would require D-cache flushing because of coherency with the I-cache. > >>> Currently on ARM, the latter happens in both cases. > >>> > >>> Signed-off-by: Catalin Marinas<catalin.marinas@xxxxxxx> > >>> Cc: Jeff Garzik<jgarzik@xxxxxxxxx> > >>> Cc: Tejun Heo<tj@xxxxxxxxxx> > >>> Cc:<stable@xxxxxxxxxx> > >>> Signed-off-by: Andrew Morton<akpm@xxxxxxxxxxxxxxxxxxxx> > >>> --- > >>> > >>> drivers/ata/libata-sff.c | 6 ++++++ > >>> 1 file changed, 6 insertions(+) > >>> > >>> diff -puN drivers/ata/libata-sff.c~ata-call-flush_dcache_page-around-pio-data-transfers-in-libata-affc drivers/ata/libata-sff.c > >>> --- a/drivers/ata/libata-sff.c~ata-call-flush_dcache_page-around-pio-data-transfers-in-libata-affc > >>> +++ a/drivers/ata/libata-sff.c > >>> @@ -874,6 +874,9 @@ static void ata_pio_sector(struct ata_qu > >>> > >>> DPRINTK("data %s\n", qc->tf.flags& ATA_TFLAG_WRITE ? "write" : "read"); > >>> > >>> + if (do_write) > >>> + flush_dcache_page(page); > >>> + > >> > >> This looks wrong; the upper layers should already have made the page > >> aliases coherent from user to kernel by calling flush_dcache_page (in > >> __get_user_pages()), so the aliases should already be up to date and > >> this flush is spurious. > > > > The upper layers don't know that the CPU touched the data! If the > > driver did a DMA transfer then such a flush is unneeded, so we don't do > > it. > > The patch in question only affects PIO transfers, not DMA. Data is > transferred from a kernel buffer to hardware via out[bwl] via > > page data -> CPU register -> out[bwl] No flush required here ... all the aliases should be up to date after __get_user_pages() > or, data is transferred from hardware to a kernel buffer via > > in[bwl] -> CPU register -> page data This writes to the page via the kernel alias, so the kernel alias becomes dirty. In order to make the underlying page up to date, the kernel alias must be flushed this flush *must* occur while the page is mapped along the kernel alias. In order that the user aliases see the correct data, any speculation they may have done before the kernel alias was flushed must be invalidated (depending on architectures ... some architectures won't speculate reads in this case). This invalidation *should* be occurring further up the stack .. but if the architecture has implemented this invalidation in the dma ops, then flush_dcache_page() may be needed. > So what are the flushing rules given those conditions? See above. James -- To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html