Re: [PATCH] whymb: fix description in DMA coherence

On Wed, Apr 20, 2022 at 11:38:57AM +0000, Hao Lee wrote:
> On Tue, Apr 19, 2022 at 10:36:16AM -0700, Paul E. McKenney wrote:
> > On Mon, Apr 18, 2022 at 06:23:59AM +0000, Hao Lee wrote:
> > > On Sun, Apr 17, 2022 at 09:52:39AM -0700, Paul E. McKenney wrote:
> > > > On Sun, Apr 17, 2022 at 11:09:47AM +0000, Hao Lee wrote:
> > > > > CPU caches need to be flushed before DMA operations to avoid data
> > > > > overwriting.
> > > > > 
> > > > > Signed-off-by: Hao Lee <haolee.swjtu@xxxxxxxxx>
> > > > > ---
> > > > >  appendix/whymb/whymemorybarriers.tex | 2 +-
> > > > >  1 file changed, 1 insertion(+), 1 deletion(-)
> > > > > 
> > > > > diff --git a/appendix/whymb/whymemorybarriers.tex b/appendix/whymb/whymemorybarriers.tex
> > > > > index b49d5fe8..4ec27962 100644
> > > > > --- a/appendix/whymb/whymemorybarriers.tex
> > > > > +++ b/appendix/whymb/whymemorybarriers.tex
> > > > > @@ -1640,7 +1640,7 @@ future such problems:
> > > > >  	in any DMA buffer before presenting that buffer to the
> > > > >  	I/O device.
> > > > 
> > > > The paragraph above is before the I/O starts.  Any changes to the buffer
> > > > need to be flushed to memory so that the I/O device's DMA actually
> > > > sees them.
> > > > 
> > > > The next paragraph is the other end of I/O request, after it completes.
> > > > So we cannot have "before".  But maybe we need "invalidate" instead of
> > > > "flush".
> > > 
> > > Got it!
> > > 
> > > > With a note that it can be more efficient to both flush and
> > > > invalidate prior to presenting the DMA buffer to the device.
> > > > 
> > > > The point is that any pre-DMA data in the CPU caches must be gotten
> > > > rid of in order for the CPUs to see the data that was actually DMA'ed
> > > > into memory.
> > > 
> > > Great explanation! Now I know we need something like the WBINVD instruction
> > > for the former case and the INVD instruction for the latter.
> > 
> > Glad that it helped!
> > 
> > > > >  	Similarly, you need to flush the CPU caches of any location
> > > 
> > > Should we change the word "flush" to "invalidate" to differentiate these
> > > two opposite DMA operations?

I did use "invalidate" after an input DMA, but added a footnote about
the incorrect store operation.

> > If after the DMA, we need to care about the difference, hasn't the CPU
> > incorrectly written to part of the DMA buffer during the DMA operation?
> 
> Sorry, I can't understand this passage...  Maybe I didn't make myself
> clear.  I found a great example in arch/powerpc/include/asm/cacheflush.h
> 
> Before providing an output buffer to a device for DMA, we should call
> flush_dcache_range or clean_dcache_range so that the device sees the
> latest data generated by the CPU.  After DMA writes data into an input
> buffer, we should use invalidate_dcache_range to invalidate the CPU
> cache so that the CPU can read the latest data written by the device.
> 
> Should we change the word "flush" in the following sentence to "invalidate"?
> 
> 	Similarly, you need to flush the CPU caches of any location in
> 	any DMA buffer after DMA to that buffer completes.

The trick is that "invalidate" and "flush" normally do the same thing
for cachelines in the MESI exclusive and shared states.  In both cases,
memory is considered to be up to date, so the data can simply be dropped
from the cache.  There is a difference in the MESI modified state, but
the flush prior to the DMA will have ensured that none of the cache
lines corresponding to the DMA buffer are in exclusive state.
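
Roughly, per cacheline:

	MESI state   Flush (writeback + invalidate)   Invalidate only
	----------   ------------------------------   -------------------
	Invalid      nothing to do                    nothing to do
	Shared       drop the line                    drop the line
	Exclusive    drop the line                    drop the line
	Modified     write back, then drop            drop (data is lost)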

So the only way that a post-DMA flush can be different than a post-DMA
invalidation is if the CPU (incorrectly!) stored to the DMA buffer while
the DMA was happening.

And you should never store to a DMA buffer while the DMA is happening,
at least not if you value the data being DMA'ed.  ;-)
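
For whatever it is worth, here is a rough sketch of that discipline in code
form, using the powerpc helpers you pointed at.  The dev_*(), fill_buffer(),
and consume_buffer() functions are made up purely for illustration, and a
real driver would normally go through the DMA-mapping API rather than
calling the cache helpers directly.

#include <linux/types.h>
#include <asm/cacheflush.h>

/* Hypothetical driver primitives, declared only to keep the sketch
 * self-contained. */
extern void dev_kick_dma_out(void *buf, size_t len);
extern void dev_kick_dma_in(void *buf, size_t len);
extern void dev_wait_dma_done(void);
extern void fill_buffer(void *buf, size_t len);
extern void consume_buffer(void *buf, size_t len);

/* Output DMA: CPU-generated data flows out to the device. */
static void dma_to_device(void *buf, size_t len)
{
	fill_buffer(buf, len);			/* CPU stores to the buffer. */
	flush_dcache_range((unsigned long)buf,
			   (unsigned long)buf + len);
						/* Push dirty lines to memory. */
	dev_kick_dma_out(buf, len);		/* Device now reads up-to-date memory. */
	dev_wait_dma_done();
}

/* Input DMA: device-generated data flows in to the CPU. */
static void dma_from_device(void *buf, size_t len)
{
	dev_kick_dma_in(buf, len);		/* Device writes directly to memory. */
	dev_wait_dma_done();
	invalidate_dcache_range((unsigned long)buf,
				(unsigned long)buf + len);
						/* Drop any stale cached copies. */
	consume_buffer(buf, len);		/* CPU loads now fetch fresh data. */
}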

> > If you are thinking of some other scenario, could you please list the
> > events leading up to that scenario?
> > 
> > > > > -	in any DMA buffer after DMA to that buffer completes.
> > > > > +	in any DMA buffer before DMA to that buffer completes.
> > > > >  	And even then, you need to be \emph{very} careful to avoid
> > > > >  	pointer bugs, as even a misplaced read to an input buffer
> > > > >  	can result in corrupting the data input!
> > > > 
> > > > And this last clause could just be wordsmithed a bit.
> > > 
> > > Maybe it's not quite rigorous?
> > > 
> > > If I understand correctly, a misplaced read will load data from the
> > > input buffer area into a cache line.  If the CPU doesn't write to this
> > > cache line, I think this case can only result in the CPU reading stale
> > > data on subsequent reads, but it won't corrupt the input buffer, given
> > > that this cache line is clean and won't be written back to memory.
> > 
> > If the user code that accesses the buffer runs on this same CPU, it will
> > look to that CPU as if the DMA buffer was corrupted, correct?
> 
> Ah, this indeed makes sense!
> 
> > After
> > all, that user code will see the value loaded by the CPU, not the value
> > actually DMAed into memory.
> > 
> > Hmmm...  It might be necessary to spell out these DMA-corruption
> > scenarios.  Thoughts?
> 
> Yes. I think spelling out these scenarios should make this passage easier to
> understand.
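
And to spell out that scenario in code form, something like the following
hypothetical sequence is what the text is warning against.  It reuses the
made-up helpers from the sketch above, plus a made-up process_input(), and
it assumes that the post-DMA invalidation is done only on the CPU that is
expected to process the buffer:

/* CPU 0: the intended input path, following the flush/invalidate rules. */
static void cpu0_input_path(char *buf, size_t len)
{
	dev_kick_dma_in(buf, len);	/* Device begins writing buf.		*/
	dev_wait_dma_done();		/* Memory now holds the new data.	*/
	invalidate_dcache_range((unsigned long)buf,
				(unsigned long)buf + len);
	process_input(buf, len);	/* CPU 0 sees the DMAed data.		*/
}

/* CPU 1: a pointer bug reads buf while the DMA is still in flight. */
static void cpu1_pointer_bug(char *buf)
{
	char stray = buf[0];	/* Pulls a pre-DMA cacheline into CPU 1's cache. */

	/*
	 * Nothing ever invalidates CPU 1's copy, so if user code later
	 * reads buf on CPU 1, it sees this stale value rather than the
	 * DMAed data, which looks exactly like corrupted input.
	 */
	(void)stray;
}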

How does the diff shown below look?

							Thanx, Paul

------------------------------------------------------------------------

commit 5b1df1e1d99d59ee73286fded86276495d4cb138
Author: Paul E. McKenney <paulmck@xxxxxxxxxx>
Date:   Fri May 6 12:49:36 2022 -0700

    appendix/whymb: Clarify DMA data-corruption scenarios
    
    Reported-by: Hao Lee <haolee.swjtu@xxxxxxxxx>
    Signed-off-by: Paul E. McKenney <paulmck@xxxxxxxxxx>

diff --git a/appendix/whymb/whymemorybarriers.tex b/appendix/whymb/whymemorybarriers.tex
index 2140eb8a..f2cca5d6 100644
--- a/appendix/whymb/whymemorybarriers.tex
+++ b/appendix/whymb/whymemorybarriers.tex
@@ -1739,11 +1739,30 @@ future such problems:
 	you must carefully flush the CPU caches of any location
 	in any DMA buffer before presenting that buffer to the
 	I/O device.
-	Similarly, you need to flush the CPU caches of any location
-	in any DMA buffer after DMA to that buffer completes.
+	Otherwise, a store from one of the CPUs might not be
+	accounted for in the data DMAed out through the device.
+	This is a form of data corruption, which is an extremely
+	serious bug.
+
+	Similarly, you need to invalidate\footnote{
+		Why not flush?
+		If there is a difference, then a CPU must have incorrectly
+		stored to the DMA buffer in the midst of the DMA operation.}
+	the CPU caches corresponding to any location in any DMA buffer
+	after DMA to that buffer completes.
+	Otherwise, a given CPU might see the old data still residing in
+	its cache instead of the newly DMAed data that it was supposed
+	to see.
+	This is another form of data corruption.
+
 	And even then, you need to be \emph{very} careful to avoid
 	pointer bugs, as even a misplaced read to an input buffer
 	can result in corrupting the data input!
+	One way to avoid this is to invalidate all of the caches of
+	all of the CPUs once the DMA completes, but it is much easier
+	and more efficient if the device DMA participates in the
+	cache-coherence protocol, making all of this flushing and
+	invalidating unnecessary.
 
 \item	External busses that fail to transmit cache-coherence data.
 


