On Sun, 2008-02-03 at 11:38 -0500, Jeff Garzik wrote: > James Bottomley wrote: > > The aic94xx sequencer has a very finely honed sense of DMA transfers. > > It's fully automated, and handles both ATA DMA and ATA PIO in the > > sequencer engine (so all the driver sees is DMA). > > ditto AHCI, and most other DMA engines > > > > It reports both underrun and overrun conditions. For DMA underrun > > ditto AHCI, and most other DMA engines > > > > (device transfers less than expected, it just returns what it has and > > how much was missing as the residual) for DMA overrun (as in device > > tried to take more than it was programmed to send on either read or > > write) for PIO it does seem to zero fill or discard and then simply > > report task complete with overrun and let libsas sort it out. I suspect > > for DMA it first tries DMAT before taking other actions, but I'd need a > > protocol analyser (or the sequencer docs) to be sure. > > Almost every other DMA engine on the planet besides aic94xx is pretty > much the same... you set up an s/g tables, and it reports overrun or > underrun via an interrupt + status register bit. > > It sounds like aic94xx might do more work in the firmware -- that counts > as "advanced", since some of the DMA engine cleanup clearly occurs in > firmware, rather than pushed to kernel software. > > Nowhere do I see anything about AHCI that is "broken." It has standard > DMA engine behavior found in storage and non-storage hardware. I'm only really going by what Tejun says about AHCI. The problem as I understand it is data overrun on PIO mode commands. AHCI apparently (like aic94xx) processes these internally and doesn't actually use the libata pio handlers, so it just uses an internal buffer to receive the PIO and DMA it into memory. However, Tejun says (and he'll correct me if I'm paraphrasing wrongly, since this was on IRC a while ago) that jmb ahci and sata_sil24 both error out in different (but fairly nasty) ways if they get extra PIO data that there's no place in the SG list to deposit. This seems to be why he wants to introduce a DMA drain in addition to the existing PIO drain. I'd certainly characterise this behaviour as "broken" ... especially as not all AHCI implementations apparently have the bug ... some do exactly the right thing on PIO overruns and don't need the drain element. > > We handle overruns as error conditions in both SAS and ATA at the > > moment, but the point is that the ATAPI device is fully happy and > > quiesced when we do this. > > That may be the result of aic94xx handling extra FIS's in the firmware, > something we cannot depend on for purely silicon-based devices. > > mvsas, broadsas, ahci, sata_sil24, and others behave similarly... > Please don't mistake lack of firmware cleanup as "broken hardware." OK, well this is probably me coming from the SCSI world. Data overruns happen (especially if you allow users to control the command parameters), and just discarding the overrun has been common practice for decades. Correct me if I'm wrong, but I think ATAPI7v3 covers the PIO overrun case in 20.4 and indicates precisely how the HBA should handle the problem. ACHI also includes the PxIS.OFS bits precisly for this case. The problem seems to come when using this mechanism hangs the device (as it apparently does in a limited number of ACHI implementations) and we just have to take the data in via DMA from the HBA (hence the need for a drain buffer). James - To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html