Allocating DMA-able memory for user programs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Memory management folk:

People have been complaining about memory-related problems with their 
userspace USB drivers.  There are two basic issues:

	Memory fragmentation eventually prevents the kernel from 
	allocating contiguous buffers large enough to hold the I/O 
	data.  Such buffers currently have to be allocated for
	each individual I/O transfer.

	Copying data back and forth between the userspace and kernel
	buffers wastes a lot of time.

The ideal solution, of course, is to use some form of zerocopy I/O, 
telling the hardware to DMA to/from the userspace buffer directly.  
However, we are under some constraints that make this difficult.

	Mapping a userspace buffer for DMA implies using some form of
	scatter-gather, because pages with adjacent virtual addresses
	generally are not physically adjacent.  But the USB kernel
	drivers do not support scatter-gather for isochronous
	transfers, only for bulk transfers (and the complaints here
	are concerned with isochronous).

	Even if scatter-gather weren't an issue, user memory pages
	are not always usable for hardware DMA.  Lots of USB 
	controllers do only 32-bit DMA, so the pages containing the
	user buffer would have to be located physically below 4 GB.
	(If an IOMMU is present this may not matter, but plenty of
	lower-end systems don't have an IOMMU.)

	We want to avoid using automatic bounce buffers, for two
	reasons.  First, they obviously defeat the purpose of
	zerocopy I/O.  Second, isochronous READ transfers often
	leave gaps in the data buffer.  (For example, a buffer might
	be set up to hold two 32-byte transfers, but the first
	transfer might only receive 20 bytes of data.  The buffer
	would end up containing 20 bytes of data read in, followed
	by a 12-byte gap holding stale data -- whatever happened to be 
	there before -- followed by 32 bytes of data read in.)  If we 
	use a bounce buffer automatically allocated in the kernel, we
	have no way to prevent the stale data in the gaps from being
	copied back to userspace, which would be a security leak.

The only solution we have come up with is to create a device-specific
mmap(2) implementation that would allocate contiguous pages within the
device's DMA mask and map them into the user's virtual address space.  
The user program could then use these pages as a buffer to get true
zerocopy I/O.

There's the potential issue of exhausting a limited resource (memory
below 4 GB), but we can take care of that by placing on overall cap on
the amount of memory allocated using this mechanism.

Does this seem reasonable?  I'm not certain about the wisdom of
creating an API for allocating and locking pages below 4 GB and then
hiding it away in the USB subsystem.  But if you folks say it's okay, 
we'll go ahead and do it.

Alan Stern

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>



[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [ECOS]     [Asterisk Internet PBX]     [Linux API]