On Wed, 13 May 2015, Michal Hocko wrote: > On Fri 08-05-15 16:06:10, Eric B Munson wrote: > > On Fri, 08 May 2015, Andrew Morton wrote: > > > > > On Fri, 8 May 2015 15:33:43 -0400 Eric B Munson <emunson@xxxxxxxxxx> wrote: > > > > > > > mlock() allows a user to control page out of program memory, but this > > > > comes at the cost of faulting in the entire mapping when it is > > > > allocated. For large mappings where the entire area is not necessary > > > > this is not ideal. > > > > > > > > This series introduces new flags for mmap() and mlockall() that allow a > > > > user to specify that the covered are should not be paged out, but only > > > > after the memory has been used the first time. > > > > > > Please tell us much much more about the value of these changes: the use > > > cases, the behavioural improvements and performance results which the > > > patchset brings to those use cases, etc. > > > > > > > The primary use case is for mmaping large files read only. The process > > knows that some of the data is necessary, but it is unlikely that the > > entire file will be needed. The developer only wants to pay the cost to > > read the data in once. Unfortunately developer must choose between > > allowing the kernel to page in the memory as needed and guaranteeing > > that the data will only be read from disk once. The first option runs > > the risk of having the memory reclaimed if the system is under memory > > pressure, the second forces the memory usage and startup delay when > > faulting in the entire file. > > Is there any reason you cannot do this from the userspace? Start by > mmap(PROT_NONE) and do mmap(MAP_FIXED|MAP_LOCKED|MAP_READ|other_flags_you_need) > from the SIGSEGV handler? > You can generate a lot of vmas that way but you can mitigate that to a > certain level by mapping larger than PAGE_SIZE chunks in the fault > handler. Would that work in your usecase? This might work for the use cases I have laid out (I am not sure about the anonymous mmap one, but I will try it). I am concerned about how much memory management policy these suggestions push into userspace. I am also concerned about the number of system calls required to do the same thing. This will require a new call to mmap() for every new page accessed in the file (or for every file_size/map_size in the multiple page chunk). The simple case of calling mlock() on the every time the file was accessed was significantly slower than the LOCKONFAULT flag. Your suggestion will be better in that it avoids the extra mlock call for pages already locked, but there still significantly more system calls. I will add this to the program I have been using to measure executuion times and see how it compares to the other options. Eric
Attachment:
signature.asc
Description: Digital signature