On Mon 09-03-20 21:02:50, Shaju Abraham wrote: > On Mon, Mar 9, 2020 at 5:28 PM Michal Hocko <mhocko@xxxxxxxxxx> wrote: > > > On Mon 09-03-20 11:31:41, Shaju Abraham wrote: > > > The VM pressure notification flags have excluded GFP_KERNEL with the > > > reasoning that user land will not be able to take any action in case of > > > kernel memory being low. This is not true always. Consider the case of > > > a user land program managing all the huge memory pages. By including > > > GFP_KERNEL flag whenever the kernel memory is low, pressure notification > > > can be send, and the manager process can split huge pages to satisfy > > kernel > > > memory requirement. > > > > Are you sure about this reasoning? GFP_KERNEL = __GFP_FS | __GFP_IO | > > __GFP_RECLAIM > > Two of the flags mentioned there are already listed so we are talking > > about __GFP_RECLAIM here. Including it here would be a more appropriate > > change than GFP_KERNEL btw. > > > > But still I do not really understand what is the actual problem and how > > is this patch meant to fix it. vmpressure is triggered only from the > > reclaim path which inherently requires to have __GFP_RECLAIM present > > so I fail to see how this can make any change at all. How have you > > tested it? > > > > We have a user space application which waits on memory pressure events. > Upon receiving the event, the user space program will free up huge > pages to make more memory available in the system. This mechanism > works fine if the memory is being consumed by other user space > applications. To test this, we wrote a test program which will > allocate all the memory available in the system using malloc() and > touch the allocated pages. When the free memory level becomes low, > the pressure event is fired and the process gets notified about it . > The same test is repeated with kmalloc() instead of malloc(). A test > kernel module is developed, which will allocate all the available > memory with kmalloc(GFP_KERNEL) flag. The OOM killer gets invoked in > this case. The memory pressure event is not fired. After modifying > the vmpressure.c with the attached patch, the pressure event gets > triggered. Swap is disabled in the system we were testing. Are you sure this is really the case? I am either missing something here or your test might simply be timing specific because GFP_KERNEL & (__GFP_FS | __GFP_IO) = true so I really do not see how the current code could bail out on the test you are patching so that the patch would make any change. The only real difference this patch makes is to trigger events for __GFP_RECLAIM allocations which could be GFP_NOIO. All non-sleepable allocations would wake kswapd and that would in turn reclaim with _GFP_FS | __GFP_IO set so the check doesn't change anything. Am I missing something? -- Michal Hocko SUSE Labs