On Tue, Nov 15, 2011 at 04:28:36PM -0800, Colin Cross wrote:
> On Tue, Nov 15, 2011 at 4:22 PM, Minchan Kim <minchan.kim@xxxxxxxxx> wrote:
> > On Tue, Nov 15, 2011 at 05:36:56PM +0000, Mel Gorman wrote:
> >> On Wed, Nov 16, 2011 at 01:13:30AM +0900, Minchan Kim wrote:
> >> The impact would be that during the time between processes being frozen
> >> and storage being suspended, GFP_NOIO allocations that used to call
> >> wait_iff_congested and retry while kswapd does its thing will return
> >> failure instead. These GFP_NOIO allocations that used to succeed will
> >> now fail in rare cases during suspend and I don't think we want that.
> >>
> >> Is this what you meant or had you something else in mind?
> >>
> >
> > You read my mind exactly!
> >
> > I thought the hibernation process was as follows:
> >
> > freeze user processes
> > oom_disable
> > hibernate_preallocate_memory
> > freeze kernel processes (including kswapd)
> > pm_restrict_gfp_mask
> > swsusp_save
> >
> > My guess was that hibernate_preallocate_memory should reserve all the
> > memory needed for hibernation, because kswapd will be stopped and cannot
> > reclaim pages, so page reclaim should not occur during swsusp_save.
> >
> > But seeing the description of the patch, my guess seems wrong.
> > The problem does happen now, which means page reclaim happens during
> > swsusp_save. Colin or someone could confirm this?
>
> The problem I see is during suspend, not hibernation. The particular
> allocation that usually causes the problem is the pgd_alloc for page
> tables when re-enabling the 2nd cpu during resume, which is odd as
> those same page tables were freed during suspend. I guess an
> unfreezable kernel thread allocated that memory between the free and
> re-allocation.

Then how about this?

[barrios@barrios-laptop linux-2.6]$ git diff
diff --git a/kernel/power/suspend.c b/kernel/power/suspend.c
index fdd4263..01aa9b5 100644
--- a/kernel/power/suspend.c
+++ b/kernel/power/suspend.c
@@ -297,9 +297,11 @@ int enter_state(suspend_state_t state)
 		goto Finish;
 
 	pr_debug("PM: Entering %s sleep\n", pm_states[state]);
+	oom_killer_disable();
 	pm_restrict_gfp_mask();
 	error = suspend_devices_and_enter(state);
 	pm_restore_gfp_mask();
+	oom_killer_enable();
 
  Finish:
 	pr_debug("PM: Finishing wakeup.\n");
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 6e8ecb6..d8c31b7 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -2177,9 +2177,9 @@ rebalance:
 	 * running out of options and have to consider going OOM
 	 */
 	if (!did_some_progress) {
-		if ((gfp_mask & __GFP_FS) && !(gfp_mask & __GFP_NORETRY)) {
-			if (oom_killer_disabled)
+		if (oom_killer_disabled)
 				goto nopage;
+		if ((gfp_mask & __GFP_FS) && !(gfp_mask & __GFP_NORETRY)) {
 			page = __alloc_pages_may_oom(gfp_mask, order,
 					zonelist, high_zoneidx,
 					nodemask, preferred_zone,

-- 
Kind regards,
Minchan Kim
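
To make the intent of the mm/page_alloc.c hunk easier to see, here is a
minimal user-space sketch of the decision taken when reclaim made no
progress, before and after the change. It is only a model, not kernel code:
MODEL_GFP_FS, MODEL_GFP_NORETRY, enum slowpath_step and both functions are
hypothetical names made up for illustration, and the flag values are
arbitrary. STEP_RETRY_CHECK stands for leaving the block and reaching the
retry logic that, per Mel's description above, calls wait_iff_congested.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for __GFP_FS and __GFP_NORETRY; values are arbitrary. */
#define MODEL_GFP_FS      0x1u
#define MODEL_GFP_NORETRY 0x2u

/* Possible outcomes of the "reclaim made no progress" block (model only). */
enum slowpath_step {
	STEP_NOPAGE,       /* goto nopage: the allocation fails */
	STEP_TRY_OOM_KILL, /* __alloc_pages_may_oom() would be called */
	STEP_RETRY_CHECK   /* leave the block and reach the retry logic */
};

/* Before the patch: oom_killer_disabled is checked only inside the FS branch. */
enum slowpath_step no_progress_before(unsigned int gfp, bool oom_disabled)
{
	if ((gfp & MODEL_GFP_FS) && !(gfp & MODEL_GFP_NORETRY)) {
		if (oom_disabled)
			return STEP_NOPAGE;
		return STEP_TRY_OOM_KILL;
	}
	return STEP_RETRY_CHECK;
}

/* After the patch: oom_killer_disabled is checked first, for every gfp mask. */
enum slowpath_step no_progress_after(unsigned int gfp, bool oom_disabled)
{
	if (oom_disabled)
		return STEP_NOPAGE;
	if ((gfp & MODEL_GFP_FS) && !(gfp & MODEL_GFP_NORETRY))
		return STEP_TRY_OOM_KILL;
	return STEP_RETRY_CHECK;
}

/* Usage: a mask without the FS bit (as GFP_NOIO would be) changes behaviour. */
void model_demo(void)
{
	assert(no_progress_before(0u, true) == STEP_RETRY_CHECK);
	assert(no_progress_after(0u, true) == STEP_NOPAGE);
}
```

In this model the only case that changes is a mask without the FS bit while
the OOM killer is disabled: it used to reach the retry logic and now fails
immediately. With the OOM killer enabled, both versions behave the same.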