Re: [PATCH v3] mm: compaction: handle incorrect Unmovable type pageblocks

Mel Gorman <mgorman@xxxxxxx> · Thu, 26 Apr 2012 17:47:13 +0100

On Thu, Apr 26, 2012 at 11:53:47AM -0400, Rik van Riel wrote:
> On 04/26/2012 10:36 AM, Mel Gorman wrote:
> 
> >Hmm, at what point does COMPACT_ASYNC_FULL get used? I see it gets
> >used for the proc interface but it's not used via the page allocator at
> >all.
> 
> He is using COMPACT_SYNC for the proc interface, and
> COMPACT_ASYNC_FULL from kswapd.
> 

Ah, yes, of course. My bad.

Even that is not particularly satisfactory though as it's depending on
kswapd to do the work so it's a bit of a race to see if kswapd completes
the job before the page allocator needs it.

> >Minimally I was expecting to see if being used from the page allocator.
> 
> Makes sense, especially if we get the CPU overhead
> saving stuff that we talked about at LSF to work :)
> 

True.

> >A better option might be to track the number of MIGRATE_UNMOVABLE blocks that
> >were skipped over during COMPACT_ASYNC_PARTIAL and if it was a high
> >percentage and it looked like compaction failed then to retry with
> >COMPACT_ASYNC_FULL. If you took this option, try_to_compact_pages()
> >would still only take sync as a parameter and keep the decision within
> >compaction.c
> 
> This I don't get.
> 
> If we have a small number of MIGRATE_UNMOVABLE blocks,
> is it worth skipping over them?
> 

We do not know in advance how many MIGRATE_UNMOVABLE blocks are going to
be encountered. Even if we kept track of the number of MIGRATE_UNMOVABLE
pageblocks in the zone, it would not tell us how many pageblocks the
scanner will see.

> If we have really large number of MIGRATE_UNMOVABLE blocks,
> did we let things get out of hand?  By giving the page
> allocator this many unmovable blocks to choose from, we
> could have ended up with actually non-compactable memory.
> 

If there are a large number of MIGRATE_UNMOVABLE blocks, each with a single
unmovable page at the end of the block then the worst case situation
is that the second pass (COMPACT_ASYNC_PARTIAL being the first pass)
is useless and slow due to the scanning within MIGRATE_UNMOVABLE blocks.

When this situation occurs, I would also expect that the third pass
(COMPACT_SYNC) will also fail and then compaction will get deferred to
limit further damage.

In the average case, I would expect the large number of
MIGRATE_UNMOVABLE blocks to also be partially populated which means that
scans of these blocks will also be partial limiting the amount of
scanning we do. How much this is limited is impossible to estimate as
it's dependant on the workload.

> If we have a medium number of MIGRATE_UNMOVABLE blocks,
> is it worth doing a restart and scanning all the movable
> blocks again?
> 

This goes back to the same problem of we do not know how many
MIGRATE_UNMOVABLE pageblocks are going to be encountered in advance However,
I see your point.

Instead of COMPACT_ASYNC_PARTIAL and COMPACT_ASYNC_FULL should we have
COMPACT_ASYNC_MOVABLE and COMPACT_ASYNC_UNMOVABLE? The first pass from
the page allocator (COMPACT_ASYNC_MOVABLE) would only consider MOVABLE
blocks as migration targets. The second pass (COMPACT_ASYNC_UNMOVABLE)
would examine UNMOVABLE blocks, rescue them and use what blocks it
rescues as migration targets. The third pass (COMPACT_SYNC) would work
as it does currently. kswapd would only ever use COMPACT_ASYNC_MOVABLE.

That would avoid rescanning the movable blocks uselessly on the second
pass but should still work for Bartlomiej's workload.

What do you think?

> In other words, could it be better to always try to
> rescue the unmovable blocks?

I do not think we should always scan within unmovable blocks on the
first pass. I strongly suspect it would lead to excessive amounts of CPU
time spent in mm/compaction.c.

-- 
Mel Gorman
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>