> From: Dave Chinner [mailto:david@xxxxxxxxxxxxx] > Subject: Re: [PATCH 7/8] zswap: add to mm/ > > <much useful info from Dave deleted> OK, I have suitably proven how little I know about slab and have received some needed education from your response... Thanks for that Dave. So let me ask some questions instead of making stupid assumptions. > Thinking that there is a fixed amount of memory that you should > reserve for some subsystem is simply the wrong approach to take. > caches are dynamic and the correct system balance should result of > the natural behaviour of the reclaim algorithms. > > The shrinker infrastructure doesn't set any set size goals - it > simply tries to balance the reclaim across all the shrinkers and > relative to the page cache... First, it's important to note that zcache/zswap is not really a subsystem. It's simply a way of increasing the number of anonymous pages (zswap and zcache) and pagecache pages (zcache only) in RAM by using compression. Because compressed pages can't be byte-addressed directly, pages enter zcache/zswap through a "transformation" process I've likened to a Fourier transform: In their compressed state, they must be managed differently than normal whole pages. Compressed anonymous pages must transition back to uncompressed before they can be used. Compressed pagecache pages (zcache only) can be either uncompressed when needed or gratuitously discarded (eventually) when not needed. So I've been proceeding with the assumption that it is the sum of wholepages used by both compressed-anonymous pages and uncompressed-anonymous pages that must be managed/balanced, and that this sum should be managed similarly to the non-zxxxx case of the total number of anonymous pages in the system (and similarly for compressed+uncompressed pagecache pages). Are you suggesting that slab can/should be used instead? > And so the two subsystems need different reclaim implementations. > And, well, that's exactly what we have shrinkers for - implmenting > subsystem specific reclaim policy. The shrinker infrastructure is > responsible for them keeping balance between all the caches that > have shrinkers and the size of the page cache... Given the above, do you think either compressed-anonymous-pages or compressed-pagecache-pages are suitable candidates for the shrinker infrastructure? Note that compressed anonymous pages are always dirty so cannot be "reclaimed" as such. But the mechanism that Seth and I are working on causes compressed anonymous pages to be decompressed and then sent to backing store, which does (eventually, after I/O latency) free up pageframes. Currently zcache does use the shrinker API for reclaiming pageframes-used-for-compressed-pagecache-pages. Since these _are_ a form of pagecache pages, is the shrinker suitable? > There are also cases where we've moved metadata caches out of the > page cache into shrinker controlled caches because the page cache > reclaim is too simplistic to handle the complex relationships > between filesystem metadata. We've done this in XFS, and IIRC btrfs > did this recently as well... So although the objects in zswap/zcache are less than one page, they are still "data" not "metadata", true? In your opinion, then, should they be managed by core MM, or by shrinker-controlled caches, by some combination, or independently of either? > > In any case, I would posit that both the nature of zpages and their > > average size relative to a whole page is quite unusual compared to slab. > > Doesn't sound at all unusual. I think I've addressed the different "nature" above, so let me ask about the size... Can slab today suitably manage "larger" objects that exceed half-PAGESIZE? Or "larger" objects, such as 35%-PAGESIZE where there would be a great deal of fragmentation? If so, we should definitely consider slab as an alternative for zpage allocation. > > So while there may be some useful comparisons between zswap > > and slab, the differences may warrant dramatically different policy. > > There may be differences, but it doesn't sound like there's anything > you can't implment with an appropriate shrinker implmentation.... Depending on your answers above, we may definitely need to consider that as well! Thanks, Dan -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href