Re: [PATCH v6] mm: add zblock - new allocator for use via zpool API

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Nov 29, 2022 at 08:48:27AM +0100, Vitaly Wool wrote:
> On Mon, Nov 28, 2022 at 9:01 PM Johannes Weiner <hannes@xxxxxxxxxxx> wrote:
> >
> > On Fri, Nov 04, 2022 at 11:58:56AM +0300, ananda wrote:
> > > From: Ananda <a.badmaev@xxxxxxxxxxxx>
> > >
> > >     Zblock stores integer number of compressed objects per zblock block.
> >
> > What does that mean?
> 
> It's explained later in the patch but anyway, an example: let's create
> an object with 4 adjacent pages, a total of 16384 bytes. We can divide
> it into 43 subblocks of size 381, plus we'll have one byte that's not
> used. Subblocks will then be treated as an array.

Thanks, that makes sense. The 'integer' threw me off, I think.

Maybe 'stores a fixed number' would be a bit clearer?

> > > These blocks consist of several physical pages (1/2/4/8) and are arranged
> > > in linked lists.
> > >     The range from 0 to PAGE_SIZE is divided into the number of intervals
> > > corresponding to the number of lists and each list only operates objects
> > > of size from its interval. Thus the block lists are isolated from each
> > > other, which makes it possible to simultaneously perform actions with
> > > several objects from different lists.
> >
> > This was benchmarked not long ago in the context of zsmalloc, and it
> > didn't seem to matter too much in real world applications:
> >
> > https://lore.kernel.org/linux-mm/20221107213114.916231-1-nphamcs@xxxxxxxxx/
> 
> We basically reproduced this test and also ran it with zblock, and
> zblock performs better by 3.5% on a 8G ZRAM disk with btrfs and this
> difference is getting bigger with disk sizes getting bigger.
> I'm pretty sure that the difference will get even bigger over time
> because zsmalloc will run compaction more and more.

Very interesting results.

Do you know if the difference is owed fully to compaction, or is that
just the factor you expect to have the biggest scaling impact?

The numbers speak for themselves, I mostly ask out of curiosity.

> > Do you have situations where this matters?
> >
> > >     Blocks make it possible to densely arrange objects of various sizes
> > > resulting in low internal fragmentation. Also this allocator tries to fill
> > > incomplete blocks instead of adding new ones thus in many cases providing
> > > a compression ratio substantially higher than z3fold and zbud.
> >
> > How does it compare to zsmalloc?
> 
> That depends on the type of data being compressed, but typically
> zsmalloc is better by 5-10%.

Thanks. I think this would be great to include in the Kconfig help
text, to help users understand the tradeoff and choose accordingly.

> > >     Zblock does not require MMU and also is superior to zsmalloc with
> > > regard to the worst execution times, thus allowing for better response time
> > > and real-time characteristics of the whole system.
> >
> > zsmalloc has depends on MMU, but which parts actually require it? It
> > has its own handle indirection and can migrate objects around and
> > replace backing pages without any virtual memory tricks. There is the
> > kmap stuff of course, because it supports highmem backing pages, but
> > that isn't relevant on NOMMU either.
> >
> > Also can you please elaborate on the worst execution time?
> 
> I don't have the numbers at hand but zsmalloc (and z3fold, for that
> matter) do have high spikes when compaction kicks in, not to speak
> about longer disabled preemption.

Gotcha, makes sense.

> > My first impression is that this looks awfully close to zsmalloc, with
> > a couple fewer features and somewhat more static design choices. It's
> > in that sense reminiscent of the slob allocator, which we're in the
> > process of removing, because 3 slab allocators is a pain to
> > maintain. This would be the 4th zswap allocator, and it's not clear
> > that it's drastically outperforming or doing something that isn't
> > possible in one of the existing ones.
> 
> I don't think this comparison is on point, at least because zblock's
> code is at least 4x smaller than zsmalloc's, and the execution
> overhead is lower too. For lower performance devices, zblock is a real
> enabler, and there's a class of high performance devices where it can
> be the best fit too.

That's fair enough.

> I get your point about 4 zswap allocators though, and have no problem
> obsoleting z3fold as soon as we get zblock in.

Ok no objection from me then.

Not to sound greedy or anything, but do you see a chance it could
supplant zbud as well in the longer term?

We noticed in tests across various Meta workloads that 2:1 packing is
pretty much always too low. The compression algorithms are just better
than that for the majority of data. The allocation strategy is fast
and simple, yes, but it wastes too much space.

zblock looks like a more reasonable balance between simplicity,
performance, and acceptable space efficiency. If it performs in the
same ballpark as zbud, it would be great to ditch that too and make
life easier for both developers and the users having to pick one.




[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux