On Fri, 12 Apr 2019, Vlastimil Babka wrote: > On 4/12/19 9:14 AM, James Bottomley wrote: > >> In the session I hope to resolve the question whether this is indeed > >> the right thing to do for all kmalloc() users, without an explicit > >> alignment requests, and if it's worth the potentially worse > >> performance/fragmentation it would impose on a hypothetical new slab > >> implementation for which it wouldn't be optimal to split power-of-two > >> sized pages into power-of-two-sized objects (or whether there are any > >> other downsides). > > > > I think so. The question is how aligned? explicit flushing arch's > > definitely need at least cache line alignment when using kmalloc for > > I/O and if allocations cross cache lines they have serious coherency > > problems. The question of how much more aligned than this is > > interesting ... I've got to say that the power of two allocator implies > > same alignment as size and we seem to keep growing use cases that > > assume this. Well that can be controlled on a per arch level through KMALLOC_MIN_ALIGN already. There are architectues that align to cache line boundaries. However you sometimes have hardware with ridiculous large cache line length configurations like VSMP with 4k. > Right, by "natural alignment" I meant exactly that - align to size for > power-of-two sizes. Well for which sizes? Double word till PAGE_SIZE? This gets us into weird and difficult to comprehend rules for how objects are aligned. Or do we start on the cache line size to provide cacheline alignment and do word alignment before? Consistency is important I think and if you want something different then you need to say so in one way or another. > > I'm not so keen on growing a separate API unless there's > > a really useful mm efficiency in breaking the kmalloc alignment > > assumptions. > > I'd argue there's not.