On Sat, Feb 22, 2025 at 11:24:56AM +0200, Lilith Gkini wrote: > On Sat, Feb 22, 2025 at 12:58:20PM +0900, Harry Yoo wrote: > > On Fri, Feb 21, 2025 at 04:57:24PM +0200, Lilith Gkini wrote: > > > On Thu, Feb 20, 2025 at 06:21:14PM +0900, Harry Yoo wrote: > > > > On Thu, Feb 20, 2025 at 05:20:00PM +0900, Harry Yoo wrote: > > > > > On Sat, Feb 15, 2025 at 06:57:01PM +0200, Lilitha Persefoni Gkini wrote: > > > > > > The condition `nr <= slab->objects` in the `on_freelist()` serves as > > > > > > bound while walking through the `freelist` linked list because we can't > > > > > > have more free objects than the maximum amount of objects in the slab. > > > > > > But the `=` can result in an extra unnecessary iteration. > > > > > > > > > > > > The patch changes it to `nr < slab->objects` to ensure it iterates > > > > > > at most `slab->objects` number of times. > > > > > > > > > > > > Signed-off-by: Lilitha Persefoni Gkini <lilithpgkini@xxxxxxxxx> > > > > > > --- > > > > > > mm/slub.c | 2 +- > > > > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > > > > > > > diff --git a/mm/slub.c b/mm/slub.c > > > > > > index 1f50129dcfb3..ad42450d4b0f 100644 > > > > > > --- a/mm/slub.c > > > > > > +++ b/mm/slub.c > > > > > > @@ -1435,7 +1435,7 @@ static int on_freelist(struct kmem_cache *s, struct slab *slab, void *search) > > > > > > int max_objects; > > > > > > > > > > > > fp = slab->freelist; > > > > > > - while (fp && nr <= slab->objects) { > > > > > > + while (fp && nr < slab->objects) { > > > > > > > > > > Hi, this makes sense to me. > > > > > > > > > > But based on what the name of the variable suggests (nr of objects), > > > > > I think it makes clearer to initialize it to 1 instead? > > > > > > > > Oh, actually iterating at most (slab->objects + 1) times allows it to catch > > > > cases where the freelist does not end with NULL (see how validate_slab() > > > > calls on_freelist(), passing search = NULL). > > > > > > > > It's very subtle. A comment like this would help: > > > > > > > > /* > > > > * Iterate at most slab->objects + 1 times to handle cases > > > > * where the freelist does not end with NULL. > > > > */ > > > > > > > > -- > > > > Cheers, > > > > Harry > > > > > > nr is the number of "free objects" in the freelist, so it should start > > > from zero for the case when there are no free objects. > > > > Hi Lilith, > > > > You're right. It was an oversight. > > > > > Oh, you think its on purpose to catch edgecases, like a defensive > > > programing sort of way? Huh, thats interesting! > > > > > > In that case it would prevent a case where every object in the slab is > > > freed and the tail of the freelist isnt NULL like it should be, maybe because > > > of some Out-Of-Bounds write from another object, or a Use-After-Free. > > > If that pointer is some gibberish then the chech_valid_pointer() check > > > on line 1441 will catch it, set it as NULL in line 1445 with > > > set_freepointer() and then break from the While and continue with the > > > rest of the program. nr will correctly remain as the number of freed > > > objects and the freelist will have a NULL in its tail, as it should! > > > > Yes, but corrupted freelist implies that the number of the free > > objects (nr) may be invalid? (if free pointer in the middle is > > corrupted). > > > > But that's another story... > > > > > But if the pointer isn't some random address and instead is an address in > > > the slab, lets say as an example the address of a free object in the > > > linked list (making the freelist cicrular) it wont get caught by the > > > check_valid_pointer() since technically it is a valid pointer, it will > > > increment nr to slab->objects + 1 and then exit the While loop because > > > it will fail the conditional nr <= slab->objects. > > > > > > Then later on, in line 1470 slab->objects - nr will be -1 which is not > > > equals to slab->inuse and enter the If case where it will set the > > > slab->inuse to -1, but because slab-inuse is an unsinged short it will > > > be stored as 0xFFFF, ie 65535, corrupting the slab struct with an > > > unreasonably large "inuse" value. > > > > While (slab->inuse + nr != slab->objects) will prevent overflow, > > I think either way is functional, because it prints error when there are > > more or less objects than it should have on the freelist. > > > > > You mentioned validate_slab(), I assume you are refering to how it > > > searches for NULL when it calls on_freelist() and if it does find NULL > > > in the freelist it will return 1 (basicaly TRUE). > > > > Yes. > > > > > In the example where every object is freed it will return TRUE > > > regardless if NULL is in the freelist or not, because on_freelist() > > > returns search == NULL if it doesnt find the search in the freelist. In > > > this case it would be NULL == NULL which is TRUE again. > > > This will have the same behavior even if we remove the equals sign from > > > the While, like the Patch suggests. > > > > Ok. that's actually a good point. > > > > But as validate_slab() expects to return false if there is no NULL > > in the freelist, I think we need to fix on_freelist() to support that? > > > > I'm not sure why on_freelist() returns (search == NULL). > > It has been there since the beginning of the SLUB allocator > > (commit 81819f0fc828). > > > > Since commit 53e15af03be4 ("slub: validation of slabs (metadata and guard > > zones)"), validate_slab() started passing NULL to on_freelist(). > > Looks like passing NULL to on_freelist() has never worked as intended... > > > > Can we return false in on_freelist(), if it could not find > > target object (search) in the loop? (need some testing to verify, > > though...) regardless of search is NULL or not? > > > > > I am still pretty new to this so I apologize for any mistakes. I > > > appreciate the feedback! > > > > Your questions are valid. > > > > > Is it ok to refer to lines of code, or should I copy paste the entire line? > > > > I prefer copy-and-paste because sometimes it's not obvious what commit > > is HEAD of your repository. > > > > > I understand that even small changes could have a huge effect to some > > > other function or subsystem in ways that might not be obvious to someone > > > not as familiar with the codebase. > > > > That's why we need to be as careful as possible and test the code ;-) > > > > > I hope I am not coming off to strong or anything. > > > > It's ok. I don't think so. > > > > -- > > Cheers, > > Harry > > Hi! > > > Since commit 53e15af03be4 ("slub: validation of slabs (metadata and guard > > zones)"), validate_slab() started passing NULL to on_freelist(). > > Looks like passing NULL to on_freelist() has never worked as intended... > > Haha, yeah, it would seem like. > > > Can we return false in on_freelist(), if it could not find > > target object (search) in the loop? (need some testing to verify, > > though...) regardless of search is NULL or not? > > You are right, I should write a test driver that uses on_freelist() and > analyze it through GDB with QEMU to see how it actually behaves when I > have the time, but just looking at it I am thiking that something like > `return fp == NULL` could replace the `search == NULL` and check if NULL is > at the end of the freelist or not, in addition to my original patch. On second thought (after reading your email), I think it should be (fp == NULL) && (search == NULL)? When fp is NULL after the loop, it means (the end of the freelist is NULL) AND (there were equal to or less than (slab->objects - nr) objects on the freelist). If the loop has ended after observing fp == NULL, on_freelist() should return true only when search == NULL. If fp != NULL, it means there were more objects than it should have on the free list. In that case, we can't take risk of iterating the loop forever and it's reasonable to assume that the freelist does not end with NULL. > The reason I m saying this is cause the While loop is looking through > every non-NULL freelist pointer for the "search" pattern. If someone > wants to search for NULL in the freelist that return 1 will never > trigger, even for normal freelists. If "fp" was ever null it wouldn't re > enter the loop. Thus the result of the function would be search == NULL > because the original writers assumed the freelist will always end with > NULL. Agreed. > As you correctly pointed out there might be a case where the freelist > is corrupted and it doesnt end in NULL. In that case two things could happen: > > 1) The check_valid_pointer() catches it and fixes it, restoring the > corrupted freelist. > > 2) The check_valid_pointer() fails to catch it and there isn't any NULL. > > > In the first case the problem fixes itself and thats probably why > validate_slab() worked fine all this time. > > The second case is pretty rare, but thats the case that validate_slab() > wants to rout out when it checks the `!on_freelist(s, slab, NULL)`, > right? Yes. > Also to make my added suggestion of `return fp == NULL` work in the first > case (since it does corrrect the freelist we want it to now return TRUE) > we could also add a line that nulls out the fp right after the > `set_freepointer(s, object, NULL);`. Why? fp = get_freepionter() will observe NULL anyway. > But words are cheap, I should test it out dynamically rather than just > reading the code to see how it behaves. Feel free to also test it > yourself. Yes, testing is important, and you should test to some extent before submitting a patch. > I know I am supposed to send a proper Patch with the multiple added > lines, but for now we are mostly brainstorming solutions. It will be > better to see its behavior first in a debugger I think. I think it's generally considered good practice to discuss before writing any code. > I hope I am making sense with the thought process I outlined for the > return thing. I probably shouldn't be writing emails ealry Saturday > morning, haha. > > Also I really appreciate the kind feedback! The Linux Kernel is infamously > known for how rude and unhinged people can be, which can make it a bit > stressful and intimidating sending any emails, especially for someone > starting out such as myself. You're welcome ;-) -- Cheers, Harry