On Thu, Aug 29, 2024 at 07:55:08AM -0400, Kent Overstreet wrote: > Ergo, if you're not absolutely sure that a GFP_NOFAIL use is safe > according to call path and allocation size, you still need to be > checking for failure - in the same way that you shouldn't be using > BUG_ON() if you cannot prove that the condition won't occur in real wold > usage. We've been using __GFP_NOFAIL semantics in XFS heavily for 30 years now. This was the default Irix kernel allocator behaviour (it had a forwards progress guarantee and would never fail allocation unless told it could do so). We've been using the same "guaranteed not to fail" semantics on Linux since the original port started 25 years ago via open-coded loops. IOWs, __GFP_NOFAIL semantics have been production tested for a couple of decades on Linux via XFS, and nobody here can argue that XFS is unreliable or crashes in low memory scenarios. __GFP_NOFAIL as it is used by XFS is reliable and lives up to the "will not fail" guarantee that it is supposed to have. Fundamentally, __GFP_NOFAIL came about to replace the callers doing do { p = kmalloc(size); while (!p); so that they blocked until memory allocation succeeded. The call sites do not check for failure, because -failure never occurs-. The MM devs want to have visibility of these allocations - they may not like them, but having __GFP_NOFAIL means it's trivial to audit all the allocations that use these semantics. IOWs, __GFP_NOFAIL was created with an explicit guarantee that it -will not fail- for normal allocation contexts so it could replace all the open-coded will-not-fail allocation loops.. Given this guarantee, we recently removed these historic allocation wrapper loops from XFS, and replaced them with __GFP_NOFAIL at the allocation call sites. There's nearly a hundred memory allocation locations in XFS that are tagged with __GFP_NOFAIL. If we're now going to have the "will not fail" guarantee taken away from __GFP_NOFAIL, then we cannot use __GFP_NOFAIL in XFS. Nor can it be used anywhere else that a "will not fail" guarantee it required. Put simply: __GFP_NOFAIL will be rendered completely useless if it can fail due to external scoped memory allocation contexts. This will force us to revert all __GFP_NOFAIL allocations back to open-coded will-not-fail loops. This is not a step forwards for anyone. -Dave. -- Dave Chinner david@xxxxxxxxxxxxx