On Mon, Aug 05, 2024 at 12:38:29PM +0200, Vlastimil Babka wrote: > What about module unload/reload with a SLAB_TYPESAFE_BY_RCU cache that will > delay its freeing. Soon also if there are kfree_rcu()'s in flight. And the > zombie cache can stay also permamently around if it fails to be destroy > because some objects were not freed. > It should be an invariant that the cache is fully whacked by the time kmem_cache_destroy returns, at worst with the exception of when leaked items are encountered (but even then it should be renamed to something indicating it is defunct). Suppose a cache could not have been destroyed and was left as is, then the offending module was loaded again -- now you got two with the same name, which is not that great either. I find myself quite surprised that kmem_cache_destroy can return even if cache destruction is still pending. This was added in 657dc2f97220 ("slab: remove synchronous rcu_barrier() call in memcg cache release path"), citing batching benefits for frequent kmem cache creation/destruction. I believe the very notion of doing that *frequently* is b0rked and any code doing it should be patched to stop regardless. Even so, if there are any benefits to the committed patch, it perhaps can be augmented so that the kmem_cache_destroy caller can wait for the entire thing to finish (e.g., with a completion).