On 09/27, Liao Chang wrote: > > The uprobe handler allocates xol slot from xol_area and quickly release > it in the single-step handler. The atomic operations on the xol bitmap > and slot_count lead to expensive cache line bouncing between multiple > CPUs. Liao, could you please check if this series [PATCH 0/2] uprobes: kill xol_area->slot_count https://lore.kernel.org/all/20241001142416.GA13599@xxxxxxxxxx/ makes any difference performance-wise in your testing? Oleg.