On Tue, 19 Apr 2016 22:59:14 +0000 (UTC) Mathieu Desnoyers <mathieu.desnoyers@xxxxxxxxxxxx> wrote: > ----- On Apr 19, 2016, at 6:49 PM, rostedt rostedt@xxxxxxxxxxx wrote: > > > On Tue, 19 Apr 2016 21:22:21 +0000 (UTC) > > Mathieu Desnoyers <mathieu.desnoyers@xxxxxxxxxxxx> wrote: > > > >> It makes sense. Anyway, looking back at my own implementation, I have > >> an array of 64 hlist_head entries (64 * 8 = 512 bytes), typically > >> populated by NULL pointers. It's only a factor 8 smaller than the > >> bitmap, so it's not a huge gain. > > > > Actually we talked about a second level bitmap for quicker searchers. I > > can't remember what it was called, but I'm sure HPA can ;-) > > > > Basically it was a much smaller bitmap, where each bit represents a > > number of bits in the main bitmap. When a bit is set in the main > > bitmap, its corresponding bit is set in the smaller one. This means > > that if you don't have many PIDs, the smaller bitmap wont have many > > bits set either, and you keep all the checks very cache local, because > > you are checking the smaller bitmap most of the time. But this too > > makes things more complex, especially when clearing a bit (although, > > that only happens on exit, where speed isn't a big deal). But we > > decided it still wasn't worth it. > > Seems like an interesting possible improvement if ever needed. > One reason we decided against it is, if you think about use cases; if you are tracing a single task, and other tasks are created around it in the same time period, you will have pids of tasks running that are close to the pid of the traced task. That means the bit in the smaller array will most likely be always set, and now you are taking two cache hits to find the pid you are looking for, and not gaining anything out of it. -- Steve -- To unsubscribe from this list: send the line "unsubscribe linux-trace-users" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html