On Fri, Jul 29, 2016 at 08:33:34AM -0700, Dave Hansen wrote:
> What's to stop you from using tracing to gather and transport data out
> of the kernel and then aggregate and present it to apps in an "elegant"
> way of your choosing?
>
> I don't think it's really even worth having an in-depth discussion of
> how to modify duet. I can't imagine that this would get merged as-is,
> or even anything resembling the current design. If you want to see
> duet-like functionality in the kernel, I think it needs to be integrated
> better and enhance or take advantage of existing mechanisms.
>
> You've identified a real problem and a real solution, and it is in an
> area where Linux is weak (monitoring the page cache). If you are really
> interested in seeing a solution that folks can use, I think you need to
> find some way to leverage existing kernel functionality (ftrace,
> fanotify, netlink, etc...), or come up with a much more compelling story
> about why you can't use them.

I took a few measurements of the ftrace overhead, and when tracing is
limited to the page cache functions we're interested in, it's very
reasonable. Duet does depend on exporting some data with each event,
though, and tracepoints seem to be the most efficient way to do this.
There are two issues:

(a) There are no tracepoints for page dirtying and flushing. Those
    would have to be added in the same place as the Duet hooks I
    submitted (unwrapping the page-flags.h macros), to catch the cases
    where pages are locked and the dirty bit is set manually.

(b) The page cache tracepoints are currently not exported symbols.

If I can export those four tracepoints (page addition, removal,
dirtying, and flushing), then the rest of the work (exporting the
information to userspace) can be carried out within a module.

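To illustrate the kind of userspace aggregation Dave describes, here is
a minimal sketch that counts net cached pages per file from ftrace text
output. It covers only the two page cache events that exist today
(mm_filemap_add_to_page_cache and mm_filemap_delete_from_page_cache).
The field layout in the regular expression is an assumption and should
be checked against the events' format files on the target kernel. The
function name aggregate() and the sample lines are made up for
illustration.

```python
import re
from collections import Counter

# Assumed layout of the filemap events in ftrace text output; verify it
# against the "format" files under the tracing events/filemap directory.
EVENT_RE = re.compile(
    r"(?P<event>mm_filemap_add_to_page_cache|mm_filemap_delete_from_page_cache):"
    r"\s+dev (?P<major>\d+):(?P<minor>\d+) ino (?P<ino>[0-9a-f]+)"
)


def aggregate(lines):
    """Map (major, minor, inode) to the net number of pages added."""
    cached = Counter()
    for line in lines:
        m = EVENT_RE.search(line)
        if m is None:
            continue  # not a page cache event, or an unexpected layout
        key = (int(m.group("major")), int(m.group("minor")),
               int(m.group("ino"), 16))
        if m.group("event") == "mm_filemap_add_to_page_cache":
            cached[key] += 1
        else:
            cached[key] -= 1
    return cached


# Made-up sample lines, for illustration only.
SAMPLE = [
    "cat-1234 [000] .... 100.000001: mm_filemap_add_to_page_cache: "
    "dev 8:1 ino 1a2b page=ffffea0000123400 pfn=4660 ofs=0",
    "cat-1234 [000] .... 100.000002: mm_filemap_add_to_page_cache: "
    "dev 8:1 ino 1a2b page=ffffea0000123440 pfn=4661 ofs=4096",
    "kswapd0-45 [001] .... 101.500000: mm_filemap_delete_from_page_cache: "
    "dev 8:1 ino 1a2b page=ffffea0000123400 pfn=4660 ofs=0",
    "cat-1234 [000] .... 102.000000: sched_switch: unrelated event",
]

if __name__ == "__main__":
    for (major, minor, ino), pages in aggregate(SAMPLE).items():
        print(f"dev {major}:{minor} ino {ino:#x}: {pages} page(s) cached")
```

Page dirtying and flushing are absent from this sketch because, as noted
in (a), there are no tracepoints for them yet.
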
In the future, once we reach a point of maturity where we are confident
about the stability of the exporting interface and performance, we could
engage in another conversation about potentially mainlining some of that
code.

Dave, I can produce a patch that adds the extra two tracepoints and
exports all four tracepoint symbols. This would be a short patch that
would just extend existing tracing functionality. What do you think?