On Mon, 8 Nov 2021 12:25:27 -0800 Beau Belgrave <beaub@xxxxxxxxxxxxxxxxxxx> wrote: > On Mon, Nov 08, 2021 at 01:16:39PM -0500, Steven Rostedt wrote: > > On Mon, 8 Nov 2021 09:13:36 -0800 > > Beau Belgrave <beaub@xxxxxxxxxxxxxxxxxxx> wrote: > > > > > > > Does that mean the decoders in eprobes/histogram don't check event > > > record sizes before accessing the data? Shouldn't that get fix > > > centrally? That would mean a loaded module could do the same thing > > > (user_events only works if the user has access to tracefs, so it's not > > > like it's open to all users). > > > > There's checks to make sure everything fits in eprobes and kprobes. If it > > doesn't then the event is simply dropped. > > > > For example, if you look at __eprobe_trace_func() in trace_eprobe.c, you'll > > see that it calls get_eprobe_size(), which goes through and just reads what > > it is about to accept. Then it reserves the amount of data on the ring > > buffer, and then calls store_trace_args() which also passes in the size > > that it found, in case things change. If it's too big, it only records what > > it originally intended. > > > > -- Steve > > It seems there are 2 concerns: > 1. If data comes in and it's not in the size that is specified, it's > suspicious and should either be truncated or ignored. Maybe under > ignore, over truncate. Yes, this is for the events which is defined with fixed-size parameters and what I suggested. > > 2. If the data is more than specified, it must be checked to see if > there are __data_loc / __rel_loc entries and they must be validated as > within range of accepted limits. If there are no __data_loc / __rel_loc > it should either be truncated or ignored. Yes, this is for the events, which is defined with variable length parameters, like null-terminated string. In this case, along with the __data/__rel_loc validation, it needs a null termination check. > > Is there more that I may have missed? > > I'd like to know if I do fix them that the features like filtering will still > be available to user_events or if it's better to just add flags to disable > kernel filtering? I would rather like that the filters will be available on the user_events. My question is that you need to log the dynamic data or strings via user- events or not. Since the other user-events, like SDT doesn't support the string variables to trace, I guess that is not a high priority. Moreover, since now we can use eprobes, if user event records the address of user-string, the eprobes can fetch it. So, my suggestion is implmenting with fixed-size parameters as the first step and keep filter/histograms/eprobes available on the user-events. If you find any performance issue, you can expand the user-events to support dynamic (array) data and strings. > > I'm still unsure this is limited to just user_events. > > For example, why doesn't filter_pred_strloc and filter_pred_pchar in > trace_events_filter.c check the boundary it will be accessing? Because all data is written from the kernel code. We can trust the data exists on the buffer. (If not, there is an actual BUG in the kenrel) We can add a verifieer for the debug purpose. > > It seems like tracepoints from kernel modules, while more trusted, can also > cause this kind of thing due to bugs, etc. Of course, and that must be fixed. And, the tracepoints (traceevents) should be automatically generated code, that is more trusted than the events crafted by user. Thank you, -- Masami Hiramatsu <mhiramat@xxxxxxxxxx>