Hi Masami, On 6/21/19 9:18 AM, Masami Hiramatsu wrote: > Hi, > > Here is an RFC series of patches to add boot-time tracing using > devicetree. > > Currently, kernel support boot-time tracing using kernel command-line > parameters. But that is very limited because of limited expressions > and limited length of command line. Recently, useful features like > histogram, synthetic events, etc. are being added to ftrace, but it is > clear that we can not expand command-line options to support these > features. "it is clear that we can not expand command-line options" needs a fuller explanation. And maybe further exploration. > > Hoever, I've found that there is a devicetree which can pass more > structured commands to kernel at boot time :) The devicetree is usually > used for dscribing hardware configuration, but I think we can expand it Devicetree is standardized and documented as hardware description. > for software configuration too (e.g. AOSP and OPTEE already introduced > firmware node.) Also, grub and qemu already supports loading devicetree, > so we can use it not only on embedded devices but also on x86 PC too. Devicetree is NOT for configuration information. This has been discussed over and over again in mail lists, at various conferences, and was also an entire session at plumbers a few years ago: https://elinux.org/Device_tree_future#Linux_Plumbers_2016_Device_Tree_Track There is one part of device tree that does allow non-hardware description, which is the "chosen" node which is provided to allow communication between the bootloader and the kernel. There clearly are many use cases for providing configuration information and other types of data to a booting kernel. I have been encouraging people to come up with an additional boot time communication channel or data object to support this use case. So far, no serious proposal that I am aware of. > > With the devicetree, we can setup new kprobe and synthetic events, more > complicated event filters and trigger actions including histogram. > > For example, following kernel parameters > > trace_options=sym-addr trace_event=initcall:* tp_printk trace_buf_size=1M > > it can be written in devicetree like below. > > ftrace { > compatible = "linux,ftrace"; > options = "sym-addr"; > events = "initcall:*"; > tp-printk; > buffer-size-kb = <0x400>; // 1024KB == 1MB > }; > > Moreover, now we can expand it to add filters for events, kprobe events, > and synthetic events with histogram like below. > > ftrace { > compatible = "linux,ftrace"; > ... > event0 { > event = "task:task_newtask"; > filter = "pid < 128"; // adding filters > enable; > }; > event1 { > event = "kprobes:vfs_read"; > probes = "vfs_read $arg1 $arg2"; // add kprobes > filter = "common_pid < 200"; > enable; > }; > event2 { > event = "initcall_latency"; // add synth event > fields = "unsigned long func", "u64 lat"; > // with histogram > actions = "hist:keys=func.sym,lat:vals=lat:sort=lat"; > }; > // and synthetic event callers > event3 { > event = "initcall:initcall_start"; > actions = "hist:keys=func:ts0=common_timestamp.usecs"; > }; > event4 { > event = "initcall:initcall_finish"; > actions = "hist:keys=func:lat=common_timestamp.usecs-$ts0:onmatch(initcall.initcall_start).initcall_latency(func,$lat)"; > }; > }; > > These complex configuration can not be done by kernel parameters. > However, this is not replacing boot-time tracing by kernel parameters. > This devicetree settings are applied in fs_initcall() stage, but kernel > parameters are applied earlier stage. Anyway, this is enough useful > for debugging/tracing kernel driver initializations. > > I would like to discuss on some points about this idea. > > - Can we use devicetree for configuring kernel dynamically? No. Sorry. My understanding of this proposal is that it is intended to better support boot time kernel and driver debugging. As an alternate implementation, could you compile the ftrace configuration information directly into a kernel data structure? It seems like it would not be very difficult to populate the data structure data via a few macros. -Frank > - Would you have any comment for the devicetree format and default > behaviors? > - Currently, kprobe and synthetic events are defined inside event > node, but it is able to define globally in ftrace node. Which is > better? > - Do we need to support "status" property on each event node so > that someone can prepare "dtsi" include file and override the status? > - Do we need instance-wide pid filter (set_event_pid) when boot-time? > - Do we need more structured tree, like spliting event and group, > event actions and probes to be a tree of node, etc? > - Do we need per group filter & enablement support? > - How to support instances? (nested tree or different tree?) > - What kind of options would we need? > > Some kernel parameters are not implemented yet, like ftrace_filter, > ftrace_notrace, etc. These will be implemented afterwards. > > Thank you, > > --- > > Masami Hiramatsu (11): > tracing: Apply soft-disabled and filter to tracepoints printk > tracing: kprobes: Output kprobe event to printk buffer > tracing: Expose EXPORT_SYMBOL_GPL symbol > tracing: kprobes: Register to dynevent earlier stage > tracing: Accept different type for synthetic event fields > tracing: Add NULL trace-array check in print_synth_event() > dt-bindings: tracing: Add ftrace binding document > tracing: of: Add setup tracing by devicetree support > tracing: of: Add trace event settings > tracing: of: Add kprobe event support > tracing: of: Add synthetic event support > > > .../devicetree/bindings/tracing/ftrace.yaml | 170 +++++++++++ > include/linux/trace_events.h | 1 > kernel/trace/Kconfig | 10 + > kernel/trace/Makefile | 1 > kernel/trace/trace.c | 49 ++- > kernel/trace/trace_events.c | 3 > kernel/trace/trace_events_hist.c | 14 + > kernel/trace/trace_events_trigger.c | 2 > kernel/trace/trace_kprobe.c | 81 +++-- > kernel/trace/trace_of.c | 311 ++++++++++++++++++++ > 10 files changed, 589 insertions(+), 53 deletions(-) > create mode 100644 Documentation/devicetree/bindings/tracing/ftrace.yaml > create mode 100644 kernel/trace/trace_of.c > > -- > Masami Hiramatsu (Linaro) <mhiramat@xxxxxxxxxx> >