Re: [PATCH 02/12] trace-cmd: Add logic for TSC to nanosecond conversion

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, Mar 17, 2021 at 11:49 PM Steven Rostedt <rostedt@xxxxxxxxxxx> wrote:
>
> On Wed, 17 Mar 2021 11:57:31 +0200
> Tzvetomir Stoyanov <tz.stoyanov@xxxxxxxxx> wrote:
>
> > > I would break this up into two patches. One that adds the perf mult shift
> > > logic, and even allow users to use it! As TSC clocks are still faster than
> > > the local clock (as the local clock needs to do the multiplier and shift at
> > > every event while recording). It would be a feature to have this:
> > >
> > >  trace-cmd record --tsc2nsec ...
> >
> > As the tracing clock is per instance, does it mean this flag should
> > also be per instance ? I think the below commands should be valid:
> >    trace-cmd record --tsc2nsec -B foo -C mono  <-- use tsc-x86 clock +
> > tsc2nsec for the top instance and mono clock for instance foo
> >    trace-cmd record -C mono -B foo --tsc2nsec  <-- use mono clock for
> > the top instance and tsc-x86 clock + tsc2nsec clock for instance foo
> > or even more confusing:
> >    trace-cmd record -C x86-tsc -B foo --tsc2nsec  <-- use tsc-x86
> > clock for the top instance and tsc-x86 clock + tsc2nsec clock for
> > instance foo
>
> I just remembered that I previously discussed making the "tsc2nsec" into a
> clock type from the perspective of the user (not the trace file). That is,
> not to have --tsc2nsec, but instead to have:
>
>  trace-cmd record -C tsc2nsec
>
> And see if there's a tsc clock and the perf interface is available with the
> multiplier and shift, and if not, it would return an error just like if you
> said "-C foo". I also remember saying that we should have trace-cmd list
> show it as well if it is available.
>
>  # trace-cmd list -C
> [local] global counter uptime perf mono mono_raw boot x86-tsc tsc2nsec
>
> That is, tack on at the end " tsc2nsec" if we find that it is available.
>
> > This will require changes in the trace.dat file format. The problem is
> > that tsc2nsec multiplier, shift and offset are stored as an option in
> > the trace.dat file. Options are global, not per instance. When reading
> > the file, how to determine events in which instance should be affected
> > by this option ? We should either add options per instance, or somehow
> > encode the instance scope of each option.
>
> Your right. And I just tried this:
>
>  # trace-cmd record -C x86-tsc -e sched -B foo -C local -e sched sleep 1
>
> And it appears to ignore the -C local. Which I think is the right thing to
> do ;-)

According to the code, trace-cmd supports clock per instance. And it
works, I tried this:

#trace-cmd record -C x86-tsc -e irq -B foo -C local -e sched sleep 1

and got different clocks:
             <...>-290676 [006]1956659520918: softirq_exit:        ....
foo:       zoom-10894 [004]112198623756671: sched_switch:  ....

although I cannot imagine a use case for this. It will be weird to
visualise this by KerneShark.

I think that we should introduce options per instance now,  as "-C
tsc2nsec" cannot be implemented without it.
I see two possible ways to implement it:
 1. Break the current trace.dat file format by adding metadata for
each instance.
 2. Keep the current format and use nested options -  that's it, just
another option "OPTION_INSTANCE". It will hold instance name and all
options, specific for that instance, using the current options format.

But there is one question, valid for both ways: What should be the
scope for the global options ? Apply them only on the top instance, or
they should be common for all instances ?

>
> Yeah, we should only support one clock for all instances. And keep this as
> a global option. We may in the future have instance options, but that will
> come as a separate "option" :-)
>
>
> > > > +static void set_vsync_clock(void)
> > > > +{
> > > > +     const char *clock = top_instance.clock;
> > > > +     struct buffer_instance *instance;
> > > > +     bool tsc2nsec = false;
> > > > +     int shift, mult, offset;
> > > > +
> > > > +     /*
> > > > +      * If no clock is specified for the top trace instance AND
> > > > +      * KVM time sync protocol is available AND
> > > > +      * TSC to nsec multiplier and shift are available:
> > > > +      * force using the x86-tsc clock for this host-guest tracing session
> > > > +      * and store TSC to nsec multiplier and shift.
> > > > +      */
> > > > +     if (!clock && tsync_proto_is_supported("kvm") &&
> > > > +         !get_tsc_nsec(&shift, &mult, &offset) && mult) {
> > > > +             top_instance.clock = strdup(TSC_CLOCK);
> > > > +             if (!top_instance.clock)
> > > > +                     die("Could not allocate top instance clock");
> > > > +             clock = top_instance.clock;
> > >
> > > Why is this using the top_instance? What if the user had done:
> > >
> > >  # trace-cmd record -B host -e kvm -e sched -A guest -e sched
> >
> > You are right, that command is valid. The problem is that guest
> > timestamps can be synchronized with one clock only. If there are two
> > instances on the host, each with a different clock, the
> > synchronization will not work. That's why I look at the top instance
> > clock only, which is wrong. I'll change that logic to take the first
> > configured host trace clock, i.e. in case of multiple host instances
> > with multiple trace clocks, the first wins.
> >
>
>
> The code should just use the first instance available. Not about being
> global, but because I want trace-cmd to not interfere with the top instance
> if it is not specified on the command line. That way we can have this:
>
>  # trace-cmd record -e all
>
> in one window, and in another window:
>
>  # trace-cmd record -B guest -e kvm -A guest -e all
>
> and that recorder not do anything to bother the first one. That is, we
> should only touch the top instance if it is explicitly specified.
>
>
> -- Steve



-- 
Tzvetomir (Ceco) Stoyanov
VMware Open Source Technology Center



[Index of Archives]     [Linux USB Development]     [Linux USB Development]     [Linux Audio Users]     [Yosemite Hiking]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux