Re: [RFC 3/3] drm/i915: Export engine busy stats in debugfs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 





On 5/10/2017 1:30 AM, Tvrtko Ursulin wrote:

On 09/05/2017 19:17, Dmitry Rogozhkin wrote:
On 5/9/2017 7:09 AM, Tvrtko Ursulin wrote:

[snip]

+static ssize_t i915_engine_stats_read(struct file *file, char __user
*ubuf,
+                      size_t count, loff_t *pos)
+{
+    struct i915_engine_stats_buf *buf =
+        (struct i915_engine_stats_buf *)file->private_data;
+
+    if (*pos == 0) {
+        struct drm_i915_private *dev_priv = file->f_inode->i_private;
+        char *ptr = &buf->buf[0];
+        int left = buf->len;
+        struct intel_engine_cs *engine;
+        enum intel_engine_id id;
+
+        buf->available = 0;
+
+        for_each_engine(engine, dev_priv, id) {
+            u64 total;
+            int len;
+
+            spin_lock_irq(&engine->stats.lock);
+            total = engine->stats.total;
+            /*
+             * If the engine is executing something at the moment
+             * add it to the total.
+             */
+            if (engine->stats.ref)
+                total += ktime_get_real_ns() -
+                     engine->stats.start;
+            spin_unlock_irq(&engine->stats.lock);
+
+            len = snprintf(ptr, left, "%s: %llu\n",
+                       engine->name, total);
If I caught it right, file format is:
  render ring: 12345
  bsd ring: 12345
  ...

Yes almost, just that the engine names have been changed to likes of rcs0, vcs0, vcs1, vecs0 and bcs0 in the meantime.

where numbers are busy clocks (ns) from the system boot time. Is that

Nanoseconds, but not since boot time but since the last time tracking got enabled.

From my perspective that's bad: clocks from the boot time is more natural metric. And with it you will be able to definitely know what you did with GPU on the boot time. For certain customers like on Android and other embedded devices this is critically important. Just recently we worked in one of our project on the boot time optimization. Thus, I would recommend to have this metric permanently available.

Now, if we will still fall to the clocks from the some moment in time, I do not like "the last time tracking got enabled" approach. You did not count on the few consumers of the metric? Why? What if there are few independent clients requesting the access to the metric in parallel? If you track from the moment when last client requested an access, then you will damage data for the clients already having access.


Because the most important thing in this version, from the point of view of overhead in interrupt tasklet, is that the tracking is not done unless somebody is listening (has the file open).

As I wrote in the cover letter and the 2nd patch, when nobody has the file open the only thing which exists in the interrupt tasklets are three no-nop instructions. They only get patched to jumps (to sections actually collecting the stats) for as long as someone has the file open.

right? What if we will want to expose some other statistics information
later, not only busy clocks? For example, engines i915 queues depths is
a next interest. Maybe later we will find something else interesting.
So, do we want to consider this file to contain all kind of statistics
in the future, and hence it should be of somewhat different format, or
it will have only busy clocks, and maybe we need other file name then?

It can be either of the two, or some third option. It sounds like it is too early to discuss those level of detail. At this point it was an RFC only to gather some opinions on the overall idea.
Yep, agree. Just something to remember going forward...

Regards,

Tvrtko

_______________________________________________
Intel-gfx mailing list
Intel-gfx@xxxxxxxxxxxxxxxxxxxxx
https://lists.freedesktop.org/mailman/listinfo/intel-gfx




[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux