Chris,

I tested this patch. It is still not enough; I keep running out of lmem.
Each worker invocation has a larger and larger number of objects to free.

Here is my debugging code:

+static int obj_count = 0;
+	......
+	if (llist_add(&obj->freed, &i915->mm.free_list)) {
+		bool b;
+		b = queue_work(i915->wq, &i915->mm.free_work);
+		pr_err("queue_work: %d, %d; %d\n", atomic_read(&i915->mm.free_count), obj_count, b);
+		obj_count = 1;
+	} else {
+		obj_count++;
+	}

And here is the output:

[ 821.213680] queue_work: 108068, 105764; 1
[ 823.309468] queue_work: 148843, 147948; 1
[ 826.453132] queue_work: 220000, 218123; 1
[ 831.522506] queue_work: 334812, 333773; 1
[ 840.040571] queue_work: 539650, 538922; 1
[ 860.337644] queue_work: 960811, 1017158; 1

The second number, 'obj_count', is the number of objects the last worker
invocation took to free.

--CQ

> -----Original Message-----
> From: Chris Wilson <chris@xxxxxxxxxxxxxxxxxx>
> Sent: Tuesday, October 13, 2020 3:33 AM
> To: intel-gfx@xxxxxxxxxxxxxxxxxxxxx
> Cc: Chris Wilson <chris@xxxxxxxxxxxxxxxxxx>; Tang, CQ <cq.tang@xxxxxxxxx>
> Subject: [PATCH] drm/i915: Make the GEM reclaim workqueue high priority
>
> Since removing dev->struct_mutex usage, we only use i915->wq for batch
> freeing of GEM objects and ppGTT, it is essential for memory reclaim. If we
> let the workqueue dawdle, we trap excess amounts of memory, so give it a
> priority boost. Although since we no longer depend on a singular mutex, we
> could run unbounded, but first lets try to keep some constraint upon the
> worker.
>
> Signed-off-by: Chris Wilson <chris@xxxxxxxxxxxxxxxxxx>
> Cc: CQ Tang <cq.tang@xxxxxxxxx>
> ---
>  drivers/gpu/drm/i915/i915_drv.c | 16 +++-------------
>  1 file changed, 3 insertions(+), 13 deletions(-)
>
> diff --git a/drivers/gpu/drm/i915/i915_drv.c b/drivers/gpu/drm/i915/i915_drv.c
> index 8bb7e2dcfaaa..8c9198f0d2ad 100644
> --- a/drivers/gpu/drm/i915/i915_drv.c
> +++ b/drivers/gpu/drm/i915/i915_drv.c
> @@ -219,20 +219,10 @@ intel_teardown_mchbar(struct drm_i915_private *dev_priv)
>  static int i915_workqueues_init(struct drm_i915_private *dev_priv)
>  {
>  	/*
> -	 * The i915 workqueue is primarily used for batched retirement of
> -	 * requests (and thus managing bo) once the task has been completed
> -	 * by the GPU. i915_retire_requests() is called directly when we
> -	 * need high-priority retirement, such as waiting for an explicit
> -	 * bo.
> -	 *
> -	 * It is also used for periodic low-priority events, such as
> -	 * idle-timers and recording error state.
> -	 *
> -	 * All tasks on the workqueue are expected to acquire the dev mutex
> -	 * so there is no point in running more than one instance of the
> -	 * workqueue at any time. Use an ordered one.
> +	 * The i915 workqueue is primarily used for batched freeing of
> +	 * GEM objects and ppGTT, and is essential for memory reclaim.
>  	 */
> -	dev_priv->wq = alloc_ordered_workqueue("i915", 0);
> +	dev_priv->wq = alloc_ordered_workqueue("i915", WQ_HIGHPRI);
>  	if (dev_priv->wq == NULL)
>  		goto out_err;
>
> --
> 2.20.1

_______________________________________________
Intel-gfx mailing list
Intel-gfx@xxxxxxxxxxxxxxxxxxxxx
https://lists.freedesktop.org/mailman/listinfo/intel-gfx