On Mon, 19 Jun 2023 11:20:06 +0200 Christian König <christian.koenig@xxxxxxx> wrote: > Hi guys, > > Am 19.06.23 um 10:59 schrieb Thomas Hellström (Intel): > > [SNIP] > >>>> > >>>> I really need to find some time to work on that anyway. > >> I've been playing with drm_exec for a couple weeks now, and I wanted > >> to share something I hacked to try and make the API simpler and > >> more robust against misuse (see the below diff, which is a slightly > >> adjusted version of your work). > > > > It would be good if we could have someone taking charge of this series > > and address all review comments, I see some of my comments getting > > lost, we have multiple submitters and I can't find a dri-devel > > patchwork entry for this. Anyway some comments below. > > I can try to find some time for the series this week (As long as nobody > comes along and has any burning roof). That's great news! > > > > >> > >> In this version, the user is no longer in control of the retry > >> loop. Instead, it provides an expression (a call to a > >> sub-function) to be re-evaluated each time a contention is > >> detected. IMHO, this makes the 'prepare-objs' functions easier to > >> apprehend, and avoids any mistake like calling > >> drm_exec_continue_on_contention() in an inner loop, or breaking > >> out of the drm_exec_while_all_locked() loop unintentionally. > > > > In i915 we've had a very similar helper to this, and while I agree > > this newer version would probably help make code cleaner, but OTOH > > there also are some places where the short drm_exec_while_all_locked() > > -likeblock don't really motivate a separate function. Porting i915 to > > the current version will take some work, For the xe driver both > > versions would work fine. > > Yeah, this is actually what my first version of this looked like. But I > abandoned that approach because we have a lot of cases were we just > quickly want to lock a few GEM objects and don't want the extra overhead > of putting all the state into some bag to forward it to a function. If you're talking about verbosity, it might be the case, though I guess it mostly a matter of taste (I do like when things are well isolated). As for runtime overhead, I'd expect the compiler to inline the function anyway, so it's unlikely to change anything. > >> +/* Track the locked object in the array */ > >> +static int drm_exec_obj_locked(struct drm_exec *exec, > >> + struct drm_gem_object *obj) > >> +{ > >> + if (unlikely(exec->num_objects == exec->max_objects)) { > >> + size_t size = exec->max_objects * sizeof(void *); > >> + void *tmp; > >> + > >> + tmp = kvrealloc(exec->objects, size, size + PAGE_SIZE, > >> + GFP_KERNEL); > >> + if (!tmp) > >> + return -ENOMEM; > > > > Sometimes you need to just temporarily lock an object and then unlock > > it again if it goes out of scope before reaching the end of > > _until_all_locked(). In that case you might need to remove a lock from > > the array. I *think* for all use-cases in i915 it would suffice to > > take a snapshot of num_objects, and unlock everything above that, > > having exec->objects behave like a stack, but was ever a list > > considered instead of a realloced array? > > Yes, the problem is that linked lists really suck regarding their cache > line locality. That's why I've came up with this approach here. Hm, maybe I'm missing something, but if you place the list_head obj you use to stack the locked objects close enough to the resv pointer, and aligned on cache line, it shouldn't really be a problem, given you have to dereference the GEM object to retrieve its resv anyway.