[Adding Nirmoy, setting bunch of people to BCC]
This bubbled up in our internal testing as well. Nirmoy now wants to
take a look at it.
Am 01.04.20 um 11:17 schrieb Christian König:
Am 01.04.20 um 10:53 schrieb Chris Wilson:
Quoting Christian König (2020-04-01 08:29:34)
Am 31.03.20 um 15:19 schrieb Chris Wilson:
Quoting Daniel Vetter (2020-03-31 11:38:50)
On Tue, Mar 31, 2020 at 11:20 AM Chris Wilson
<chris@xxxxxxxxxxxxxxxxxx> wrote:
Quoting Daniel Vetter (2020-03-31 10:16:18)
On Tue, Mar 31, 2020 at 10:59:45AM +0200, Christian König wrote:
[SNIP]
We allow userspace to poison the drm_mm at roughly 8K intervals, a
search space of 35b with typically O(N^2) behaviour and each node
traversal (rb_next/rb_prev) will itself be costly. Even our simple
tests
can generate a search of several minutes before our patience runs
out.o
Any drm_mm that allows for userspace to control alignment can be
arbitrarily fragmented, hence a raised eyebrow that this search
would be
allowed in atomic context.
Wow, that is indeed quite a lot.
What is the criteria use for ordering the tree? Just the size or is
that
size+alignment?
The tree is just size. Alignment is a little used parameter, but there's
a requirement for userspace to be able to control it -- although it is
strictly the older interface, it is still open to abuse.
Converting the tree to [size, ffs(addr)] would help for many, but on top
of that we have zones in the drm_mm, so search-in-range can be abused on
top of search-for-alignment.
The difference is that search in range is not controllable by
userspace, but at least for amdgpu the alignment is very well
controllable.
Never looked into this, but maybe we have a low hanging fruit for an
improvement here?
A bit -- alignment is so rarely used in practice, optimising it was not
a concern, just someone else has now noticed the potential for abuse.
Well we do use alignment rather widely. IIRC we can have everything
between 4K and 2MB based on the tilling flags, memory channel config
etc etc...
They ran a test, get bored and complained that it didn't respond to ^C
for a long period of time and from that derive a proof-of-concept
test to
show how it can be used by one client to upset another :|
And as far as I can see that is a really valid problem we need to fix.
Give me a second to write a test case for this.
Nirmoy, could you tackle this first? I've came up with some very quick
and dirty code for this for our libdrm unit tests.
Ping me internally and we can chat about it.
Thanks,
Christian.
Thanks for pointing that out,
Christian.
_______________________________________________
dri-devel mailing list
dri-devel@xxxxxxxxxxxxxxxxxxxxx
https://lists.freedesktop.org/mailman/listinfo/dri-devel