Hi Hannes, Thomas,
On 29/01/2019 12:01, Thomas Gleixner wrote:
On Tue, 29 Jan 2019, Hannes Reinecke wrote:
That actually is a very good question, and I have been wondering about this
for quite some time.
I find it a bit hard to envision a scenario where the IRQ affinity is
automatically (and, more importantly, atomically!) re-routed to one of the
other CPUs.
Isn't this what happens today for non-managed IRQs?
And even it it were, chances are that there are checks in the driver
_preventing_ them from handling those requests, seeing that they should have
been handled by another CPU ...
Really? I would not think that it matters which CPU we service the
interrupt on.
I guess the safest bet is to implement a 'cleanup' worker queue which is
responsible of looking through all the outstanding commands (on all hardware
queues), and then complete those for which no corresponding CPU / irqhandler
can be found.
But I defer to the higher authorities here; maybe I'm totally wrong and it's
already been taken care of.
TBH, I don't know. I merily was involved in the genirq side of this. But
yes, in order to make this work correctly the basic contract for CPU
hotplug case must be:
If the last CPU which is associated to a queue (and the corresponding
interrupt) goes offline, then the subsytem/driver code has to make sure
that:
1) No more requests can be queued on that queue
2) All outstanding of that queue have been completed or redirected
(don't know if that's possible at all) to some other queue.
This may not be possible. For the HW I deal with, we have symmetrical
delivery and completion queues, and a command delivered on DQx will
always complete on CQx. Each completion queue has a dedicated IRQ.
That has to be done in that order obviously. Whether any of the
subsystems/drivers actually implements this, I can't tell.
Going back to c5cb83bb337c25, it seems to me that the change was made
with the idea that we can maintain the affinity for the IRQ as we're
shutting it down as no interrupts should occur.
However I don't see why we can't instead keep the IRQ up and set the
affinity to all online CPUs in offline path, and restore the original
affinity in online path. The reason we set the queue affinity to
specific CPUs is for performance, but I would not say that this matters
for handling residual IRQs.
Thanks,
John
Thanks,
tglx
.