On Fri, 9 Mar 2018, Ming Lei wrote: > On Fri, Mar 09, 2018 at 11:08:54AM +0100, Thomas Gleixner wrote: > > > > So my understanding is that these irq patches are enhancements and not bug > > > > fixes. I'll queue them for 4.17 then. > > > > > > Wrt. this IO hang issue, these patches shouldn't be bug fix, but they may > > > fix performance regression[1] for some systems caused by 84676c1f21 ("genirq/affinity: > > > assign vectors to all possible CPUs"). > > > > > > [1] https://marc.info/?l=linux-block&m=152050347831149&w=2 > > > > Hmm. The patches are rather large for urgent and evtl. backporting. Is > > there a simpler way to address that performance issue? > > Not thought of a simpler solution. The problem is that number of active msix vector > is decreased a lot by commit 84676c1f21. It's reduced in cases where the number of possible CPUs is way larger than the number of online CPUs. Now, if you look at the number of present CPUs on such systems it's probably the same as the number of online CPUs. It only differs on machines which support physical hotplug, but that's not the normal case. Those systems are more special and less wide spread. So the obvious simple fix for this regression issue is to spread out the vectors accross present CPUs and not accross possible CPUs. I'm not sure if there is a clear indicator whether physcial hotplug is supported or not, but the ACPI folks (x86) and architecture maintainers should be able to answer that question. I have a machine which says: smpboot: Allowing 128 CPUs, 96 hotplug CPUs There is definitely no way to hotplug anything on that machine and sure the existing spread algorithm will waste vectors to no end. Sure then there is virt, which can pretend to have a gazillion of possible hotpluggable CPUs, but virt is an insanity on its own. Though someone might come up with reasonable heuristics for that as well. Thoughts? Thanks, tglx