https://bugzilla.kernel.org/show_bug.cgi?id=202055 --- Comment #39 from Dongli Zhang (dongli.zhang@xxxxxxxxxx) --- (In reply to Alex Williamson from comment #33) > Created attachment 280913 [details] > NVMe subsystem reset with ACS masking > > Dongli, I'd appreciate testing of this patch series. The differences from > the previous version are: > > 1) Use NVMe subsystem reset rather than secondary bus reset, this simplifies > some of the hotplug slot code from the previous version > 2) Mask ACS Source Validation around reset, this eliminates some of the > magic voodoo that avoided the fault on my system, but not yours > > This exploded into a several patch series to simplify the ACS masking, but > it should still apply easily. Testing by others obviously welcome as well. > Thanks Hi Alex, I have tested the 5-patch 280913 (as below). Unfortunately, I encountered the initial problem again, that is, the msix count changed from 16 to 22 again. There is no AER message this time. https://bugzilla.kernel.org/attachment.cgi?id=280913 ./x86_64-softmmu/qemu-system-x86_64 -hda /home/zhang/img/ubuntu/disk.img -smp 2 -m 2000M -enable-kvm -vnc :0 -device vfio-pci,host=0000:01:00.0 WARNING: Image format was not specified for '/home/zhang/img/ubuntu/disk.img' and probing guessed raw. Automatically detecting the format is dangerous for raw images, write operations on block 0 will be restricted. Specify the 'raw' format explicitly to remove the restrictions. qemu-system-x86_64: -device vfio-pci,host=0000:01:00.0: vfio error: 0000:01:00.0: failed to add PCI capability 0x11[0x50]@0xb0: table & pba overlap, or they don't fit in BARs, or don't align The msix count changed from 16 to 22 again. 01:00.0 Non-Volatile memory controller: Intel Corporation Device f1a6 (rev 03) (prog-if 02 [NVM Express]) Subsystem: Intel Corporation Device 390b Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Interrupt: pin A routed to IRQ 16 ... ... Capabilities: [b0] MSI-X: Enable- Count=22 Masked- Vector table: BAR=0 offset=00002000 PBA: BAR=0 offset=00002100 Dongli Zhang -- You are receiving this mail because: You are watching the assignee of the bug.