Kenji Kaneshige <kaneshige.kenji@xxxxxxxxxxxxxx> writes:

> Eric W. Biederman wrote:
>> Jesse Barnes <jbarnes@xxxxxxxxxxxxxxxx> writes:
>>
>>> Any update here, Eric?  Sounds like you're using hotplug in real environments
>>> with complex topologies (based on your earlier messages), so we're interested
>>> in what you're seeing here...
>>
>> Yes.
>>
>> Currently I have a test system that is a subset of what I'm worried
>> about and will shortly have the real hardware, so my immediate goal is
>> to get things working well enough so my internal users won't get
>> blocked by bugs.  Currently I only have the pcie hotplug and pcie
>> hotplug surprise case.  My basic topology is 16 hotplug slots into
>> which I will be plugging in pci express switches with a couple of
>> additional hotplug slots.  As for the firmware, I will have it reserving
>> bus numbers and mmio space on each of the first 16 slots and the rest
>> is going to be up to the linux kernel.  This is an embedded design,
>> so no ACPI: it appears to be more pain than it is worth to implement.
>>
>
> Very interesting. Can I ask you some questions?
>
> - On hot-insertion of pci express switches with additional hotplug
>   slots, who initializes the HwInit registers (for example, the physical
>   slot number field in the Slot Capabilities register)? OS, firmware,
>   hardware or others?

It happens before the linux kernel gets to see it.  Call it firmware.

> - The bus numbers and MMIO space that need to be reserved depend
>   on the platform design. How do you tell the kernel (or hotplug drivers)
>   how many resources need to be reserved, in your current design?

So far it looks like I can get away without telling the kernel
anything, and just perform reservations at the layer of the firmware
on the primary board, and have the kernel see those reservations when
it boots up, and just subdivide them.

I have some thoughts on how to do things better but I'm not at a point
where it makes a difference right now.

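To illustrate what "just subdivide them" could mean for bus numbers, here
is a minimal sketch in C.  It is not the kernel's actual allocation code.
It assumes the firmware reserved an inclusive bus number range for one slot,
that the hot-added switch uses the first bus of that range as its internal
bus, and that the remaining buses are split as evenly as possible among the
switch's downstream ports.  The names bus_range and subdivide_bus_range are
hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Inclusive range of PCI bus numbers (hypothetical helper type). */
struct bus_range {
	int start;
	int end;
};

/*
 * Split a firmware-reserved bus range among the downstream ports of a
 * hot-added switch.  reserved.start is assumed to become the switch's
 * internal bus; everything after it is handed out to the ports.
 * Returns 0 on success, -1 if there are too few buses for the ports.
 */
static int subdivide_bus_range(struct bus_range reserved, int nports,
			       struct bus_range *out)
{
	int avail, per_port, extra, next, i;

	assert(out != NULL);

	/* Buses left once the internal bus has been taken. */
	avail = reserved.end - reserved.start;
	if (nports <= 0 || avail < nports)
		return -1;

	per_port = avail / nports;
	extra = avail % nports;		/* the first 'extra' ports get one more */
	next = reserved.start + 1;

	for (i = 0; i < nports; i++) {
		int len = per_port + (i < extra ? 1 : 0);

		out[i].start = next;		/* secondary bus number */
		out[i].end = next + len - 1;	/* subordinate bus number */
		next += len;
	}
	return 0;
}
```

With buses 16-31 reserved and a switch with three downstream ports, this
hands out 17-21, 22-26 and 27-31.  An even split is only one possible
policy; MMIO windows would additionally need alignment handling, which this
sketch ignores.
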
>> I need to revisit the pciehp driver but my first pass through it
>> looked like every corner case appeared to get something wrong.  So I
>> have written myself a little 430 line replacement that handles the case
>> that I currently care about.  Part of what I was seeing before is that
>> we don't clear pending events in the pciehp driver before we enable
>> interrupts.  So if booting the system has left some pending and you
>> have CONFIG_DEBUG_SHIRQ enabled you get a nice oops because p_slot has
>> not been initialized and so the interrupts can't be handled.
>>
>
> I made a fix (c4635eb06af700820d658a163f06aff12e17cfb2) for a similar
> problem several months ago. With this fix, pciehp was changed to
> initialize p_slot before installing the interrupt service routine. So I
> still don't understand what is happening. Could you please tell me the
> details about "p_slot has not been initialized..."?

The kobject name is not initialized, and slot_name(p_slot) calls
hotplug_slot_name, which calls pci_slot_name, which calls kobject_name.

It looks like this problem was introduced in commit
e1acb24f059defdaa0264e925f19cc21b0a3e592.

Eric