Hello, On Tue, Jun 30, 2020 at 10:00 AM Greg Kroah-Hartman <gregkh@xxxxxxxxxxxxxxxxxxx> wrote: > > On Tue, Jun 30, 2020 at 06:08:31PM +0200, Rafael J. Wysocki wrote: > > On Tue, Jun 30, 2020 at 5:38 PM Greg Kroah-Hartman > > <gregkh@xxxxxxxxxxxxxxxxxxx> wrote: > > > > > > On Tue, Jun 30, 2020 at 03:00:34PM +0200, Rafael J. Wysocki wrote: > > > > On Tue, Jun 30, 2020 at 2:52 PM Greg Kroah-Hartman > > > > <gregkh@xxxxxxxxxxxxxxxxxxx> wrote: > > > > > > > > > > On Tue, Jun 30, 2020 at 01:49:48PM +0300, Heikki Krogerus wrote: > > > > > > On Mon, Jun 29, 2020 at 09:49:41PM -0700, Rajat Jain wrote: > > > > > > > Add a new (optional) field to denote the physical location of a device > > > > > > > in the system, and expose it in sysfs. This was discussed here: > > > > > > > https://lore.kernel.org/linux-acpi/20200618184621.GA446639@xxxxxxxxx/ > > > > > > > > > > > > > > (The primary choice for attribute name i.e. "location" is already > > > > > > > exposed as an ABI elsewhere, so settled for "site"). Individual buses > > > > > > > that want to support this new attribute can opt-in by setting a flag in > > > > > > > bus_type, and then populating the location of device while enumerating > > > > > > > it. > > > > > > > > > > > > So why not just call it "physical_location"? > > > > > > > > > > That's better, and will allow us to put "3rd blue plug from the left, > > > > > 4th row down" in there someday :) > > > > > > > > > > All of this is "relative" to the CPU, right? But what CPU? Again, how > > > > > are the systems with drawers of PCI and CPUs and memory that can be > > > > > added/removed at any point in time being handled here? What is > > > > > "internal" and "external" for them? > > > > > > > > > > What exactly is the physical boundry here that is attempting to be > > > > > described? > > > > > > > > Also, where is the "physical location" information going to come from? > > > > > > Who knows? :) > > > > > > Some BIOS seem to provide this, but do you trust that? > > > > > > > If that is the platform firmware (which I suspect is the anticipated > > > > case), there may be problems with reliability related to that. > > > > > > s/may/will/ > > > > > > which means making the kernel inact a policy like this patch series > > > tries to add, will result in a lot of broken systems, which is why I > > > keep saying that it needs to be done in userspace. > > > > > > It's as if some of us haven't been down this road before and just keep > > > being ignored... > > > > > > {sigh} > > > > Well, to be honest, if you are a "vertical" vendor and you control the > > entire stack, *including* the platform firmware, it would be kind of > > OK for you to do that in a product kernel. > > > > However, this is not a practical thing to do in the mainline kernel > > which must work for everybody, including people who happen to use > > systems with broken or even actively unfriendly firmware on them. > > > > So I'm inclined to say that IMO this series "as is" would not be an > > improvement from the mainline perspective. > > It can be, we have been using this for USB devices for many many years > now, quite successfully. The key is not to trust that the platform > firmware got it right :) > > > I guess it would make sense to have an attribute for user space to > > write to in order to make the kernel reject device plug-in events > > coming from a given port or connector, but the kernel has no reliable > > means to determine *which* ports or connectors are "safe", and even if > > there was a way for it to do that, it still may not agree with user > > space on which ports or connectors should be regarded as "safe". > > Again, we have been doing this for USB devices for a very long time, PCI > shouldn't be any different. Why people keep ignoring working solutions > is beyond me, there's nothing "special" about PCI devices here for this > type of "worry" or reasoning to try to create new solutions. > > So, again, I ask, go do what USB does, and to do that, take the logic > out of the USB core, make it bus-agnositic, and _THEN_ add it to the PCI > code. Why the original submitter keeps ignoring my request to do this > is beyond me, I guess they like making patches that will get rejected :( IMHO I'm actually trying to precisely do what I think was the conclusion of our discussion, and then some changes because of the further feedback I received on those patches. Let's take a step back and please allow me to explain how I got here (my apologies but this spans a couple of threads, and I"m trying to tie them all together here): GOAL: To allow user space to control what (PCI) drivers he wants to allow on external (thunderbolt) ports. There was a lot of debate about the need for such a policy at https://lore.kernel.org/linux-pci/CACK8Z6GR7-wseug=TtVyRarVZX_ao2geoLDNBwjtB+5Y7VWNEQ@xxxxxxxxxxxxxx/ with the final conclusion that it should be OK to implement such a policy in userspace, as long as the policy is not implemented in the kernel. The kernel only needs to expose bits & info that is needed by the userspace to implement such a policy, and it can be used in conjunction with "drivers_autoprobe" to implement this policy: -------------------------------------------------------------------- .... That's an odd thing, but sure, if you want to write up such a policy for your systems, great. But that policy does not belong in the kernel, it belongs in userspace. .... -------------------------------------------------------------------- 1) The post https://lore.kernel.org/linux-pci/20200609210400.GA1461839@bjorn-Precision-5520/ lists out the approach that was agreed on. Replicating it here: ----------------------------------------------------------------------- - Expose the PCI pdev->untrusted bit in sysfs. We don't expose this today, but doing so would be trivial. I think I would prefer a sysfs name like "external" so it's more descriptive and less of a judgment. This comes from either the DT "external-facing" property or the ACPI "ExternalFacingPort" property. - All devices present at boot are enumerated. Any statically built drivers will bind to them before any userspace code runs. If you want to keep statically built drivers from binding, you'd need to invent some mechanism so pci_driver_init() could clear drivers_autoprobe after registering pci_bus_type. - Early userspace code prevents modular drivers from automatically binding to PCI devices: echo 0 > /sys/bus/pci/drivers_autoprobe This prevents modular drivers from binding to all devices, whether present at boot or hot-added. - Userspace code uses the sysfs "bind" file to control which drivers are loaded and can bind to each device, e.g., echo 0000:02:00.0 > /sys/bus/pci/drivers/nvme/bind ----------------------------------------------------------------------- 2) As part of implementing the above agreed approach, when I exposed PCI "untrusted" attribute to userspace, it ran into discussion that concluded that instead of this, the device core should be enhanced with a location attribute. https://lore.kernel.org/linux-pci/20200618184621.GA446639@xxxxxxxxx/ ----------------------------------------------------------------------- ... The attribute should be called something like "location" or something like that (naming is hard), as you don't always know if something is external or not (it could be internal, it could be unknown, it could be internal to an external device that you trust (think PCI drawers for "super" computers that are hot pluggable but yet really part of the internal bus). .... "trust" has no direct relation to the location, except in a policy of what you wish to do with that device, so as long as you keep them separate that way, I am fine with it. ... ----------------------------------------------------------------------- And hence this patch. I don't see an attribute in USB comparable to this new attribute, except for the boolean "removable" may be. Are you suggesting to pull that into the device core instead of adding this "physical_location" attribute? 3) The one deviation from the agreed approach in (1) is https://patchwork.kernel.org/patch/11633095/ . The reason is I realized that contrary to what I earlier believed, we might not be able to disable the PCI link to all external PCI devices at boot. So external PCI devices may actually bind to drivers before userspace comes up and does "echo 0 > /sys/bus/pci/drivers_autoprobe"). I'm really happy to do what you think is the right way as long as it helps achieve my goal above. Really looking for clear directions here. Thanks & Best Regards, Rajat > thanks, > > greg k-h