Re: [libvirt PATCH v2 01/15] util: properly deal with module vs. driver when binding device to driver

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 11/28/23 8:21 AM, Ján Tomko wrote:
The commit message is longer than it needs to be and the long
explanation actually made it harder to comprehend for me.

Well, without *enough* explanation, it all seems like pointlessly pushing bits around :-). I guess my extra explanation failed though, since people have had issues with the patch twice now).


On a Monday in 2023, Laine Stump wrote:
Historically libvirt has treated the concept of "loadable kernel
module" and "device driver" as being effectively the same (at least in
the case of the vfio-pci driver used for VFIO device assignment). The
code assumed that a module named "vfio-pci" implemented a driver named
"vfio-pci".

In reality, the module is named "vfio_pci", and it implements a device
driver named "vfio-pci" (note the difference in separator characters),

The note about the difference in characters is not needed if you mention
them explicitly below.

I suppose I can remove it. Reading the paragraph by itself, though, it sounds like 'His name is "Bob", not "*Bob*"'.


and our code worked only because the modprobe utility we use to load
the module will always "normalize" the module name it's given by
replacing all "-" (dash) with "_" (underscore) (this has been verified
in the modprobe source, which is in the kmod package - there are 3
separate functions that perform this same operation!).

My package manager can tell me which package modprobe is in and I don't
need to know the number of functions.

My spidey senses tell me someone is getting fed up with my verbose explanations :-). I'll try to be more brief.


So even though
we asked modprobe to load the "vfio-pci" module, it would actually
load the "vfio_pci" module.

After loading the module with modprobe, libvirt then looks for the
desired *driver* to be present in sysfs, by looking for the directory
/sys/bus/pci/drivers/${driverName}, which would succeed because we
were still looking for the original "dash version" of the name
("vfio-pci").

When we recently gained the ability to manually specify a driver to
bind to with virsh nodedev-detach, the fragility of this system became
apparent - if a user gives the driver name as "vfio_pci", then we
would modprobe the module, but then erroneously believe it hadn't been
loaded because /sys/bus/pci/drivers/vfio_pci didn't exist. For manual
specification of the driver name, this could be dealt with by telling
the user "always use the correct name for the driver, don't assume
that it is the same as the modulename", but it would still end up
confusing people, especially since some drivers do use underscore in
their name (e.g. the mlx5_vfio_pci driver/module).

More more importantly, when we begin looking in the modules.alias file
for the "best" VFIO variant driver for a particular device (in an
upcoming patch), that lookup will provide us with the name of a
*module*, not a driver, and 3 of the 4 examples of vfio-pci/variant
drivers I have access to use underscore in the module name and dash in
the driver, while the 4th uses underscore in both, so 3 out of 4 fail
with the current code.

To make the current code more forgiving (and yet more correct!), as
well as to make the upcoming variant auto-selection actually useful,
this patch follows the following steps:

1) if the requested driver (${driverName}) is present in sysfs drivers
  list (/sys/bus/pci/drivers/${driverName}) then use ${driverName} -
  DONE


I did not see any response to Jason's points in v1:
https://listman.redhat.com/archives/libvir-list/2023-October/242780.html

I did send a response, but I sent it after the list switched to the new server, so you won't find it in the old archives (actually when I originally replied I forgot to change the list address, and then had to resend):

https://lists.libvirt.org/archives/list/devel@xxxxxxxxxxxxxxxxx/message/WSWE76D5ISIV7F2IQPM5HKQ7BUXQAJSD/


I think we can safely assume that modprobe will accept a dashed name
and that module names are using underscores.

Yes. But you can't assume that the driver name will use dashes, nor even that the driver name will resemble the module name in any programmatically derivable way - although it is common convention that the module name be underscored(drivername), driver name is *not* dashed(modulename).

(Technically it is even possible (although I've not seen it) that the driver and module names might be completely unrelated - e.g. module "foo" implementing driver "bar". Sure, we could assume that will never happen, but then in 5 years someone will do that and libvirt 15.2.0 will break because of this assumption made 5 years prior, and some poor soul will have to spend a bunch of time figuring out why.)


Is the problem here that you cannot figure out the driver the module
provides until after it's loaded?

Yes, that is true. But also:

1) users may know the driver name or the module name, and usually don't know the difference between the two, or even that there are two different names. This blurring between the two has been exacerbated by modprobe always converting "-" into "_", so someone using "vfio-pci" will still automatically get the proper module loaded (because the module just happens to be "vfio_pci"), and binding the new driver will also work properly, but if they use the module name (vfio_pci), then the module will be loaded, but the driver bind will fail. We could just log an error, but

    "driver 'vfio_pci' not found"

is pretty uninformative, and

    "driver 'vfio_pci' not found - maybe try 'vfio-pci' instead"

is kind of pointless - if we already know how to fix the problem, why not just fix it and continue on? (you can bet that if we don't, there will be bug reports saying "Why don't you just automatically fix this instead of just telling me exactly how to fix it?!?!?". As a matter of fact, I'd already had bugs filed because "vfio_pci" wasn't accepted "even though mlx5_vfio_pci is", which is at least part of the origin story of this patch)

2) modules.alias deals only with *module* names, while each device's driver_override file in sysfs only understands *driver* names. And even if we insist that the user specify the driver name, libvirt's own code (see patch 14/15) will be retrieving the *module* name from the modules.alias file, and there will need to be code in libvirt to convert that to driver name in order for the bind operation to be successful. I'm just fixing that in this patch in order to simplify the later patch, and also make user-specified <driver name="blah"/> more foolproof.


In the end, all I'm doing is:

1) Making it more clear in libvirt's code where we are using a module name and where we are using a driver name.

1.5) explicitly performing the "dash to underscore" transformation to derive a modulename from the user-provided "driver-module name", but only because we need to follow links in sysfs, and those links *don't* benefit from modprobe's automatic replacement of dash with underscore (i.e. even though we can assume that modprobe will always replace dash with underscore, modprobe isn't the only place that we need the underscore version of the name)

2) following links in sysfs to derive a driver name from a module name, rather than making the *incorrect* assumption that we can get that with dashed(modulename) (which already doesn't work, e.g. for mlx5_vfio_pci/mlx5_vfio_pci) (with current functionality this just automatically fixes a common user error, but beginning with patch 14/15 it will actually be essential to proper functioning of variant driver auto-detect)

There is really nothing controversial going on here. Maybe I just spent so much time explaining that everyone *thinks* there must be something fishy.


Jano

2) if underscored(${driverName}) (call it ${underName} for short) is
  in sysfs drivers list then use ${underName} - DONE

3) if ${underName} *isn't* in the sysfs modules list
  (/sys/module/${underName}) then "modprobe ${underName}" to load it.

4) look for the PCI driver implemented by this module, it will
  be at /sys/module/${underName}/driver/pci:${newName}.

5) use ${newName} - DONE.

A few notes:

a) This will *always* replace dash with underscore in the name used to
  load the module, which seems it could be problematic, but I have
  verified this is what modprobe itself does, so I don't think there
  will ever be a module with "-" in its name as modprobe would be
  unable to load it (the same can't be said for drivers though!)

b) The one case where the above steps wouldn't work would be if
  someone implemented a driver within a module where the driver and
  module had names that differed other than dash vs. underscore. I
  haven't seen an example of this, so the convention seems to be that
  they will "match" (modulo - & _). If this mismatch were to ever
  occur, though, it could be worked around by simply loading the
  module out-of-band during the host system startup, and then using
  the driver's name in the config.

c) All of this is conveniently ignoring the possibility of a VFIO
  variant driver that is statically linked in the kernel. The entire
  design of variant driver auto-detection is based on doing a lookup
  in modules.alias, and that only lists *loadable modules* (not
  drivers), so unless I'm missing something, it would be impossible
  to auto-detect a VFIO variant driver that was statically
  linked. This is beyond libvirt's ability to fix; the best that
  could be done would be to manually specify the driver name in the
  libvirt config, which I suppose is better than nothing :-)

Signed-off-by: Laine Stump <laine@xxxxxxxxxx>
---
src/util/virpci.c | 194 +++++++++++++++++++++++++++++++++++++++-------
1 file changed, 164 insertions(+), 30 deletions(-)


_______________________________________________
Devel mailing list -- devel@xxxxxxxxxxxxxxxxx
To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxx
_______________________________________________
Devel mailing list -- devel@xxxxxxxxxxxxxxxxx
To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxx




[Index of Archives]     [Virt Tools]     [Libvirt Users]     [Lib OS Info]     [Fedora Users]     [Fedora Desktop]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite News]     [KDE Users]     [Fedora Tools]

  Powered by Linux