Hi Guennadi, On 11/16/20 4:10 PM, Guennadi Liakhovetski wrote: > Hi Arnaud, > > On Mon, Nov 16, 2020 at 03:43:35PM +0100, Arnaud POULIQUEN wrote: >> Hi Guennadi, Mathieu, > > [snip] > >> I tried the find_module API, it's quite simple and seems to work well. just need >> to be protected by #ifdef MODULE >> >> I also added a select RMPS_NS in the RPMSG_VIRTIO for compatibility with the legacy. >> >> look to me that this patch is a simple fix that should work also for the vhost... >> that said, the question is can we use this API here? >> >> I attached the patch at the end of the mail. > > Thanks for the patch. Yes, this would guarantee, that the NS module is loaded. But > does it also guarantee, that the NS probing completes faster than an NS announcement > arrives? We have a race here: > > rpmsg_ns_register_device() -----------------\ > | | > virtio_device_ready() | > | | > remote sends NS announcement rpmsg_register_device() > | | > | rpmsg_ns_probe() > | | > | rpmsg_create_ept() > rpmsg_ns_cb() > > In practice we *should* be fine - maybe the whole probing (the right branch) happens > synchronously on the same core as where rpmsg_ns_register_device() was called, or > even if not, the probing runs locally and the NS announcement either talks to a > remote core or a VM. So, it *should* be safe, but unless we can make guarantee, that > the probing is synchronous, I wouldn't rely on this. So, without a completion this > still seems incomplete to me. Thanks for this description! I tested on a dual core, and I expected what you are describing above but in fact I observed following: rpmsg_ns_register_device() | rpmsg_register_device() | rpmsg_ns_probe() | rpmsg_create_ept() | virtio_device_ready() | remote sends NS announcement rpmsg_ns_cb() Here is the associated call stack generated in rpmsg_ns_probe using "warn_on" [ 11.498678] CPU: 1 PID: 348 Comm: systemd-udevd Not tainted 5.10.0-rc2 #54 [ 11.504106] Hardware name: STM32 (Device Tree Support) [ 11.509271] [<c011062c>] (unwind_backtrace) from [<c010c528>] (show_stack+0x10/0x14) [ 11.516992] [<c010c528>] (show_stack) from [<c0ad1360>] (dump_stack+0xb8/0xcc) [ 11.524204] [<c0ad1360>] (dump_stack) from [<c0120478>] (__warn+0xd8/0xf0) [ 11.531066] [<c0120478>] (__warn) from [<c0acd2a0>] (warn_slowpath_fmt+0x64/0xc4) [ 11.538547] [<c0acd2a0>] (warn_slowpath_fmt) from [<bf01505c>] (rpmsg_ns_probe+0x5c/0xe0 [rpmsg_ns]) [ 11.547681] [<bf01505c>] (rpmsg_ns_probe [rpmsg_ns]) from [<bf0005cc>] (rpmsg_dev_probe+0x14c/0x1b0 [rpmsg_core]) [ 11.557933] [<bf0005cc>] (rpmsg_dev_probe [rpmsg_core]) from [<c067ae44>] (really_probe+0x208/0x4f0) [ 11.567050] [<c067ae44>] (really_probe) from [<c067b2f4>] (driver_probe_device+0x78/0x16c) [ 11.575305] [<c067b2f4>] (driver_probe_device) from [<c0678e44>] (bus_for_each_drv+0x84/0xd0) [ 11.583822] [<c0678e44>] (bus_for_each_drv) from [<c067ab9c>] (__device_attach+0xf0/0x188) [ 11.592075] [<c067ab9c>] (__device_attach) from [<c0679c0c>] (bus_probe_device+0x84/0x8c) [ 11.600248] [<c0679c0c>] (bus_probe_device) from [<c0676194>] (device_add+0x3b0/0x7b0) [ 11.608165] [<c0676194>] (device_add) from [<bf0003dc>] (rpmsg_register_device+0x54/0x88 [rpmsg_core]) [ 11.617486] [<bf0003dc>] (rpmsg_register_device [rpmsg_core]) from [<bf00ab84>] (rpmsg_probe+0x2c4/0x3f4 [virtio_rpmsg_bus]) [ 11.628696] [<bf00ab84>] (rpmsg_probe [virtio_rpmsg_bus]) from [<c05cb748>] (virtio_dev_probe+0x1f4/0x2c4) [ 11.638352] [<c05cb748>] (virtio_dev_probe) from [<c067ae44>] (really_probe+0x208/0x4f0) [ 11.646406] [<c067ae44>] (really_probe) from [<c067b2f4>] (driver_probe_device+0x78/0x16c) [ 11.654658] [<c067b2f4>] (driver_probe_device) from [<c067b648>] (device_driver_attach+0x58/0x60) [ 11.663522] [<c067b648>] (device_driver_attach) from [<c067b704>] (__driver_attach+0xb4/0x154) [ 11.672134] [<c067b704>] (__driver_attach) from [<c0678d64>] (bus_for_each_dev+0x78/0xc0) [ 11.680300] [<c0678d64>] (bus_for_each_dev) from [<c0679ebc>] (bus_add_driver+0x170/0x20c) [ 11.688599] [<c0679ebc>] (bus_add_driver) from [<c067c22c>] (driver_register+0x74/0x108) [ 11.696662] [<c067c22c>] (driver_register) from [<bf01006c>] (rpmsg_init+0x6c/0x1000 [virtio_rpmsg_bus]) [ 11.706136] [<bf01006c>] (rpmsg_init [virtio_rpmsg_bus]) from [<c0102090>] (do_one_initcall+0x58/0x2bc) [ 11.713616] usb33: supplied by vdd_usb [ 11.715500] [<c0102090>] (do_one_initcall) from [<c01b6608>] (do_init_module+0x60/0x248) [ 11.715525] [<c01b6608>] (do_init_module) from [<c01b86fc>] (load_module+0x12e8/0x1638) [ 11.715546] [<c01b86fc>] (load_module) from [<c01b8cd4>] (sys_finit_module+0xd4/0x130) [ 11.715575] [<c01b8cd4>] (sys_finit_module) from [<c0100060>] (ret_fast_syscall+0x0/0x54) Having said that, does this guarantee the probe, a good question! Maybe you or Mathieu have the answer, not me... So if we can't conclude, adding completion would be OK for me. Thanks, Arnaud > > Thanks > Guennadi > >>>> Why can it not be called in rpmsg_ns_probe()? The only purpose of this completion >>>> is to make sure that rpmsg_create_ept() for the NS endpoint has completed before >>>> we begin communicating with the remote / host, e.g. by calling >>>> virtio_device_ready() in case of the VirtIO backend, right? >>> >>> How the module driver are probed during device registration is not cristal clear >>> for me here... >>> Your approach looks to me a good compromize, I definitively need to apply and >>> test you patch to well understood the associated scheduling... >> >> I looked in code, trying to understand behavior on device registration. >> >> my understanding is: if driver is already registered (basic built-in or module >> previously loaded) the driver is probed on device registration. No asynchronous >> probing through a work queue or other. >> >> So it seems, (but i'm not enough expert to be 100% sure) that ensuring that the >> rmpsg_ns module is loaded (and so driver registered) before the device register >> is enough, no completion mechanism is needed here. >> >> Regards, >> Arnaud >> >>> >>> Thanks, >>> Arnaud >>> >>>> >>>> Thanks >>>> Guennadi >>>> >>>>> Thanks, >>>>> Arnaud >>>>> >> >> [...] >> >> From 2629298ef1b7eea7a3a7df245abba03914c09e6b Mon Sep 17 00:00:00 2001 >> From: Arnaud Pouliquen <arnaud.pouliquen@xxxxxx> >> Date: Mon, 16 Nov 2020 15:07:14 +0100 >> Subject: [PATCH] rpmsg: specify dependency between virtio rpmsg and virtio_ns >> >> The rpmsg_ns service has to be registered before the first >> message reception. When used as module, this imply and >> dependency of the rpmsg virtio on the rpmsg_ns module. >> this patch solve the dependency issue. >> >> Signed-off-by: Arnaud Pouliquen <arnaud.pouliquen@xxxxxx> >> --- >> drivers/rpmsg/Kconfig | 1 + >> drivers/rpmsg/rpmsg_ns.c | 2 +- >> drivers/rpmsg/virtio_rpmsg_bus.c | 22 ++++++++++++++++++++++ >> 3 files changed, 24 insertions(+), 1 deletion(-) >> >> diff --git a/drivers/rpmsg/Kconfig b/drivers/rpmsg/Kconfig >> index c3fc75e6514b..1394114782d2 100644 >> --- a/drivers/rpmsg/Kconfig >> +++ b/drivers/rpmsg/Kconfig >> @@ -71,5 +71,6 @@ config RPMSG_VIRTIO >> depends on HAS_DMA >> select RPMSG >> select VIRTIO >> + select RPMSG_NS >> >> endmenu >> diff --git a/drivers/rpmsg/rpmsg_ns.c b/drivers/rpmsg/rpmsg_ns.c >> index 5bda7cb44618..f19f3cbd2526 100644 >> --- a/drivers/rpmsg/rpmsg_ns.c >> +++ b/drivers/rpmsg/rpmsg_ns.c >> @@ -104,5 +104,5 @@ module_exit(rpmsg_ns_exit); >> >> MODULE_DESCRIPTION("Name service announcement rpmsg Driver"); >> MODULE_AUTHOR("Arnaud Pouliquen <arnaud.pouliquen@xxxxxx>"); >> -MODULE_ALIAS("rpmsg_ns"); >> +MODULE_ALIAS("rpmsg:rpmsg_ns"); >> MODULE_LICENSE("GPL v2"); >> diff --git a/drivers/rpmsg/virtio_rpmsg_bus.c b/drivers/rpmsg/virtio_rpmsg_bus.c >> index 338f16c6563d..f032e6c3f9a9 100644 >> --- a/drivers/rpmsg/virtio_rpmsg_bus.c >> +++ b/drivers/rpmsg/virtio_rpmsg_bus.c >> @@ -1001,6 +1001,28 @@ static int __init rpmsg_init(void) >> { >> int ret; >> >> +#ifdef MODULE >> + static const char name[] = "rpmsg_ns"; >> + struct module *ns; >> + >> + /* >> + * Make sur that the rpmsg ns module is loaded in case of module. >> + * This ensures that the rpmsg_ns driver is probed immediately when the >> + * associated device is registered during the rpmsg virtio probe. >> + */ >> + mutex_lock(&module_mutex); >> + ns = find_module(name); >> + mutex_unlock(&module_mutex); >> + >> + if (!ns) { >> + ret = request_module(name); >> + if (ret) { >> + pr_err("can not find %s module (err %d)\n", name, ret); >> + return ret; >> + } >> + } >> +#endif >> + >> ret = register_virtio_driver(&virtio_ipc_driver); >> if (ret) >> pr_err("failed to register virtio driver: %d\n", ret); >> -- >> 2.17.1