Hi Ben, On Thu, 2019-04-18 at 12:49 -0500, Benjamin Marzinski wrote: > When "multipath -u" is run, it checks if multipathd is running. > Currently it does this by trying to connect to the mutipathd socket. > This can cause problems during boot. The multipathd.socket systemd > unit > file will cause "multipath -u" to wait until multipathd has been > started > before continuing. Thanks a lot for this. We were hunting several similar problems lately, and were beginning to suspect exactly this. The main reason that we haven't come up with a patch like this is that I refused to believe this could happen - here is why. We've had the patch d7188fc "multipathd: start daemon after udev trigger" out in the field since November 2016. That patch moves multipathd's start (but not the start of multipathd.socket!) to after "udev settle" has finished. What we've been observing lately seems to be this: - start multipathd.socket - start systemd-udev-trigger - start systemd-udev-settle - SCSI disk found, "add" uevent generated - multipath -u - opens socket - systemd queues a job to start multipathd, but doesn't start it because "udev settle" isn't finished yet - now "multipath -u" hangs ... ... which is basically a deadlock situation which causes either the uevent, or "udev settle" itself, or both, to time out before usable devices are found. The system is almost certain to end up in emergency mode. And it's not even a matter of lots of udev activity. What I'm really puzzled about is that I have never observed this in the last 2 years, and it'd be almost certain to generate dozens of bug reports quickly, which it is currently doing. ** Is it possible that something has changed in the way systemd handles opening the socket and starting the associated service? ** > >If there is a lot of activity on the system, > multipathd may not start up immediately, causing block device > initialization to be delayed, potentially until after systemd times > waiting for the device. To avoid this, multipath now checks if > multipathd is running by reading /run/multipathd.pid and checking the > /proc/<pid>/comm to verify that multipathd is really running with > this > pid. This avoids forcing "multipath -u" to wait on multipathd > starting > up. > > As an alternative to this patch, multipath could simply switch the > order > of the calls to systemd_service_enabled() and mpath_connect(). > This would > make multipath only try to connect with multipathd if it wasn't > enabled in > systemd, so that it wouldn't autostart. If my above analysis is correct, it'd be sufficient to revert d7188fc and start multipathd before udev trigger and udev settle again, as it used to be. I've been thinking about this for some time, IMO it would be actually the "Right Thing" in the long term. The problem is that we need to be sure that multipathd would deal gracefully with encountering multipath maps during startup where none of the constituent paths are in the udev database (yet), and where it might take considerable time until these paths appear. > Another alternative is to do away with multipathd.socket. Since > multipathd > needs to always be running in order to get uevents, there isn't much > value > in having it autoactivate when it gets an interactive command. Some people may still be running without the daemon and just work with "multipath" commands. It "works", just that uevents aren't received and paths aren't reinstated; we've never officially deprecated it. I think we need the socket for the time being, for the "delegate dangerous commands to user space" feature in multipath. One of our customers worked around the problem by adding an After=systemd-udev-settle.service to multipathd.socket, too. That way socket activation doesn't happen; it seems to "work" too, but it wouldn't be my preferred solution. > Signed-off-by: Benjamin Marzinski <bmarzins@xxxxxxxxxx> The patch looks good to me. Reviewed-by: Martin Wilck <mwilck@xxxxxxxx> > --- > multipath/main.c | 60 +++++++++++++++++++++++++++++++++++++++++++--- > -- > 1 file changed, 54 insertions(+), 6 deletions(-) > > diff --git a/multipath/main.c b/multipath/main.c > index 69141dbc..008e3d3f 100644 > --- a/multipath/main.c > +++ b/multipath/main.c > @@ -850,6 +850,58 @@ out: > return r; > } > > +int is_multipathd_running(void) > +{ > + FILE *f = NULL; > + char buf[16]; > + char path[PATH_MAX]; > + int pid; > + char *end; > + > + f = fopen(DEFAULT_PIDFILE, "r"); > + if (!f) { > + if (errno != ENOENT) > + condlog(4, "can't open " DEFAULT_PIDFILE ": > %s", > + strerror(errno)); > + return 0; > + } > + if (!fgets(buf, sizeof(buf), f)) { > + if (ferror(f)) > + condlog(4, "read of " DEFAULT_PIDFILE " failed: > %s", > + strerror(errno)); > + fclose(f); > + return 0; > + } > + fclose(f); > + errno = 0; > + strchop(buf); > + pid = strtol(buf, &end, 10); > + if (errno != 0 || pid <= 0 || *end != '\0') { > + condlog(4, "invalid contents in " DEFAULT_PIDFILE ": > '%s'", > + buf); > + return 0; > + } > + snprintf(path, sizeof(path), "/proc/%d/comm", pid); > + f = fopen(path, "r"); > + if (!f) { > + if (errno != ENOENT) > + condlog(4, "can't open %s: %s", path, > strerror(errno));artin > + return 0; > + } > + if (!fgets(buf, sizeof(buf), f)) { > + if (ferror(f)) > + condlog(4, "read of %s failed: %s", path, > + strerror(errno)); > + fclose(f); > + return 0; > + } > + fclose(f); > + strchop(buf); > + if (strcmp(buf, "multipathd") != 0) > + return 0; > + return 1; > +} > + > int > main (int argc, char *argv[]) > { > @@ -1028,17 +1080,13 @@ main (int argc, char *argv[]) > } > if (cmd == CMD_VALID_PATH && > dev_type == DEV_UEVENT) { > - int fd; > - > - fd = mpath_connect(); > - if (fd == -1) { > + if (!is_multipathd_running()) { > condlog(3, "%s: daemon is not running", dev); > if (!systemd_service_enabled(dev)) { > r = print_cmd_valid(RTVL_NO, NULL, > conf); > goto out; > } > - } else > - mpath_disconnect(fd); > + } > } > > if (cmd == CMD_REMOVE_WWID && !dev) { -- dm-devel mailing list dm-devel@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/dm-devel