Re: [PATCH 28/57] libmultipath: use a shared lock to co-operate with udev

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, May 03, 2016 at 09:27:32AM -0500, Benjamin Marzinski wrote:
> On Tue, May 03, 2016 at 07:57:01AM +0200, Hannes Reinecke wrote:
> > On 05/02/2016 06:26 PM, Benjamin Marzinski wrote:
> > > On Wed, Apr 27, 2016 at 01:10:29PM +0200, Hannes Reinecke wrote:
> > >> udev since v214 is placing a shared lock on the device node
> > >> whenever it's processing the event. This introduces a race
> > >> condition with multipathd, as multipathd is processing the
> > >> event for the block device at the same time as udev is
> > >> processing the events for the partitions.
> > >> And a lock on the partitions will also be visible on the
> > >> block device itself, hence multipathd won't be able to
> > >> lock the device.
> > >> When multipath manages to take a lock on the device,
> > >> udev will fail, and consequently ignore this entire event.
> > >> Which in turn might cause the system to malfunction as it
> > >> might have been a crucial event like 'remove' or 'link down'.
> > >>
> > >> So we should better use LOCK_SH here; with that the flock
> > >> call in multipathd _and_ udev will succeed and the events
> > >> can be processed.
> > > 
> > > If we switch this to a shared lock, then what's the point in having it
> > > at all? The whole point of lock_multipath is to keep multipath and
> > > multipathd (or two concurrent calls to multipath) from trying to create
> > > a device at the same time, and both failing. Without an exclusive lock,
> > > this won't stop that. We can either decide that this is an unlikely
> > > scenario, and drop it entirely, or we can have multipath create its own
> > > lockfiles to prevent this issue without interfering with udev. But
> > > unless I'm missing something, this won't actually do anything.
> > > 
> > This is primarily for co-operation with udev.
> > As we typically do _not_ use multipath for creating device-mapper tables
> > the synchronisation problem between multipath and multipathd
> > doesn't typically occur.
> > At least not for us :-)
> 
> Well, it doesn't occur for us either, unless someone runs multipath at
> EXACTLY the wrong time, at which case, I'm pretty sure the same thing
> can happen in any distro. The question is, do we protect against that
> very unlikely occurance.  The only time I see this actually having any
> real chance to occur would be if some multipath user writes a script
> that listens for new devices to be discovered, and then *helpfully* runs
> multipath to update the system. I have seen this. Multiple times, in
> fact. I'm not sure how much work we need to expend saving these people
> from themselves, however.
> 
> > 
> > And as described above, we need this patch to co-operate with udev.
> > Kay was pretty adamant about _not_ changing udev here.
> 
> I'm not suggesting we ask udev to chage. If we want to keep the locking,
> we need to use our own private lockfiles.
> 
> > That said, I'm happy with dropping the lock functionality completely;
> > one possibility would be to use the CLI functionality to route calls
> > from multipath to multipathd.
> 
> I don't see why this same issue can't happen for anyone creating any
> type of dm device build out of multiple physical devices, and AFAIK, lvm
> doesn't bother to protect users against this, even though it can now
> autoassemble. So I'm not against dropping this either.

Actually, after some more thought... There must be something that udev
is protecting itself against by taking the shared lock. Do you know what
that is? Assuming that multipath wants to protect against the same
thing, we probably do want to keep locking here.

So, assuming that we do want to protect ourselves against whatever udev
is protecting itself against, I remove my NAK of the original patch, and
apologize for the confusion. If we don't need to worry about udev's
issue, then we can try just removing the locks.  At worst, we'll find
out quickly what they are important for.

-Ben

> 
> -Ben
> 
> > Cheers,
> > 
> > Hannes
> > -- 
> > Dr. Hannes Reinecke		      zSeries & Storage
> > hare@xxxxxxx			      +49 911 74053 688
> > SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
> > GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg)
> 
> --
> dm-devel mailing list
> dm-devel@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/dm-devel

--
dm-devel mailing list
dm-devel@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/dm-devel




[Index of Archives]     [DM Crypt]     [Fedora Desktop]     [ATA RAID]     [Fedora Marketing]     [Fedora Packaging]     [Fedora SELinux]     [Yosemite Discussion]     [KDE Users]     [Fedora Docs]

  Powered by Linux