On Tue, May 03, 2016 at 07:57:01AM +0200, Hannes Reinecke wrote: > On 05/02/2016 06:26 PM, Benjamin Marzinski wrote: > > On Wed, Apr 27, 2016 at 01:10:29PM +0200, Hannes Reinecke wrote: > >> udev since v214 is placing a shared lock on the device node > >> whenever it's processing the event. This introduces a race > >> condition with multipathd, as multipathd is processing the > >> event for the block device at the same time as udev is > >> processing the events for the partitions. > >> And a lock on the partitions will also be visible on the > >> block device itself, hence multipathd won't be able to > >> lock the device. > >> When multipath manages to take a lock on the device, > >> udev will fail, and consequently ignore this entire event. > >> Which in turn might cause the system to malfunction as it > >> might have been a crucial event like 'remove' or 'link down'. > >> > >> So we should better use LOCK_SH here; with that the flock > >> call in multipathd _and_ udev will succeed and the events > >> can be processed. > > > > If we switch this to a shared lock, then what's the point in having it > > at all? The whole point of lock_multipath is to keep multipath and > > multipathd (or two concurrent calls to multipath) from trying to create > > a device at the same time, and both failing. Without an exclusive lock, > > this won't stop that. We can either decide that this is an unlikely > > scenario, and drop it entirely, or we can have multipath create its own > > lockfiles to prevent this issue without interfering with udev. But > > unless I'm missing something, this won't actually do anything. > > > This is primarily for co-operation with udev. > As we typically do _not_ use multipath for creating device-mapper tables > the synchronisation problem between multipath and multipathd > doesn't typically occur. > At least not for us :-) Well, it doesn't occur for us either, unless someone runs multipath at EXACTLY the wrong time, at which case, I'm pretty sure the same thing can happen in any distro. The question is, do we protect against that very unlikely occurance. The only time I see this actually having any real chance to occur would be if some multipath user writes a script that listens for new devices to be discovered, and then *helpfully* runs multipath to update the system. I have seen this. Multiple times, in fact. I'm not sure how much work we need to expend saving these people from themselves, however. > > And as described above, we need this patch to co-operate with udev. > Kay was pretty adamant about _not_ changing udev here. I'm not suggesting we ask udev to chage. If we want to keep the locking, we need to use our own private lockfiles. > That said, I'm happy with dropping the lock functionality completely; > one possibility would be to use the CLI functionality to route calls > from multipath to multipathd. I don't see why this same issue can't happen for anyone creating any type of dm device build out of multiple physical devices, and AFAIK, lvm doesn't bother to protect users against this, even though it can now autoassemble. So I'm not against dropping this either. -Ben > Cheers, > > Hannes > -- > Dr. Hannes Reinecke zSeries & Storage > hare@xxxxxxx +49 911 74053 688 > SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg > GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg) -- dm-devel mailing list dm-devel@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/dm-devel