On Wed, 28 Oct 2015, Mike Snitzer wrote: > On Tue, Oct 27 2015 at 7:06pm -0400, > Mikulas Patocka <mpatocka@xxxxxxxxxx> wrote: > > > This fixes a possible race when md->queue->backing_dev_info.congested_fn > > is changed. > > > > Note that Zdenek is seeing some other memory corruption where > > dm_any_congested is called with invalid argument, but it is unlikely to be > > fixed by this patch. > > > > Mikulas > > > > - > > > > From: Mikulas Patocka <mpatocka@xxxxxxxxxx> > > > > The patch bfebd1cdb497a57757c83f5fbf1a29931591e2a4 ("dm: add full blk-mq > > support to request-based DM") moves the initialization of fields > > queuedata, backing_dev_info.congested_fn and > > backing_dev_info.congested_data from the function dm_init_md_queue (that > > is called when the device is created) to dm_init_old_md_queue (that is > > called when type of device is determined). > > > > There is no locking when accessing these variables, thus it is possible > > that other part of the kernel sees queue->backing_dev_info.congested_fn > > initialized and md->queue->backing_dev_info.congested_data uninitialized, > > passing incorrect parameter to the function dm_any_congested. > > > > This patch fixes this race condition by moving initialization of queuedata > > and backing_dev_info.congested_data to the function dm_init_md_queue, so > > that these values are initialized when the device is created. > > > > Signed-off-by: Mikulas Patocka <mpatocka@xxxxxxxxxx> > > Fixes: bfebd1cdb497 ("dm: add full blk-mq support to request-based DM") > > Cc: stable@xxxxxxxxxxxxxxx # v4.1+ > > > > --- > > drivers/md/dm.c | 5 +++-- > > 1 file changed, 3 insertions(+), 2 deletions(-) > > > > Index: linux-4.3-rc7/drivers/md/dm.c > > =================================================================== > > --- linux-4.3-rc7.orig/drivers/md/dm.c 2015-10-27 23:25:41.000000000 +0100 > > +++ linux-4.3-rc7/drivers/md/dm.c 2015-10-27 23:26:45.000000000 +0100 > > @@ -2198,6 +2198,9 @@ static void dm_init_md_queue(struct mapp > > * This queue is new, so no concurrency on the queue_flags. > > */ > > queue_flag_clear_unlocked(QUEUE_FLAG_STACKABLE, md->queue); > > + > > + md->queue->queuedata = md; > > + md->queue->backing_dev_info.congested_data = md; > > } > > I don't like these moving to dm_init_md_queue() because they aren't > needed for blk-mq. No sense establishing data that will go unused in > the blk-mq case. Another possibility is to always set congested_fn = dm_any_congested and test in dm_any_congested if the device is multiqueue. > > static void dm_init_old_md_queue(struct mapped_device *md) > > @@ -2208,9 +2211,7 @@ static void dm_init_old_md_queue(struct > > /* > > * Initialize aspects of queue that aren't relevant for blk-mq > > */ > > - md->queue->queuedata = md; > > md->queue->backing_dev_info.congested_fn = dm_any_congested; > > - md->queue->backing_dev_info.congested_data = md; > > > > blk_queue_bounce_limit(md->queue, BLK_BOUNCE_ANY); > > } > > Wouldn't it be sufficient to simply reorder so the congested_fn > assignment is last? E.g.: > > md->queue->queuedata = md; > md->queue->backing_dev_info.congested_data = md; > md->queue->backing_dev_info.congested_fn = dm_any_congested; This is incorrect - the compiler or processor may reorder memory accesses - it needs write barrier when modifying the fields and read barrier when reading them. The write barrier could be added here, but the read barrier would slow down all calls to the function congested_fn. Mikulas -- dm-devel mailing list dm-devel@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/dm-devel