Hi Jean: Thanks for the review... > On Tue, 4 Mar 2008 09:54:40 -0500, Mark M. Hoffman wrote: > > Here is the first in a batch of patches which aim to fix a common class > > of race conditions in most hwmon drivers. Credit goes to Herbert > > Valerio Riedel for pointing this out, here: > > > > http://lists.lm-sensors.org/pipermail/lm-sensors/2007-November/022014.html > > > > To actually trip on any of these race conditions in practice would be > > *exceedingly* rare. In fact, I don't think we've ever seen a report for > > such a case. OTOH, one can see by looking at this patch that the fixes > > are not always completely trivial. > > > > I will concentrate on cranking out the remaining patches as I have time. > > But of course I would like thorough reviews, especially from the driver > > author/maintainer if possible. To all potential reviewers: look not > > just at the patch itself; please also scan the entire driver to see if > > there are any races that I missed. Testing is also much appreciated. * Jean Delvare <khali at linux-fr.org> [2008-03-18 16:55:34 +0100]: > When looking for this kind of race, you should pay particular attention > to macros and inline functions. Historically, some of our macros were > evaluating their arguments more than once. I have tried to clean this > up as we ported the driver to Linux 2.6 but I wouldn't swear that such > macros are all gone; there could be a couple still lurking. > > As for inline functions, I really don't know how gcc handles these. If > it creates a temporary copy of the parameters on the stack, then we're > safe. If not, then even inline functions evaluating their parameters > more than once could hide the race. No, inline functions must behave as regular functions in this regard. Their arguments are only evaluated once. This is almost the entire reason that inline functions are preferred over macros. That said, careful attention is needed for *all* functions. E.g. passing two values from the driver data structure to a function and assuming that they're accessed coherently is still a race... > > diff --git a/drivers/hwmon/adm1026.c b/drivers/hwmon/adm1026.c > > index 904c6ce..92bed53 100644 > > --- a/drivers/hwmon/adm1026.c > > +++ b/drivers/hwmon/adm1026.c > > @@ -838,20 +838,24 @@ static SENSOR_DEVICE_ATTR(in16_max, S_IRUGO | S_IWUSR, show_in16_max, set_in16_m > > static ssize_t show_fan(struct device *dev, struct device_attribute *attr, > > char *buf) > > { > > - struct sensor_device_attribute *sensor_attr = to_sensor_dev_attr(attr); > > - int nr = sensor_attr->index; > > + int nr = to_sensor_dev_attr(attr)->index; > > struct adm1026_data *data = adm1026_update_device(dev); > > - return sprintf(buf, "%d\n", FAN_FROM_REG(data->fan[nr], > > - data->fan_div[nr])); > > + int val; > > + mutex_lock(&data->update_lock); > > + val = FAN_FROM_REG(data->fan[nr], data->fan_div[nr]); > > + mutex_unlock(&data->update_lock); > > + return sprintf(buf, "%d\n", val); > > } ... and here is a good example too. In the snippet above, even if FAN_FROM_REG() were a regular (not inline) function, the lock/unlock would still be necessary to prevent the race. > As you are making the functions bigger, I think it would be a good idea > to leave a blank like between the variable declarations and the "real" > code, as is common practice for non-trivial functions. Same goes for > the rest of the patch, of course. Good point; fixed. > > diff --git a/drivers/hwmon/adt7470.c b/drivers/hwmon/adt7470.c > > index 6b5325f..2c398d5 100644 > > --- a/drivers/hwmon/adt7470.c > > +++ b/drivers/hwmon/adt7470.c > > (...) > > @@ -678,14 +684,15 @@ static ssize_t show_pwm_auto_temp(struct device *dev, > > struct device_attribute *devattr, > > char *buf) > > { > > - struct sensor_device_attribute *attr = to_sensor_dev_attr(devattr); > > + int index = to_sensor_dev_attr(devattr)->index; > > struct adt7470_data *data = adt7470_update_device(dev); > > - u8 ctrl = data->pwm_auto_temp[attr->index]; > > - > > - if (ctrl) > > - return sprintf(buf, "%d\n", 1 << (ctrl - 1)); > > - else > > - return sprintf(buf, "%d\n", ADT7470_PWM_ALL_TEMPS); > > + u8 ctrl; > > + int val; > > + mutex_lock(&data->lock); > > + ctrl = data->pwm_auto_temp[index]; > > + val = ctrl ? (1 << (ctrl - 1)) : ADT7470_PWM_ALL_TEMPS; > > + mutex_unlock(&data->lock); > > + return sprintf(buf, "%d\n", val); > > } > > Not sure why this change is needed? data->pwm_auto_temp[index] is only > accessed once. Unless gcc optimizes the code to get rid of the ctrl > local variable, of course. I really have no idea what gcc is allowed to > do or not in this respect. Oh my, this is soooo tricky. Oops, that's just a thinko on my part. There was no race there; fixed. The compiler is not allowed to "optimize" that way. Updated patch to follow... Thanks & regards, -- Mark M. Hoffman mhoffman at lightlink.com