On 04/05/2010 12:40 PM, Doug Ledford wrote: > We have incremental mode for supporting hot plug of devices. However, > we don't support hot-unplug. This is something we need in order to > enable automatic device replacement. When implementing the support > though, I found that we had a problem in that the device special file is > already gone (well, not gone, but both open and lstat fail on the device > special file, which really makes Manage_subdevs unhappy) by the time > udev calls us to remove the device (even if I created a new udev rules > file with a very low number this was still true). So, along with adding > the code shamelessly stolen from Hawrylewicz with modifications to make > it work on non-container based arrays, I had to modify Manage_subdevs to > work on sysfs subdevice entries in the fail and remove cases. Anyway, > with this in place, and with the modified udev rules file in place, > hot-unplug now works (or at least it does in my testing, I didn't try a > container device, I'll leave tweaking that to Dan or someone else from > Intel, although it should more or less work, I don't know the Intel > metadata rules and so didn't try to make it work myself). And by > changing Manage_subdevs to use either a valid device special file or > sysfs entries, it will work on the command line with virtually any > kernel, and work from a udev rules file on kernels recent enough to have > the state entry in sysfs subdevs directories. > > Oh, and Neil should review my man page additions to see if they are > sufficient for his tastes. I didn't get real wordy about the support, > it's pretty straight forward. > Minor incremental fixup: In the case of passing in faulty or disconnected as the device name, since we now use the value of tfd to determine if we should attempt ioctls or go straight to using sysfs entries, we now need to make sure we init tdf and then set it properly in both of the loops where we check for faulty and disconnected devices (although I'm now highly suspicious of the faulty check code as I suspect all the faulty devices will have the same problem that our hot unplug code ran into and the faulty devices will not be openable and that will mean that passing in faulty is probably just broken at this point in time...but that's another patch for another day). -- Doug Ledford <dledford@xxxxxxxxxx> GPG KeyID: CFBFF194 http://people.redhat.com/dledford Infiniband specific RPMs available at http://people.redhat.com/dledford/Infiniband
commit ddda49f6daf481471334f5675fbc1fa149d83ad6 Author: Doug Ledford <dledford@xxxxxxxxxx> Date: Tue Apr 6 12:17:25 2010 -0400 Minor bug fix to incremental remove support Signed-off-by: Doug Ledford <dledford@xxxxxxxxxx> diff --git a/Manage.c b/Manage.c index 6539eda..b15586b 100644 --- a/Manage.c +++ b/Manage.c @@ -386,6 +386,7 @@ int Manage_subdevs(char *devname, int fd, next = dv->next; jnext = 0; + tfd = -1; if (strcmp(dv->devname, "failed")==0 || strcmp(dv->devname, "faulty")==0) { @@ -406,6 +407,7 @@ int Manage_subdevs(char *devname, int fd, stb.st_rdev = makedev(disc.major, disc.minor); next = dv; jnext = j+1; + tfd = 0; sprintf(dvname,"%d:%d", disc.major, disc.minor); dnprintable = dvname; break; @@ -440,6 +442,7 @@ int Manage_subdevs(char *devname, int fd, stb.st_rdev = makedev(disc.major, disc.minor); next = dv; jnext = j+1; + tfd = 0; dnprintable = dvname; break; }
Attachment:
signature.asc
Description: OpenPGP digital signature