Dan, On 21 May 2013, at 00:52, Dan Mick wrote: > On 05/20/2013 01:33 PM, Alex Bligh wrote: >> If I want to remove an osd, I use 'ceph out' before taking it down, i.e. stopping the OSD process, and removing the disk. >> >> How do I (preferably programatically) tell when it is safe to stop the OSD process? The documentation says 'ceph -w', which is not especially helpful, (a) if I want to do it programatically, or (b) if there are other problems in the cluster so ceph was not reporting HEALTH_OK to start with. >> >> Is there a better way? >> > > We've had some discussions about this recently, but there's no great way of doing this right now. OK. So would the following conservative rule work for now? * Don't mark the OSD out until and unless you have ceph HEALTH_OK * Then mark it out * Then you are safe to remove only when it returns to ceph HEALTH_OK The instructions at present say watch ceph -w, but don't say exactly what to watch for. > We should probably have a query option that returns "number of PGs on this OSD" or some such. That would be very useful. -- Alex Bligh _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com