I'm by no means an expert, but from what I understand you do need to stick to numbering from zero if you want things to work out in the long term. Is there a chance that the cluster didn't finish bringing things back up to full replication before osd's were removed? If I were moving from 0,1 to 2,3 I'd bring both 2 and 3 up, set the weight of 0,1 to zero and let all of the pg's get active+clean again then remove 0,1. Doing your swap I might bring up 2 under rack az2, set 1 to weight 0, stop 1 after getting active+clean and remake what is now 3 as 1 and bring it back in as 1 with full weight, then finally drop 2 to weight zero and remove after active+clean. I'd follow on doing a similar shuffle for the now inactive former osd 1 the current osd 0 and the future osd 0 which was osd 2. Clear as mud? On Jul 19, 2013, at 7:03 PM, Pawel Veselov <pawel.veselov@xxxxxxxxx> wrote:
|
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com