handling osd removal with ceph-volume?

After speaking with Alfredo and the orchestrator team, it seems there
are some open questions (well, maybe just questions whose answers need
to be written down) about OSD removal with ceph-volume.

Feel free to expand the scope of this thread to the many different
destruction / deactivation scenarios, but we have been driven
initially by the conversion of one ceph-ansible playbook that removes
a specific OSD from the cluster that boils down to:

  1. ceph-disk deactivate --deactivate-by-id ID --mark-out
  2. ceph-disk destroy --destroy-by-id ID --zap
  3. < manually destroy partitions from `ceph-disk list` >

To accomplish the equivalent without ceph-disk we are doing the following:

  1. ceph osd out ID
  2. systemctl disable ceph-osd@ID
  3. systemctl stop ceph-osd@ID
  4. something equivalent to:
    | osd_devs = ceph-volume lvm list --format json
    | for dev in osd_devs[ID]:
    |    ceph-volume lvm zap dev["path"]
  5. ceph osd purge ID
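
As a rough illustration of steps 1-5 (a sketch, not a tested tool), the
sequence could be assembled in Python as below. The function names
removal_commands and remove_osd are hypothetical, and the sketch assumes
that `ceph-volume lvm list --format json` prints a JSON object keyed by
OSD id (as a string) whose values are lists of records with a "path"
field, as the pseudocode in step 4 implies.

```python
import json
import subprocess


def removal_commands(osd_id, lvm_list_json):
    """Return the ordered commands for removing one OSD (steps 1-5).

    lvm_list_json is the text printed by
    `ceph-volume lvm list --format json`; its layout (OSD id -> list of
    records with a "path" key) is an assumption taken from step 4.
    """
    osd_id = str(osd_id)
    devices = json.loads(lvm_list_json).get(osd_id, [])

    cmds = [
        ["ceph", "osd", "out", osd_id],
        ["systemctl", "disable", "ceph-osd@" + osd_id],
        ["systemctl", "stop", "ceph-osd@" + osd_id],
    ]
    # Zap every device that ceph-volume reports for this OSD.
    for dev in devices:
        cmds.append(["ceph-volume", "lvm", "zap", dev["path"]])
    # Mirrors step 5 as written; the real CLI may want an extra
    # confirmation flag, which is deliberately not guessed at here.
    cmds.append(["ceph", "osd", "purge", osd_id])
    return cmds


def remove_osd(osd_id, run=subprocess.check_call):
    """Query ceph-volume, then run each removal command in order."""
    listing = subprocess.check_output(
        ["ceph-volume", "lvm", "list", "--format", "json"]
    )
    for cmd in removal_commands(osd_id, listing):
        run(cmd)
```

Keeping the command list separate from its execution makes it possible
to inspect or log the plan before anything destructive runs.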

Judging from the Ceph docs and ceph-volume itself, this list seems to
be complete. Is anything missing? Similar questions are raised here:
http://tracker.ceph.com/issues/22287

Of these steps, the primary question that has come up is how to
perform, outside of ceph-volume, the inverse of the systemd unit
management that ceph-volume takes care of during OSD creation (e.g.
the ceph-osd and ceph-volume units), and whether that inverse
operation should be part of ceph-volume itself.

My understanding of the systemd process for ceph is that the
ceph-volume unit itself activates the corresponding OSD using the
ceph-osd systemd template, so there aren't any OSD-specific unit files
to clean up when an OSD is removed. That still leaves the question of
how to properly remove the ceph-volume units, if that is indeed what
needs to happen. Glancing over the zap code, it doesn't look like zap
handles that task. Related tracker here:
http://tracker.ceph.com/issues/25029
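
For illustration only, here is a sketch of what disabling the
ceph-volume unit for one OSD might look like. It assumes the unit
instance is named ceph-volume@lvm-<osd id>-<osd fsid>; that naming
scheme, and the helper names volume_unit_name and disable_volume_unit,
are assumptions rather than something confirmed in this thread.

```python
import subprocess


def volume_unit_name(osd_id, osd_fsid):
    """Build the ceph-volume systemd unit name for an OSD.

    Assumed naming scheme: ceph-volume@lvm-<osd id>-<osd fsid>.
    """
    return "ceph-volume@lvm-{}-{}".format(osd_id, osd_fsid)


def disable_volume_unit(osd_id, osd_fsid, run=subprocess.check_call):
    """Disable the ceph-volume unit so it no longer activates the OSD."""
    cmd = ["systemctl", "disable", volume_unit_name(osd_id, osd_fsid)]
    run(cmd)
    return cmd
```

Whether this belongs in a playbook or inside ceph-volume itself is
exactly the open question above.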

The Ceph docs seem to indicate only that the OSD needs to be stopped;
presumably there are other final clean-up steps?

- Noah


