Am 31.08.21 um 04:05 schrieb fcid:
Hi ceph community,
I'm having some trouble trying to delete an OSD.
I've been using cephadm in one of our clusters and it's works fine,
but lately, after an OSD failure, I cannot delete it using the
orchestrator. Since the orchestrator is not working (for some unknown
reason) I tried to manually delete the OSD using the following command:
ceph purge osd <id> --yes-i-really-mean-it
This command removed the OSD from the crush map, but then the warning
CEPHADM_FAILED_DEAMON appeared. So the next step is delete de daemon
in the server that use to host the failed OSD. The command I used here
was the following:
cephadm rm-daemon --name osd.<id> --fsid <FSID>
But this command does not work because, accoding to the log, cephadm
cannot aquire lock:
2021-08-30 21:50:09,712 DEBUG Lock 139899822730784 not acquired on
/run/cephadm/$FSID.lock, waiting 0.05 seconds ...
2021-08-30 21:50:09,762 DEBUG Acquiring lock 139899822730784 on
/run/cephadm/$FSID.lock
2021-08-30 21:50:09,763 DEBUG Lock 139899822730784 not acquired on
/run/cephadm/$FSID.lock, waiting 0.05 seconds ...
The file /run/cephadm/$FSID.lock does exist. Can I safely remove it?
What should I check before doing such task.
Yes, in case you're sure that no other cephadm process (i.e. call `ps`)
is stuck.
I'll really appreciate any hint you can give relating this matter.
Thanks! regards.
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx