Re: Cephadm cannot aquire lock

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




Am 31.08.21 um 04:05 schrieb fcid:
Hi ceph community,

I'm having some trouble trying to delete an OSD.

I've been using cephadm in one of our clusters and it's works fine, but lately, after an OSD failure, I cannot delete it using the orchestrator. Since the orchestrator is not working (for some unknown reason) I tried to manually delete the OSD using the following command:

ceph purge osd <id> --yes-i-really-mean-it

This command removed the OSD from the crush map, but then the warning CEPHADM_FAILED_DEAMON appeared. So the next step is delete de daemon in the server that use to host the failed OSD. The command I used here was the following:

cephadm rm-daemon --name osd.<id> --fsid <FSID>

But this command does not work because, accoding to the log, cephadm cannot aquire lock:

2021-08-30 21:50:09,712 DEBUG Lock 139899822730784 not acquired on /run/cephadm/$FSID.lock, waiting 0.05 seconds ... 2021-08-30 21:50:09,762 DEBUG Acquiring lock 139899822730784 on /run/cephadm/$FSID.lock 2021-08-30 21:50:09,763 DEBUG Lock 139899822730784 not acquired on /run/cephadm/$FSID.lock, waiting 0.05 seconds ...

The file /run/cephadm/$FSID.lock does exist. Can I safely remove it? What should I check before doing such task.

Yes, in case you're sure that no other cephadm process (i.e. call `ps`) is stuck.


I'll really appreciate any hint you can give relating this matter.

Thanks! regards.


_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux