Re: fail delete "daemon(s) not managed by cephadm"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Try on the mentioned host if there is a daemon with:

cephadm ls | grep apcepfpspsp0111

If there is one you can remove it with cephadm rm-daemon …

Sometimes a MGR failover clears up that message:

ceph mgr fail

Zitat von farhad kh <farhad.khedriyan@xxxxxxxxx>:

hi everyone
i have a warning ` 1 stray daemon(s) not managed by cephadm`

# ceph health detail
HEALTH_WARN 1 stray daemon(s) not managed by cephadm
[WRN] CEPHADM_STRAY_DAEMON: 1 stray daemon(s) not managed by cephadm
    stray daemon mon.apcepfpspsp0111 on host apcepfpspsp0111 not
managed by cephadm

and in detail :

# ceph crash info
2023-01-17T18:25:19.601062Z_4affd3b9-f486-4ae5-801b-ede231e2b624
{
    "archived": "2023-01-18 19:31:58.200096",
    "assert_condition": "abort",
    "assert_file":
"/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.2.5/rpm/el8/BUILD/ceph-17.2.5/src/mon/MonitorDBStore.h",
    "assert_func": "int
MonitorDBStore::apply_transaction(MonitorDBStore::TransactionRef)",
    "assert_line": 355,
    "assert_msg":
"/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.2.5/rpm/el8/BUILD/ceph-17.2.5/src/mon/MonitorDBStore.h:
In function 'int
MonitorDBStore::apply_transaction(MonitorDBStore::TransactionRef)'
thread 7fabeb737700 time
2023-01-17T18:25:19.568046+0000\n/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.2.5/rpm/el8/BUILD/ceph-17.2.5/src/mon/MonitorDBStore.h:
355: ceph_abort_msg(\"failed to write to db\")\n",
    "assert_thread_name": "ms_dispatch",
    "backtrace": [
        "/lib64/libpthread.so.0(+0x12cf0) [0x7fabf5949cf0]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_abort(char const*, int, char const*,
std::__cxx11::basic_string<char, std::char_traits<char>,
std::allocator<char> > const&)+0x197) [0x7fabf799eb5f]",
"(MonitorDBStore::apply_transaction(std::shared_ptr<MonitorDBStore::Transaction>)+0x88f)
[0x55fe9cc3a68f]",
        "(Elector::persist_epoch(unsigned int)+0x184) [0x55fe9ccc64c4]",
        "(ElectionLogic::bump_epoch(unsigned int)+0x5d) [0x55fe9ccccd5d]",
        "(ElectionLogic::propose_classic_prefix(int, unsigned
int)+0x3c7) [0x55fe9ccce8e7]",
        "(ElectionLogic::propose_classic_handler(int, unsigned
int)+0x2b) [0x55fe9cccef9b]",
        "(Elector::handle_propose(boost::intrusive_ptr<MonOpRequest>)+0x6f5)
[0x55fe9ccc6175]",
        "(Elector::dispatch(boost::intrusive_ptr<MonOpRequest>)+0xcdb)
[0x55fe9ccc756b]",
        "(Monitor::dispatch_op(boost::intrusive_ptr<MonOpRequest>)+0x11c2)
[0x55fe9cc0aeb2]",
        "(Monitor::_ms_dispatch(Message*)+0x406) [0x55fe9cc0b5e6]",
        "(Dispatcher::ms_dispatch2(boost::intrusive_ptr<Message>
const&)+0x5d) [0x55fe9cc3bdad]",
        "(Messenger::ms_deliver_dispatch(boost::intrusive_ptr<Message>
const&)+0x478) [0x7fabf7c18c88]",
        "(DispatchQueue::entry()+0x50f) [0x7fabf7c160cf]",
        "(DispatchQueue::DispatchThread::entry()+0x11) [0x7fabf7cdd8f1]",
        "/lib64/libpthread.so.0(+0x81ca) [0x7fabf593f1ca]",
        "clone()"
    ],
    "ceph_version": "17.2.5",
    "crash_id":
"2023-01-17T18:25:19.601062Z_4affd3b9-f486-4ae5-801b-ede231e2b624",
    "entity_name": "mon.apcepfpspsp0101",
    "os_id": "centos",
    "os_name": "CentOS Stream",
    "os_version": "8",
    "os_version_id": "8",
    "process_name": "ceph-mon",
    "stack_sig":
"74f29b024a3ec0f1395579505c5728e8ad5fd5a77c4a92437d696fa6fd1a8e20",
    "timestamp": "2023-01-17T18:25:19.601062Z",
    "utsname_hostname": "apcepfpspsp0101",
    "utsname_machine": "x86_64",
    "utsname_release": "5.4.17-2136.310.7.1.el8uek.x86_64",
    "utsname_sysname": "Linux",
    "utsname_version": "#2 SMP Wed Aug 17 15:14:08 PDT 2022"
}

but i have 3 instance from mon modul

NAME               PORTS        RUNNING  REFRESHED  AGE  PLACEMENT
alertmanager       ?:9093,9094      3/3  2m ago     9M   label:mon
crash                               6/6  9m ago     9M   *
grafana            ?:3000           3/3  2m ago     4M
count-per-host:1;label:mon
mds.rook                            3/3  2m ago     8M
apcepfpspsp0101;apcepfpspsp0103;apcepfpspsp0105;count:3
mgr                                 2/2  2m ago     9M   count:2
mon                                 3/3  2m ago     2h   label:mon
node-exporter      ?:9100           6/6  9m ago     9M   *
osd                                   3  2m ago     -    <unmanaged>
osd.cost_capacity                    15  9m ago     4M   *
prometheus         ?:9095           3/3  2m ago     9M   label:mon

whene i want remove this get

# ceph orch daemon rm mon.apcepfpspsp0111 --force
Error EINVAL: Unable to find daemon(s) ['mon.apcepfpspsp0111']

I don't have such a daemon

mon.apcepfpspsp0101              apcepfpspsp0101               running (88m)
mon.apcepfpspsp0103              apcepfpspsp0103               running (87m)
mon.apcepfpspsp0105              apcepfpspsp0105               running (87m)

What should I do to remove this warning?
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx


_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux