Re: HEALTH_WARN: failed to probe daemons or devices after upgrade to 16.2.6

Hi,

Hmm. 'cephadm ls' run directly on the node does show that there is a mon. I don't quite understand where it came from, and I don't understand why 'ceph orch ps' didn't show this service.

Thank you very much for your help.

No problem. Maybe you experimented earlier and had this node in the placement section? Or did it have the mon label? I'm not sure, but the important thing is that you can clean it up.

P.S. Perhaps you know: is it normal for a service to be in this state?

Although both services share the same name ("node-exporter.s-26-9-17"), they show different FSIDs:

         "fsid": "1ef45b26-dbac-11eb-a357-616c355f48cb",
         "fsid": "46e2b13c-dab7-11eb-810b-a5ea707f1ea1",  --> error state


This suggests that a different cluster existed on this host earlier and was not cleaned up properly. I would suggest checking 'cephadm ls' on that node, comparing the FSIDs, and then removing the failed service.
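
The FSID comparison can be sketched in a few lines of Python. This is only an illustration, not a tested cleanup tool: the helper names find_stale_daemons and removal_commands are made up for this example, the sample data is a trimmed copy of the two entries from this thread, and ACTIVE_FSID is an assumption (I picked the FSID of the running entry; please confirm the real one with 'ceph fsid'). The script only prints the 'cephadm rm-daemon' commands, it does not run anything.

```python
import json


def find_stale_daemons(daemons, active_fsid):
    """Return the `cephadm ls` entries whose FSID is not the active cluster's."""
    return [d for d in daemons if d.get("fsid") != active_fsid]


def removal_commands(stale):
    """Build (but do not execute) one `cephadm rm-daemon` command per entry."""
    return [
        f"cephadm rm-daemon --name {d['name']} --fsid {d['fsid']}"
        for d in stale
    ]


# Trimmed copy of the two node-exporter entries shown in this thread.
sample = json.loads("""
[
  {"name": "node-exporter.s-26-9-17",
   "fsid": "46e2b13c-dab7-11eb-810b-a5ea707f1ea1",
   "state": "error"},
  {"name": "node-exporter.s-26-9-17",
   "fsid": "1ef45b26-dbac-11eb-a357-616c355f48cb",
   "state": "running"}
]
""")

# Assumption: this is the FSID of the cluster currently in use.
# Verify it with `ceph fsid` before removing anything.
ACTIVE_FSID = "1ef45b26-dbac-11eb-a357-616c355f48cb"

stale = find_stale_daemons(sample, ACTIVE_FSID)
for cmd in removal_commands(stale):
    print(cmd)
```

To use it with real data, replace the sample with the JSON printed by 'cephadm ls' on the affected node, and review the printed commands before running any of them.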

Regards,
Eugen


Quoting Fyodor Ustinov <ufm@xxxxxx>:

Hi!

Was there a MON running previously on that host? Do you see the daemon
when running 'cephadm ls'? If so, remove it with 'cephadm rm-daemon
--name mon.s-26-9-17'

Hmm. 'cephadm ls' run directly on the node does show that there is a mon. I don't quite understand where it came from, and I don't understand why 'ceph orch ps' didn't show this service.

Thank you very much for your help.

P.S. Perhaps you know: is it normal for a service to be in this state?

    {
        "style": "cephadm:v1",
        "name": "node-exporter.s-26-9-17",
        "fsid": "46e2b13c-dab7-11eb-810b-a5ea707f1ea1",
        "systemd_unit": "ceph-46e2b13c-dab7-11eb-810b-a5ea707f1ea1@node-exporter.s-26-9-17",
        "enabled": true,
        "state": "error",
        "service_name": "node-exporter",
        "ports": [
            9100
        ],
        "ip": null,
        "deployed_by": [
            "docker.io/ceph/ceph@sha256:54e95ae1e11404157d7b329d0bef866ebbb214b195a009e87aae4eba9d282949"
        ],
        "memory_request": null,
        "memory_limit": null,
        "container_id": null,
        "container_image_name": "docker.io/prom/node-exporter:v0.18.1",
        "container_image_id": null,
        "container_image_digests": null,
        "version": null,
        "started": null,
        "created": "2021-07-03T01:50:48.371104Z",
        "deployed": "2021-07-03T01:50:47.855103Z",
        "configured": "2021-07-03T01:50:48.371104Z"
    },
    {
        "style": "cephadm:v1",
        "name": "node-exporter.s-26-9-17",
        "fsid": "1ef45b26-dbac-11eb-a357-616c355f48cb",
        "systemd_unit": "ceph-1ef45b26-dbac-11eb-a357-616c355f48cb@node-exporter.s-26-9-17",
        "enabled": true,
        "state": "running",
        "service_name": "node-exporter",
        "ports": [
            9100
        ],
        "ip": null,
        "deployed_by": [
            "quay.io/ceph/ceph@sha256:8a0f6f285edcd6488e2c91d3f9fa43534d37d7a9b37db1e0ff6691aae6466530",
            "quay.io/ceph/ceph@sha256:5d042251e1faa1408663508099cf97b256364300365d403ca5563a518060abac"
        ],
        "rank": null,
        "rank_generation": null,
        "memory_request": null,
        "memory_limit": null,
        "container_id": "73d4fb20f2fddf9aa5738b5e3c7c9b098862702989a088c32bad528275f90c19",
        "container_image_name": "quay.io/prometheus/node-exporter:v0.18.1",
        "container_image_id": "e5a616e4b9cf68dfcad7782b78e118be4310022e874d52da85c55923fb615f87",
        "container_image_digests": [
            "docker.io/prom/node-exporter@sha256:a2f29256e53cc3e0b64d7a472512600b2e9410347d53cdc85b49f659c17e02ee",
            "docker.io/prom/node-exporter@sha256:b630fb29d99b3483c73a2a7db5fc01a967392a3d7ad754c8eccf9f4a67e7ee31",
            "quay.io/prometheus/node-exporter@sha256:a2f29256e53cc3e0b64d7a472512600b2e9410347d53cdc85b49f659c17e02ee",
            "quay.io/prometheus/node-exporter@sha256:b630fb29d99b3483c73a2a7db5fc01a967392a3d7ad754c8eccf9f4a67e7ee31"
        ],
        "memory_usage": 26843545,
        "version": "0.18.1",
        "started": "2021-09-17T10:04:31.495483Z",
        "created": "2021-07-03T03:37:26.519462Z",
        "deployed": "2021-09-17T08:12:12.116009Z",
        "configured": "2021-09-17T08:12:15.887998Z"
    },


WBR,
    Fyodor.


