Hi David It's hard to say with so little information what could be wrong, and I have not seen any response yet, so I thought I could give you something that might help you. I've done a video about setting up the Ceph, Grafana, and Prometheus triangle from scratch, the components responsible for hardware metrics and monitoring. Maybe that could help? https://youtu.be/c8R64LF3JjU And I've also done a separate video about disk prediction and smart data in a Ceph cluster that could give you some insights. https://youtu.be/KFBuqTyxalM I hope this helps. Best regards Daniel On Sun, Sep 5, 2021 at 7:26 AM David Yang <gmydw1118@xxxxxxxxx> wrote: > hi, buddy > > I have a ceph file system cluster, using ceph version 15.2.14. > > But the current status of the cluster is HEALTH_ERR. > > health: HEALTH_ERR > Module 'devicehealth' has failed: > > The content in the mgr log is as follows: > > 2021-09-05T13:20:32.922+0800 7f2b8621b700 0 log_channel(audit) log [DBG]: > from='client.2109753 -'entity='client.admin' cmd=[{"prefix": "fs status", > "target": ["mon-mgr", ""]}]: dispatch > 2021-09-05T13:20:32.922+0800 7f2b86a1c700 0 [status ERROR root] > handle_command > > > How to fix this error, please help, thank you > _______________________________________________ > ceph-users mailing list -- ceph-users@xxxxxxx > To unsubscribe send an email to ceph-users-leave@xxxxxxx > _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx