Hi
Thanks, I guess this might have something to do with it:
"
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: debug 2021-06-15T09:44:22.507+0000 7f704e4b3700 -1 mgr notify devicehealth.notify:
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: debug 2021-06-15T09:44:22.507+0000 7f704e4b3700 -1 mgr notify Traceback (most recent call last):
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: File "/usr/share/ceph/mgr/devicehealth/module.py", line 229, in notify
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: self.create_device_pool()
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: File "/usr/share/ceph/mgr/devicehealth/module.py", line 254, in create_device_pool
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: assert r == 0
Jun 15 09:44:22 dcn-ceph-01 bash[3278]: AssertionError
"
I'm not sure why it would be creating a pool. I believe it used to work,
and the pool already exists:
"
# ceph osd dump | grep pool
pool 9 'device_health_metrics' replicated size 2 min_size 1 crush_rule 1 object_hash rjenkins pg_num 32 pgp_num 32 autoscale_mode on last_change 2630 flags hashpspool stripe_width 0 compression_algorithm snappy compression_mode aggressive application health_metrics
"
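One reason the traceback above gives no hint about the cause: a bare `assert r == 0` raises an AssertionError with an empty message, so whatever return code and error text the failing command produced never reach the log. The sketch below illustrates that pattern only. It is not the devicehealth module's code: `fake_mon_command`, `create_pool_bare`, `create_pool_verbose`, and `describe` are hypothetical names, the `(r, outb, outs)` return shape is an assumption, and the return code and error text are made up for the illustration.

```python
from typing import Callable, Tuple


def fake_mon_command(fail: bool) -> Tuple[int, str, str]:
    """Hypothetical stand-in for a monitor command; returns (r, outb, outs)."""
    if fail:
        # Made-up return code and message, for illustration only.
        return -22, "", "simulated error text"
    return 0, "", ""


def create_pool_bare(fail: bool) -> None:
    """Same shape as the failing line in the traceback: a bare assert."""
    r, outb, outs = fake_mon_command(fail)
    assert r == 0  # on failure the exception carries no return code or text


def create_pool_verbose(fail: bool) -> None:
    """Alternative that keeps the return code and error text in the exception."""
    r, outb, outs = fake_mon_command(fail)
    if r != 0:
        raise RuntimeError(f"pool create failed: r={r}, outs={outs!r}")


def describe(func: Callable[[bool], None], fail: bool) -> str:
    """Run func and report the exception type and message, or 'ok'."""
    try:
        func(fail)
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return "ok"


if __name__ == "__main__":
    print(describe(create_pool_bare, True))     # exception type, no details
    print(describe(create_pool_verbose, True))  # includes r and outs
```

If the real module follows this pattern, the underlying error would have to come from elsewhere, for example from the monitor side or from a higher MGR debug level, rather than from this traceback.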
Best regards,
Torkil
On 15/06/2021 11.38, Sebastian Wagner wrote:
Hi Torkil,
you should see more information in the MGR log file.
Might be an idea to restart the MGR to get some recent logs.
On 15.06.21 at 09:41, Torkil Svensgaard wrote:
Hi
Looking at this error in v15.2.13:
"
[ERR] MGR_MODULE_ERROR: Module 'devicehealth' has failed:
Module 'devicehealth' has failed:
"
It used to work. Since the module is always on, I can't seem to restart
it, and I've found no clue as to why it failed. I've tried rebooting
all hosts, to no avail.
Suggestions?
Thanks,
Torkil
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx