Can't clear UPGRADE_REDEPLOY_DAEMON after fix

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi, 

I just upgraded my cluster from 16.2.4 to 16.2.5 and I had an error during the upgrade of the first osd daemon (cf below). I fixed the error (I just created the missing directory on all hosts), then resume the upgrade. Now, everything is OK but I still have a warning:" [WRN] UPGRADE_REDEPLOY_DAEMON: Upgrading daemon osd.0 on host cephp01 failed" and "ceph health detail" continues to show the old error: 

HEALTH_OK (muted: UPGRADE_REDEPLOY_DAEMON) 
(MUTED) [WRN] UPGRADE_REDEPLOY_DAEMON: Upgrading daemon osd.0 on host cephp01 failed. 
Upgrade daemon: osd.0: cephadm exited with an error code: 1, stderr:Redeploy daemon osd.0 ... 
Traceback (most recent call last): 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 8230, in <module> 
main() 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 8218, in main 
r = ctx.func(ctx) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 1759, in _default_image 
return func(ctx) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 4326, in command_deploy 
ports=daemon_ports) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 2632, in deploy_daemon 
c, osd_fsid=osd_fsid, ports=ports) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 2801, in deploy_daemon_units 
install_sysctl(ctx, fsid, daemon_type) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 2963, in install_sysctl 
_write(conf, lines) 
File "/var/lib/ceph/70126dbe-cdd7-11eb-a661-b49691b21313/cephadm.d4237e4639c108308fe13147b1c08af93c3d5724d9ff21ae797eb4b78fea3931", line 2948, in _write 
with open(conf, 'w') as f: 
FileNotFoundError: [Errno 2] No such file or directory: '/usr/lib/sysctl.d/90-ceph-70126dbe-cdd7-11eb-a661-b49691b21313-osd.conf' 


I checked that the osd.0 daemon has been correctly upgraded to v16.2.5 and is running (using dashboard and "cephadm ls" on the node), and "ceph orch upgrade status" returns that upgrade is completed. 
I use "ceph health mute UPGRADE_REDEPLOY_DAEMON" to hide the warning but I would prefer to clean it and I can't find something related to this warning in the documentation. 

Do you know if there is a command to clean this warning while the cluster stills up and serving cephfs clients ? 

Regards, 
Arnaud 

_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux