Its Smart Storage battery, which was disabled due to high ambient temperature.
All OSD processes/daemon working as is...but those OSDs not responding to other OSD due to high CPU utilization..
Don't observe the clock skew issue.
On Tue, Apr 16, 2019 at 12:49 PM Marco Gaiarin <gaio@xxxxxxxxx> wrote:
Mandi! M Ranga Swami Reddy
In chel di` si favelave...
> Hello - Recevenlt we had an issue with storage node's battery failure, which
> cause ceph client IO dropped to '0' bytes. Means ceph cluster couldn't perform
> IO operations on the cluster till the node takes out. This is not expected from
> Ceph, as some HW fails, those respective OSDs should mark as out/down and IO
> should go as is..
> Please let me know if anyone seen the similar behavior and is this issue
> resolved?
'battery' mean 'CMOS battery'?
OSDs and MONs need accurate clock sync between them. So, if a node
reboot with a clock skew more than (AFAI Remember well) 5 seconds, OSD
does not start.
Provide a stable NTP server for all your OSDs and MONs, and restart
OSDs after clock are in sync.
--
dott. Marco Gaiarin GNUPG Key ID: 240A3D66
Associazione ``La Nostra Famiglia'' http://www.lanostrafamiglia.it/
Polo FVG - Via della Bontà, 7 - 33078 - San Vito al Tagliamento (PN)
marco.gaiarin(at)lanostrafamiglia.it t +39-0434-842711 f +39-0434-842797
Dona il 5 PER MILLE a LA NOSTRA FAMIGLIA!
http://www.lanostrafamiglia.it/index.php/it/sostienici/5x1000
(cf 00307430132, categoria ONLUS oppure RICERCA SANITARIA)
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com