Hi,

It just died again today :/ I've temporarily set up a cron job to restart it every day until I can discuss with the other teams about using something other than Prometheus.

Istvan Szabo
Senior Infrastructure Engineer
---------------------------------------------------
Agoda Services Co., Ltd.
e: istvan.szabo@xxxxxxxxx
---------------------------------------------------

From: Szabo, Istvan (Agoda)
Sent: Tuesday, January 18, 2022 5:16 AM
To: Peter Lieven <pl@xxxxxxx>
Cc: Ceph Users <ceph-users@xxxxxxx>; Daniel Tönnissen <dt@xxxxxxx>; Marco Horch <horch@xxxxxxx>
Subject: Re: 14.2.22 dashboard periodically dies and didn't failover

Hello,

I've restarted the mgr service on all three mgrs and it failed over to another one. I'm curious whether this mgr will have the issue again or not; if not, I might reinstall the manager on that specific node.

On 2022. Jan 17., at 12:42, Peter Lieven <pl@xxxxxxx> wrote:

On 13.01.22 at 09:19, Szabo, Istvan (Agoda) wrote:

> But in your case the election of the other mgr is successful, am I correct? So the dashboard is always up for you? I'm not sure why it isn't for me; maybe I really need to disable it :/

Has disabling the prometheus module prevented further crashes?

Best,
Peter

_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx