One thing to track that is not part of the usual suspects is the heal counts.
They should always be 0 unless you have a problem somewhere.
Hello
How do you keep track of the health status of your Gluster volumes? When Brick went down (crash, failure, shutdown), node failure, peering issue, on-going healing?
Gluster Tendrl is complex and sometimes it's broken, Prometheus exporter still lacking, gstatus is basic.
Currently, to monitor a Gluster volume, a custom script should be used to gather whatever info needed for monitoring or a combination of the mentioned tools.
Can Gluster have something similar to Ceph and display the health of the entire cluster? I know Ceph uses it’s “Monitors” to keep track of everything going inside the cluster, but Gluster should also have a way to keep track of the cluster’s health.
How’s the community experience with Gluster monitoring? How are you managing and tracking alerts and issues? Any recommendations?
Thank you.
--
RespectfullyMahdi
________ Community Meeting Calendar: Schedule - Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC Bridge: https://bluejeans.com/441850968 Gluster-users mailing list Gluster-users@xxxxxxxxxxx https://lists.gluster.org/mailman/listinfo/gluster-users
-- Alvin Starr || land: (647)478-6285 Netvel Inc. || Cell: (416)806-0133 alvin@xxxxxxxxxx ||
________ Community Meeting Calendar: Schedule - Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC Bridge: https://bluejeans.com/441850968 Gluster-users mailing list Gluster-users@xxxxxxxxxxx https://lists.gluster.org/mailman/listinfo/gluster-users