Re: Gluster monitoring

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

We also looked into tendrl some time ago, but in an enterprise environment it simply can not be used (talking about 40+ gluster clusters), and it indeed randomly fails without proper ways to 'get it up' again.
Apart from regular process monitoring we also make use of a collectd plugin for gluster which works decent enough (https://github.com/gluster/gluster-collectd) and allows for gluster metrics, along-side system metrics to be monitored.

We don't even yet use all the metrics exposed by that collectd plugin, but as a sample dashboard we currently use something like:

Regards,
Nico van Roijen


Van: "Alvin Starr" <alvin@xxxxxxxxxx>
Aan: "gluster-users" <gluster-users@xxxxxxxxxxx>
Verzonden: Dinsdag 27 oktober 2020 18:08:03
Onderwerp: Re: Gluster monitoring

We have been using zabbix for tracking gluster but  that works because we are using zabbix for the rest of our monitoring of things like network and disk IO.

One thing to track that is not part of the usual suspects is the heal counts.
They should always be 0 unless you have a problem somewhere.

On 10/27/20 12:25 AM, Mahdi Adnan wrote:

Hello


 How do you keep track of the health status of your Gluster volumes? When Brick went down (crash, failure, shutdown), node failure, peering issue, on-going healing?


Gluster Tendrl is complex and sometimes it's broken, Prometheus exporter still lacking, gstatus is basic.

Currently, to monitor a Gluster volume, a custom script should be used to gather whatever info needed for monitoring or a combination of the mentioned tools.


Can Gluster have something similar to Ceph and display the health of the entire cluster? I know Ceph uses it’s “Monitors” to keep track of everything going inside the cluster, but Gluster should also have a way to keep track of the cluster’s health.


How’s the community experience with Gluster monitoring? How are you managing and tracking alerts and issues? Any recommendations?


Thank you.


--
Respectfully
Mahdi

________



Community Meeting Calendar:

Schedule -
Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users

-- 
Alvin Starr                   ||   land:  (647)478-6285
Netvel Inc.                   ||   Cell:  (416)806-0133
alvin@xxxxxxxxxx              ||


________



Community Meeting Calendar:

Schedule -
Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users
________



Community Meeting Calendar:

Schedule -
Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC
Bridge: https://meet.google.com/cpu-eiue-hvk
Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users

[Index of Archives]     [Gluster Development]     [Linux Filesytems Development]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux