Re: Feeding pool utilization data to time series for trending

Ruben Kerkhof <ruben@xxxxxxxxxxxxxxxx> · Tue, 20 Dec 2016 12:43:51 +0100

On Tue, Dec 20, 2016 at 5:19 AM, Shubhendu Tripathi <shtripat@xxxxxxxxxx> wrote:
> Hi Team,
>
> Our team is currently working on project named "tendrl" [1][2].
> Tendrl is a management platform for software defined storage system like
> Ceph, Gluster etc.
>
> As part of tendrl we are integrating with collectd to collect performance
> data and we maintain the time series data in graphite.
>
> I have a question at this juncture regarding pool utilization data.
> As our thought process goes, we think of using output from command "ceph df"
> and parse it to figure out pool utilization data and push it to graphite
> using collectd.
> The question here is what is/would be performance impact of running "ceph
> df" command on ceph nodes. We should be running this command only on mon
> nodes I feel.
>
> Wanted to verify with the team here if this thought process is in right
> direction and if so what ideally should be frequency of running the command
> "ceph df" from collectd.

Have you looked at Collectd's Ceph plugin
(https://collectd.org/documentation/manpages/collectd.conf.5.shtml#plugin_ceph)

Kind regards,

Ruben Kerkhof
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html