On Tue, Dec 20, 2016 at 5:19 AM, Shubhendu Tripathi <shtripat@xxxxxxxxxx> wrote: > Hi Team, > > Our team is currently working on project named "tendrl" [1][2]. > Tendrl is a management platform for software defined storage system like > Ceph, Gluster etc. > > As part of tendrl we are integrating with collectd to collect performance > data and we maintain the time series data in graphite. > > I have a question at this juncture regarding pool utilization data. > As our thought process goes, we think of using output from command "ceph df" > and parse it to figure out pool utilization data and push it to graphite > using collectd. > The question here is what is/would be performance impact of running "ceph > df" command on ceph nodes. We should be running this command only on mon > nodes I feel. > > Wanted to verify with the team here if this thought process is in right > direction and if so what ideally should be frequency of running the command > "ceph df" from collectd. Have you looked at Collectd's Ceph plugin (https://collectd.org/documentation/manpages/collectd.conf.5.shtml#plugin_ceph) Kind regards, Ruben Kerkhof -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html