Re: Understanding "ceph -w" output - cluster monitoring

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Mar 15, 2016 at 6:38 AM, Blade Doyle <blade.doyle@xxxxxxxxx> wrote:
>
>
> On Mon, Mar 14, 2016 at 3:48 PM, Christian Balzer <chibi@xxxxxxx> wrote:
>>
>>
>> Hello,
>>
>> On Mon, 14 Mar 2016 09:16:13 -0700 Blade Doyle wrote:
>>
>> > Hi Ceph Community,
>> >
>> > I am trying to use "ceph -w" output to monitor my ceph cluster.  The
>> > basic setup is:
>> >
>> > A python script runs ceph -w and processes each line of output.  It
>> > finds
>> > the data it wants and reports it to InfluxDB.  I view the data using
>> > Grafana, and Ceph Dashboard.
>> >
>>
>> A much richer and more precise source of information would be the various
>> performance counters and using collectd to feed them into graphite and
>> friends.
>> http://lists.ceph.com/pipermail/ceph-users-ceph.com/2014-May/039953.html
>>
>> I'm using the DWM one, YMMV.
>
>
> Thanks much for your reply, Christian.
>
> Ugh.  Ok, then it looks like the key info here is to get the data from the
> osd/mon sockets.  Forgive me for not digging too deep yet, but it looks like
> I would do something like:
>
> ceph --admin-daemon /var/run/ceph/ceph-osd.4.asok perf dump

Only if you want per-daemon stats.

> * which of that data is read/write bytes?
> * Is that data for the entire cluster, or just that osd?  (would I need to
> read data from each individual osd sock in the cluster?)

Please have a look at the link I posted.  There is an existing piece
of code there for doing stats collection, and it supports both
gathering stats from every daemon (you can sum them yourself) or
gathering the already-summed stats from the mon (much simpler if you
don't need more detail).

Remember that the diamond code is free software: even if you don't
want to use diamond you're completely free to just copy what it does.

As for the meaning of stats, you'll mostly find that it's either
obvious ("num_read_kb", "num_write_kb" etc) or completely obscure
("num_evict_mode_some").  As long as you only want the obvious ones
you'll be fine :-)

John
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux