Re: separate monitoring node


 



Quoting Reed Dier (reed.dier@xxxxxxxxxxx):
> 
> > On Jun 22, 2018, at 2:14 AM, Stefan Kooman <stefan@xxxxxx> wrote:
> > 
> > Just checking here: Are you using the telegraf ceph plugin on the nodes?
> > In that case you _are_ duplicating data. But the good news is that you
> > don't need to. There is a Ceph mgr telegraf plugin now (mimic) which
> > also works on luminous: http://docs.ceph.com/docs/master/mgr/telegraf/
> 
> Hi Stefan,
> 
> I’m just curious what advantage you see in the telegraf plugin
> feeding into influxdb, rather than the influxdb plugin that already
> exists in ceph-mgr.  What is the benefit of outputting into telegraf
> and then into influx, unless you are outputting to a different TSDB
> from Telegraf?

We have ceph running in a "storage vrf", which uses routable IPv6 but
is not reachable from the "internet vrf". Besides that, we have an
out-of-band management interface running in a network namespace that
can reach the Internet. We use that to send data to monitoring / influx.
We have two telegraf instances running on each host: 1) one in the
default (ceph) network namespace, with a listener for ceph (mgr) data,
which also pushes its telegraf data to 2) a telegraf-mgmt instance with
a listener (socket). By using sockets we can "escape" the namespace
barrier only for telegraf data. TL;DR: namespaces give you isolation,
but make you jump through hoops. That was the main reason why we
sponsored the development of a telegraf mgr plugin.
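
To illustrate the socket hand-off described above, here is a minimal
Python sketch, not our actual setup. It assumes a path-based Unix
datagram socket, which lives on the filesystem rather than in a network
namespace, and it passes one record in Influx line protocol from a
sender to a listener. The socket path, measurement name, and helper
names (format_line, open_listener, send_line) are hypothetical and only
there for illustration; real telegraf instances would take the place of
both ends.

```python
import os
import socket
import tempfile


def format_line(measurement, tags, fields, ts_ns):
    """Build one line-protocol record (no escaping; illustration only)."""
    tag_part = "".join(f",{k}={v}" for k, v in sorted(tags.items()))
    field_part = ",".join(f"{k}={v}" for k, v in sorted(fields.items()))
    return f"{measurement}{tag_part} {field_part} {ts_ns}\n"


def open_listener(path):
    """Bind a datagram Unix socket at a filesystem path (the 'mgmt' side)."""
    if os.path.exists(path):
        os.unlink(path)  # remove a stale socket file from an earlier run
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_DGRAM)
    sock.bind(path)
    return sock


def send_line(path, line):
    """Send one record to the listener (the 'ceph namespace' side)."""
    with socket.socket(socket.AF_UNIX, socket.SOCK_DGRAM) as sock:
        sock.sendto(line.encode("utf-8"), path)


if __name__ == "__main__":
    # Hypothetical socket location; both sides only need to see this path.
    path = os.path.join(tempfile.mkdtemp(), "telegraf-mgmt.sock")
    listener = open_listener(path)
    record = format_line("ceph_demo", {"host": "node1"}, {"value": 1},
                         1529654400000000000)
    send_line(path, record)
    print(listener.recv(4096).decode("utf-8"), end="")
    listener.close()
```

The point of the sketch is only that the two processes share a socket
path and nothing else, so no route between the two networks is needed.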

> 
> Still have my OSDs reporting their own stats in collectd daemons on
> all of my OSD nodes, as a supplement to the direct ceph-mgr ->
> influxdb statistics.  Almost moved everything to telegraf after
> Luminous broke some collectd data collection, but it all got sorted
> out.

Yeah, not all info is available in the manager yet :/. I hope this will
change. There are some PRs out from Wido that should fix this. The
telegraf mgr plugin is a drop-in replacement for the influx module (and
provides some extra metrics). It also gives you the possibility to
configure more advanced stuff (TLS handling) in a separate tool, instead
of relying on the limited functionality in the ceph module.

This whole thing gets rendered obsolete pretty quickly by the new
dashboard v2 from SUSE and the prometheus / proxy support for grafana
... but options are still a good thing I think ;).

Gr. Stefan

-- 
| BIT BV  http://www.bit.nl/        Kamer van Koophandel 09090351
| GPG: 0xD14839C6                   +31 318 648 688 / info@xxxxxx
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



