> Op 27 november 2017 om 12:19 schreef John Spray <jspray@xxxxxxxxxx>: > > > On Mon, Nov 27, 2017 at 10:29 AM, Wido den Hollander <wido@xxxxxxxx> wrote: > > Hi, > > > > For the Zabbix plugin for the Mgr I wanted to report the amount of block and/or slow requests the cluster is experiencing. > > > > There is no item with a int value in the JSON returned by the Monitors. > > > > What would be the easiest way to obtain these values in a Mgr Module? > > > > Or would we need to expand the JSON the MON reports? > > > > I'd like to make a trigger in Zabbix that if num slow requests is > X a admin is alerted. > > > > Right now you would have to parse a string which isn't very stable. > > Kefu has been working on the health checks for slow requests: > https://github.com/ceph/ceph/pull/18614 > https://github.com/ceph/ceph/pull/19114 > > Currently, health checks are very string-ish, but I would really like > them to have more machine-readable stuff (i.e. expand the > health_check_t structure with a generic map to store json-encodable > metadata), and populate that in the same places we generate strings > (e.g. in this instance where PGMap generates the > REQUEST_SLOW/REQUEST_STUCK health checks). > Good! That would be nice, something like this in a JSON: { "block_requests": 13, "slow_requests": 35 } The Zabbix module could pick this values up and send them to Zabbix for further processing. > BTW, I'm curious about the use case for thresholding slow requests on > the number of slow requests: wouldn't you want to alert the admin even > if there was only one? If there are false positives then maybe > mon_osd_warn_op_age is the thing to adjust > Well, not sure. You probably want to alert if one or more occur. I would just want a integer in a JSON somewhere and use it in Zabbix for graphing and alerting. Wido > John -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html