On Mon, Dec 09, 2019 at 05:01:04PM -0800, Paul Choi wrote: > Hello, > Anybody seeing the Prometheus endpoint hanging with the new 13.2.7 > release? > With 13.2.6 the endpoint would respond with a payload of 15MB in less > than 10 seconds. I'd guess its not the prometheus module itself: $ git diff v13.2.6 v13.2.7 src/pybind/mgr/prometheus diff --git a/src/pybind/mgr/prometheus/module.py b/src/pybind/mgr/prometheus/module.py index 2d4472434a..3a398e0b0c 100644 --- a/src/pybind/mgr/prometheus/module.py +++ b/src/pybind/mgr/prometheus/module.py @@ -142,7 +142,8 @@ class Metric(object): def promethize(path): ''' replace illegal metric name characters ''' - result = path.replace('.', '_').replace('+', '_plus').replace('::', '_') + result = path.replace('.', '_').replace( + '+', '_plus').replace('::', '_').replace(' ', '_') # Hyphens usually turn into underscores, unless they are # trailing @@ -720,7 +721,8 @@ class Module(MgrModule): raise cherrypy.HTTPError(503, 'No MON connection') # Make the cache timeout for collecting configurable - self.collect_timeout = self.get_localized_config('scrape_interval', 5.0) + self.collect_timeout = float(self.get_localized_config( + 'scrape_interval', 5.0)) server_addr = self.get_localized_config('server_addr', DEFAULT_ADDR) server_port = self.get_localized_config('server_port', DEFAULT_PORT) So the mgr would be a likely suspect. If you could open a tracker ticket, ideally with mgr debug logs attached, this can be looked at. > Now, if you restart ceph-mgr, the Prometheus endpoint responds quickly > for the first run, then successive runs get slower and slower, until it > takes several minutes. > I have no customization for the mgr module. Except for the Prometheus > module, the "status" module and "Zabbix" module are working fine. > This is on Ubuntu 16 LTS: > ii ceph-mgr 13.2.7-1xenial > amd64 manager for the ceph distributed storage > system > I'd love to know if there's a way to diagnose this issue - I tried > upping the debug ms level but that doesn't seem to yield useful > information. > I don't know if this useful, but "prometheus self-test" is fine too. > $ sudo ceph tell mgr.0 prometheus self-test > Self-test OK > pchoi@pchoi-desktop:~$ ceph mgr module ls > { > "enabled_modules": [ > "prometheus", > "status", > "zabbix" > ], > "disabled_modules": [ > { > "name": "balancer", > "can_run": true, > "error_string": "" > }, > { > "name": "crash", > "can_run": true, > "error_string": "" > }, > { > "name": "dashboard", > "can_run": true, > "error_string": "" > }, > { > "name": "hello", > "can_run": true, > "error_string": "" > }, > { > "name": "influx", > "can_run": false, > "error_string": "influxdb python module not found" > }, > { > "name": "iostat", > "can_run": true, > "error_string": "" > }, > { > "name": "localpool", > "can_run": true, > "error_string": "" > }, > { > "name": "restful", > "can_run": true, > "error_string": "" > }, > { > "name": "selftest", > "can_run": true, > "error_string": "" > }, > { > "name": "smart", > "can_run": true, > "error_string": "" > }, > { > "name": "telegraf", > "can_run": true, > "error_string": "" > }, > { > "name": "telemetry", > "can_run": true, > "error_string": "" > } > ] > } > pchoi@pchoi-desktop:~$ ceph mgr services > { > "prometheus": "[1]http://woodenbox2:9283/" > } > >References > > 1. http://woodenbox2:9283/ >_______________________________________________ >ceph-users mailing list -- ceph-users@xxxxxxx >To unsubscribe send an email to ceph-users-leave@xxxxxxx -- Jan Fajerski Senior Software Engineer Enterprise Storage SUSE Software Solutions Germany GmbH Maxfeldstr. 5, 90409 Nürnberg, Germany (HRB 36809, AG Nürnberg) Geschäftsführer: Felix Imendörffer _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx