Given the reference to that cherrypy backports stuff in the traceback, I'll just mention we are in the process of removing that from the code as we've seen issues with it in our testing as well ( https://github.com/ceph/ceph/pull/60602 / https://tracker.ceph.com/issues/68802). We want that patch in squid, reef, and quincy so FWIW the next release of each of those branches shouldn't have this issue any more. On Tue, Nov 19, 2024 at 8:15 AM Chris Palmer <chris.palmer@xxxxxxxxx> wrote: > I've just applied routine Centos 9 updates to one node of a Reef 18.2.4 > system (package install). They include some python3 updates that break > the MGR in at least two ways. > > When the MGR starts in standby, it immediately logs the following two > crashes. Only the dashboard and restful services are enabled (prometheus > is not enabled). We have mgr/dashboard/FEATURE_TOGGLE_DASHBOARD = false. > > { > "archived": "2024-11-19 10:30:46.031489", > "backtrace": [ > " File \"/usr/share/ceph/mgr/dashboard/__init__.py\", line 60, > in <module>\n from .module import Module, StandbyModule # noqa: F401", > " File \"/usr/share/ceph/mgr/dashboard/module.py\", line 51, > in <module>\n patch_cherrypy(cherrypy.__version__)", > " File > \"/usr/share/ceph/mgr/dashboard/cherrypy_backports.py\", line 197, in > patch_cherrypy\n accept_exceptions_from_builtin_ssl(ver)", > " File > \"/usr/share/ceph/mgr/dashboard/cherrypy_backports.py\", line 113, in > accept_exceptions_from_builtin_ssl\n patch_builtin_ssl_wrap(v, > accept_ssl_errors)", > " File > \"/usr/share/ceph/mgr/dashboard/cherrypy_backports.py\", line 75, in > patch_builtin_ssl_wrap\n from cherrypy.wsgiserver.ssl_builtin import > BuiltinSSLAdapter as builtin_ssl", > "ModuleNotFoundError: No module named 'cherrypy.wsgiserver'" > ], > "ceph_version": "18.2.4", > "crash_id": > "2024-11-19T09:19:40.427015Z_f96f4d70-2112-47b5-96f7-5e9bd463e8eb", > "entity_name": "mgr.ceph1", > "mgr_module": "dashboard", > "mgr_module_caller": "PyModule::load_subclass_of", > "mgr_python_exception": "ModuleNotFoundError", > "os_id": "centos", > "os_name": "CentOS Stream", > "os_version": "9", > "os_version_id": "9", > "process_name": "ceph-mgr", > "stack_sig": > "9c98ebf1b6831bfca2823f54c9e6be01306090c2a7749def2ed8a15167fc527a", > "timestamp": "2024-11-19T09:19:40.427015Z", > "utsname_hostname": "ceph1.xxxxxx", > "utsname_machine": "x86_64", > "utsname_release": "6.1.112-1.el9.elrepo.x86_64", > "utsname_sysname": "Linux", > "utsname_version": "#1 SMP PREEMPT_DYNAMIC Mon Sep 30 13:59:36 EDT > 2024" > } > > > { > "archived": "2024-11-19 10:30:46.056007", > "backtrace": [ > " File \"/usr/share/ceph/mgr/prometheus/__init__.py\", line 2, > in <module>\n from .module import Module, StandbyModule", > " File \"/usr/share/ceph/mgr/prometheus/module.py\", line 38, > in <module>\n v = Version(cherrypy.__version__)", > " File > \"/lib/python3.9/site-packages/pkg_resources/_vendor/packaging/version.py\", > > line 277, in __init__\n raise InvalidVersion(\"Invalid version: > '{0}'\".format(version))", > "pkg_resources.extern.packaging.version.InvalidVersion: Invalid > version: 'unknown'" > ], > "ceph_version": "18.2.4", > "crash_id": > "2024-11-19T09:19:42.188291Z_c112de10-cdd5-4ed3-86b5-04dffd660cb8", > "entity_name": "mgr.ceph1", > "mgr_module": "prometheus", > "mgr_module_caller": "PyModule::load_subclass_of", > "mgr_python_exception": "InvalidVersion", > "os_id": "centos", > "os_name": "CentOS Stream", > "os_version": "9", > "os_version_id": "9", > "process_name": "ceph-mgr", > "stack_sig": > "7fb0c6c17573887e8772c311aac1f3547c8366e50d6f2f8d8bd38deb6e0e9405", > "timestamp": "2024-11-19T09:19:42.188291Z", > "utsname_hostname": "ceph1.xxxxxx", > "utsname_machine": "x86_64", > "utsname_release": "6.1.112-1.el9.elrepo.x86_64", > "utsname_sysname": "Linux", > "utsname_version": "#1 SMP PREEMPT_DYNAMIC Mon Sep 30 13:59:36 EDT > 2024" > } > > > The MGR can be made active, but attempts to use the dashboard generate > more of these. > > The culprit turns out to be python3-jaraco-text from the epel > repository, which upgraded from python3-jaraco-text-3.2.0-6.el9.noarch > to python3-jaraco-text-4.0.0-2.el9.noarch. (It also installed > python3-jaraco-context-6.0.1-3.el9.noarch as a new dependency). > Reverting to python3-jaraco-text-3.2.0 has avoided the problem for now, > but that's not a sustainable long-term fix. > > _______________________________________________ > ceph-users mailing list -- ceph-users@xxxxxxx > To unsubscribe send an email to ceph-users-leave@xxxxxxx > _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx