Hello Community!
I would appreciate any help or suggestions with the massive RGW outage we are experiencing.
The cluster's overall status is acceptable (HEALTH_WARN because some PGs were
not scrubbed in time), and the cluster is operational.
However, all RGWs fail to start with a core dump.
The only issue I see at the moment is the RGW GC queue (radosgw-admin gc
list), which contains 600K records.
I believe this could be the root cause of the issue. When I pause OSD I/O
(ceph osd pause), all RGWs start with no issues.
There are no large OMAPs or any other warnings in ceph -s output.
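
For anyone who wants to compare queue sizes, here is a minimal sketch of how the GC record count could be measured. It assumes that radosgw-admin gc list prints a JSON array in which each entry carries an "objs" list; that output shape is an assumption and may differ between releases. The helper names count_gc_entries and fetch_gc_list are hypothetical.

```python
import json
import subprocess


def count_gc_entries(gc_json: str) -> tuple[int, int]:
    """Return (number of GC entries, total number of objects they reference).

    Assumes the input is a JSON array of entries, each optionally holding
    an "objs" list (an assumption about the radosgw-admin output format).
    """
    entries = json.loads(gc_json)
    total_objs = sum(len(entry.get("objs", [])) for entry in entries)
    return len(entries), total_objs


def fetch_gc_list() -> str:
    """Run the same command quoted in this post and return its stdout."""
    result = subprocess.run(
        ["radosgw-admin", "gc", "list"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

On a node with admin credentials, count_gc_entries(fetch_gc_list()) would give the two counts; with a queue this large the command itself may take a while to return.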

I would appreciate any help or suggestions you can provide.

