Just to update this thread: apparently you were right, we did hit the
limit of mon_max_pg_per_osd * osd_max_pg_per_osd_hard_ratio (250 * 3 =
750). This was found in the logs:
2022-04-06 14:24:55.256 7f8bb5a0e700 1 osd.8 43377
maybe_wait_for_max_pg withhold creation of pg 75.56s16: 750 >= 750
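
In case it helps anyone, the effective values should be visible with
something like this (assuming the centralized config database is in
use; exact commands may vary by release):

  ceph config get osd mon_max_pg_per_osd
  ceph config get osd osd_max_pg_per_osd_hard_ratio
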
This message first came up for the last remaining up OSD on that host
after all other OSDs were purged, and then again for the first OSD that
came up after the rebuild. I'm currently playing around with the
osdmaptool; I have a feeling that this could also be an issue in newer
releases, but that is just speculation at the moment.
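
For anyone who wants to poke at this offline, the idea is roughly the
following (pool 75 taken from the log line above; the exact invocation
is just a sketch):

  ceph osd getmap -o /tmp/osdmap
  # map all PGs of pool 75 and print per-OSD PG counts for this map:
  osdmaptool /tmp/osdmap --test-map-pgs --pool 75
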
As a workaround we'll increase osd_max_pg_per_osd_hard_ratio to 5 and
see how the next attempt will go.
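
Concretely, I'd expect something like this to do it (again assuming
the centralized config database):

  ceph config set osd osd_max_pg_per_osd_hard_ratio 5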
Thanks,
Eugen
Quoting Josh Baergen <jbaergen@xxxxxxxxxxxxxxxx>:
On Wed, Apr 6, 2022 at 11:20 AM Eugen Block <eblock@xxxxxx> wrote:
I'm pretty sure that their cluster isn't anywhere near the limit for
mon_max_pg_per_osd; they currently have around 100 PGs per OSD, and
the configs have not been touched; it's pretty basic.
How is the host being "rebuilt"? Depending on the CRUSH rule, if the
host's OSDs are all marked destroyed and then re-created one at a time
with normal weight, CRUSH may decide to put a large number of PGs on
the first OSD that is created, and so on, until the rest of the host's
OSDs are available to take those PGs.
Josh
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx