Re: OSD spend too much time on "waiting for readable" -> slow ops -> laggy pg -> rgw stop -> worst case osd restart

Manuel Lausch <manuel.lausch@xxxxxxxx> · Fri, 5 Nov 2021 14:16:31 +0100

Hi Sage,

I tested again with paxos_propose_interval = 0.3.
Stopping OSDs now causes far fewer slow ops, and when starting OSDs the
slow ops seem to be gone.
With osd_fast_shutdown_notify_mon = true the slow ops are gone
completely. So I would like to keep the shutdown notify enabled.
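
For reference, the combination described above would look roughly like
this as a ceph.conf fragment. This is only a sketch: the section
placement ([mon] / [osd]) is an assumption, and osd_fast_shutdown = true
is carried over from the earlier suggestion quoted below. The option
names and values are the ones discussed in this thread:

  [mon]
  paxos_propose_interval = 0.3

  [osd]
  osd_fast_shutdown = true
  osd_fast_shutdown_notify_mon = true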

As far as I understand it, the paxos propose interval is the interval
in which messages are processed, leading, if necessary, to new
OSDMaps. Is that correct?
I wonder whether a smaller value could cause load issues on bigger
clusters during a larger failure event, such as a host or a whole rack
going down.

Thanks
Manuel

On Thu, 4 Nov 2021 17:51:55 -0500
Sage Weil <sage@xxxxxxxxxxxx> wrote:

> Can you try setting paxos_propose_interval to a smaller number,
> like .3 (by default it is 2 seconds) and see if that has any effect.
> 
> It sounds like the problem is not related to getting the OSD marked
> down (or at least that is not the only thing going on).  My next
> guess is that the peering process that follows needs to get OSDs'
> up_thru values to update and there is delay there.
> 
> Thanks!
> sage
> 
> 
> On Thu, Nov 4, 2021 at 4:15 AM Manuel Lausch <manuel.lausch@xxxxxxxx>
> wrote:
> 
> > On Tue, 2 Nov 2021 09:02:31 -0500
> > Sage Weil <sage@xxxxxxxxxxxx> wrote:
> >
> >  
> > >
> > > Just to be clear, you should try
> > >   osd_fast_shutdown = true
> > >   osd_fast_shutdown_notify_mon = false  
> >
> > I added some logs to the tracker ticket with these options set.
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx