Re: monitor dispatch queue seems backed up?

"Jim Schutt" <jaschut@xxxxxxxxxx> · Mon, 7 Mar 2011 08:26:05 -0700

On Fri, 2011-03-04 at 16:56 -0700, Sage Weil wrote:
> On Fri, 4 Mar 2011, Jim Schutt wrote:
> > On Fri, 2011-03-04 at 12:48 -0700, Sage Weil wrote:
> > > > Hi,
> > > > 
> > > > I'm continuing my testing of the master branch
> > > > (commit 1ed2d8c587) against 96 osds.
> > > > 
> > > > I'm having trouble getting a new filesystem to
> > > > start up.  FWIW this size filesystem starts up
> > > > in a minute or two at most under the stable branch.
> > > 
> > > The main thing that's changed since then is the default number of PGs has 
> > > gone up.  Can you try changing the osd_pg_bits to 6 in common/config.cc, 
> > > rebuild, and re-mkcephfs, and see if that gives you behavior similar to 
> > > stable?  (mkcephfs isn't currently letting you adjust that yet.)
> > 
> > OK, I tried osd_pg_bits = 6 as you suggested.
> > 
> > The startup behavior is much like stable, except my "ceph -w"
> > seems to have some trouble connecting - it often needs to 
> > try multiple times before it is successful.
> 
> Okay.  We'll have more nodes up next week so hopefully I'll be able to 
> reproduce this behavior.
> 
> > Unfortunately, the behavior under my 64-client write
> > test is also much like stable - I'm still getting
> > osds wrongly marked down.
> > 
> > I didn't dig into it enough yet to see if this is due to 
> > the same type of mysterious delays as I was finding
> > on stable.
> 
> I just pushed something to master that sprinkles debug prints all through 
> the heartbeat thread.  Let's try to figure out where it is blocking.
> 
> My worry is that this is all the VM's fault: we're writing data, hit the 
> dirty page limit, and thereafter any memory allocations or writes block.  
> I've been assuming that only the thread that does the write gets blocked, 
> but it may be that the VM blocks all threads in the process, or that it is 
> a memory allocation (in the heartbeat thread) that is hitting the limit 
> (balance_dirty_pages() IIRC).  We may need to make sure the heartbeat 
> thread uses preallocated/locked memory or something.. :(
> 
> Anyway, getting more detailed logs will tell us which thread(s) are 
> blocking when.  

OK.  FWIW, when I was digging into this on stable, one
thing I noticed is that when processing of a heatbeat
message stalled, the time between when it was dequeued
from the dispatch queue and when processing finished
was full of Reader/Writer thread processing, with the
occasional journal commit.

I'll recreate with your debugging, and see if I get
a similar behavior.  In any event, I'll have the logs
available.

-- Jim

> 
> Thanks!
> sage
> 

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html