Re: cosd multi-second stalls cause "wrongly marked me down"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, 2011-03-02 at 16:20 -0700, Gregory Farnum wrote:
> On Wed, Mar 2, 2011 at 2:57 PM, Jim Schutt <jaschut@xxxxxxxxxx> wrote:
> >
> > On Wed, 2011-03-02 at 14:59 -0700, Jim Schutt wrote:
> >>
> >> On Wed, 2011-03-02 at 14:45 -0700, Sage Weil wrote:
> >> > On Wed, 2 Mar 2011, Jim Schutt wrote:
> >> > >
> >> > > On Wed, 2011-03-02 at 10:10 -0700, Sage Weil wrote:
> >> > > > > I'll see if I see the same signature with master,
> >> > > > > and post logs.
> >> > > >
> >> > > > Thanks!  Keep us posted.
> >> > >
> >> > > Hmmm, I'm not having much luck with master (commit
> >> > > 0fb5ef2ce92 + extra debugging) on a 96-osd filesystem;
> >> > > lots of dead OSDs during startup.
> >> >
> >> > Commit c916905a8a14029653aae45f0a9fb6c9b4c39e05 (master) should fix this.
> >>
> >> I try it out, thanks!
> >
> > I don't get any more core files with master commit 67355779ecc.
> > Now my cosds just die - no stack trace in the log, no core
> > file, nothing in syslog or dmesg ...
> Another commit got in that changed the logging behavior slightly --
> which log file are you opening?
> 

Well, I don't have any specific logging config.
So my logs would show up in /var/log/ceph, and
they still seem to be there.  They contain logging
info, just not a stack trace that might explain
why the cosd died.

-- Jim



--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux