Re: cosd multi-second stalls cause "wrongly marked me down"

On Wed, 2011-03-02 at 10:10 -0700, Sage Weil wrote:
> > I'll see if I see the same signature with master,
> > and post logs.
> 
> Thanks!  Keep us posted.

Hmmm, I'm not having much luck with master (commit
0fb5ef2ce92 + extra debugging) on a 96-OSD filesystem;
lots of OSDs die during startup.

I used to use a global chdir option to redirect
my core files; my servers are read-only NFS-root,
with /root on a ramdisk, so being able to point
those core files at persistent storage was useful.
That seems to have changed somewhat.

The backtraces all seem to look like this:

(gdb) bt
#0  0x00007f5d496d79dd in raise (sig=<value optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:41
#1  0x000000000089bd67 in handle_fatal_signal (signum=6) at common/signal.cc:78
#2  <signal handler called>
#3  0x00007f5d48542265 in raise (sig=<value optimized out>) at ../nptl/sysdeps/unix/sysv/linux/raise.c:64
#4  0x00007f5d48543d10 in abort () at abort.c:88
#5  0x00007f5d48db8cb4 in __gnu_cxx::__verbose_terminate_handler() () from /usr/lib64/libstdc++.so.6
#6  0x00007f5d48db6db6 in ?? () from /usr/lib64/libstdc++.so.6
#7  0x00007f5d48db6de3 in std::terminate() () from /usr/lib64/libstdc++.so.6
#8  0x00007f5d48db6eca in __cxa_throw () from /usr/lib64/libstdc++.so.6
#9  0x000000000066b1ac in ceph::buffer::list::iterator::copy (this=0x7f5d3f4fde40, len=2, dest=0x7f5d3f4fd77e "") at ./include/buffer.h:637
#10 0x00000000006d998c in decode_raw<__le16> (t=@0x7f5d3f4fd77e, p=...) at ./include/encoding.h:35
#11 0x0000000000769a27 in decode (v=@0x7f5d3f4fd7ae, p=...) at ./include/encoding.h:82
#12 0x000000000079c857 in OSDMap::Incremental::decode (this=0x7f5d3f4fd8a0, p=...) at osd/OSDMap.h:204
#13 0x0000000000731803 in OSD::handle_osd_map (this=0x2266af0, m=0x24b2c30) at osd/OSD.cc:2859
#14 0x0000000000733198 in OSD::_dispatch (this=0x2266af0, m=0x24b2c30) at osd/OSD.cc:2428
#15 0x00000000007344e0 in OSD::ms_dispatch (this=0x2266af0, m=0x24b2c30) at osd/OSD.cc:2301
#16 0x000000000068ab9d in Messenger::ms_deliver_dispatch (this=0x220aa30, m=0x24b2c30) at msg/Messenger.h:97
#17 0x0000000000677894 in SimpleMessenger::dispatch_entry (this=0x220aa30) at msg/SimpleMessenger.cc:357
#18 0x000000000066eb2d in SimpleMessenger::DispatchThread::entry (this=0x220aed0) at msg/SimpleMessenger.h:534
#19 0x000000000068b258 in Thread::_entry_func (arg=0x220aed0) at ./common/Thread.h:47
#20 0x00007f5d496cf73d in start_thread (arg=<value optimized out>) at pthread_create.c:301
#21 0x00007f5d485e5f6d in clone () from /lib64/libc.so.6
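
Reading the trace: frames #5 through #9 suggest that an exception
thrown from buffer::list::iterator::copy, while decoding a 2-byte
field in OSDMap::Incremental::decode, was never caught, so
std::terminate() called abort(). One plausible cause is that the
decoder ran off the end of the received payload, though the trace
alone does not prove that. A minimal sketch of that failure mode,
using hypothetical stand-ins (Cursor, end_of_buffer, decode_le16,
try_decode_pair) rather than the actual Ceph types:

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for the exception thrown on a short read.
struct end_of_buffer : std::runtime_error {
    end_of_buffer() : std::runtime_error("read past end of buffer") {}
};

// Hypothetical cursor over a flat byte buffer (not Ceph's bufferlist iterator).
class Cursor {
public:
    explicit Cursor(const std::vector<std::uint8_t>& data) : data_(data) {}

    // Copy len bytes into dest, or throw if fewer than len bytes remain.
    void copy(std::size_t len, void* dest) {
        if (len > data_.size() - pos_) throw end_of_buffer();
        std::memcpy(dest, data_.data() + pos_, len);
        pos_ += len;
    }

private:
    const std::vector<std::uint8_t>& data_;
    std::size_t pos_ = 0;
};

// Decode one little-endian 16-bit value; propagates end_of_buffer.
std::uint16_t decode_le16(Cursor& p) {
    std::uint8_t raw[2];
    p.copy(sizeof raw, raw);
    return static_cast<std::uint16_t>(raw[0] | (raw[1] << 8));
}

// Decode two 16-bit fields. Returns false on a truncated payload
// instead of letting the exception escape and reach std::terminate().
bool try_decode_pair(const std::vector<std::uint8_t>& payload,
                     std::uint16_t& a, std::uint16_t& b) {
    Cursor p(payload);
    try {
        a = decode_le16(p);
        b = decode_le16(p);
        return true;
    } catch (const end_of_buffer&) {
        return false;
    }
}
```

Without the try/catch in try_decode_pair, a 3-byte payload would
unwind out of the thread's entry function and end in abort(), which
is the shape of frames #3 through #9 above.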

-- Jim

> sage
> 

