Re: Assert in OSDMap::Incremental::decode with recent git

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Gregory Farnum wrote:
> On Tue, Nov 15, 2011 at 3:55 AM, Christoph Hellwig <hch@xxxxxxxxxxxxx> wrote:
> > I hit the same when trying to bring up a test cluster on a single
> > physical machine.  As soon as moved to vstart.sh I couldn't reproduce
> > it anymore.
> 
> Hmm, interesting that it doesn't happen on vstart, since that's
> supposed to use the new mon bootstrapping pieces as well.
> 
> Josh, can you turn up monitor debugging and send me the log/post it
> somewhere? Presumably the big refactor Sage referred to broke
> something here.

http://joshp.no-ip.com:8080/20111116-ceph-mon.2.log

I've inlined a snippet from the end.

-Josh

2011-11-16 06:07:13.828322 7fce87736700 -- 192.168.122.74:6789/0 <== mon.0 192.168.122.95:6789/0 30 ==== paxos(osdmap lease lc 144 fc 142 pn 0 opn 0) v1 ==== 84+0+0 (3125715891 0 0) 0x17f2900 con 0x1735780
2011-11-16 06:07:13.828329 7fce87736700 mon.2@2(peon) e1 have connection
2011-11-16 06:07:13.828336 7fce87736700 mon.2@2(peon) e1 ms_dispatch existing session MonSession: mon.0 192.168.122.95:6789/0 is openallow * for mon.0 192.168.122.95:6789/0
2011-11-16 06:07:13.828340 7fce87736700 mon.2@2(peon) e1  caps allow *
2011-11-16 06:07:13.828347 7fce87736700 mon.2@2(peon).paxos(osdmap active c 141..144) handle_lease on 144 now 2011-11-16 06:07:17.183529
2011-11-16 06:07:13.828356 7fce87736700 -- 192.168.122.74:6789/0 --> mon.0 192.168.122.95:6789/0 -- paxos(osdmap lease_ack lc 144 fc 141 pn 0 opn 0) v1 -- ?+0 0x17f2b40
2011-11-16 06:07:13.828471 7fce87736700 mon.2@2(peon).paxos(osdmap active c 141..144) trim_to 142 (was 141), latest_stashed 141
2011-11-16 06:07:13.828485 7fce87736700 store(/data/mon2) set_int osdmap/first_committed = 141
2011-11-16 06:07:14.126377 7fce86431700 mon.2@2(peon) e1 ms_verify_authorizer 192.168.122.1:0/1024415 client protocol 0
2011-11-16 06:07:18.370639 7fce87736700 mon.2@2(peon).paxosservice(osdmap) _active
2011-11-16 06:07:18.370657 7fce87736700 mon.2@2(peon).osd e130 update_from_paxos paxos e 144, my e 130
2011-11-16 06:07:18.370691 7fce87736700 store(/data/mon2) get_bl osdmap/131 No such file or directory
2011-11-16 06:07:18.370698 7fce87736700 mon.2@2(peon).osd e130 update_from_paxos  applying incremental 131
*** Caught signal (Aborted) **
 in thread 7fce87736700
 ceph version 0.38-181-g2e19550 (commit:2e195500b5d3a8ab8512bcf2a219a6b7ff922c97)
 1: /usr/bin/ceph-mon() [0x5c2fa6]
 2: (()+0x10060) [0x7fce8b209060]
 3: (gsignal()+0x35) [0x7fce8998a3a5]
 4: (abort()+0x17b) [0x7fce8998db0b]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7fce8a248d7d]
 6: (()+0xb9f26) [0x7fce8a246f26]
 7: (()+0xb9f53) [0x7fce8a246f53]
 8: (()+0xba04e) [0x7fce8a24704e]
 9: (ceph::buffer::list::iterator::copy(unsigned int, char*)+0x127) [0x596237]
 10: (OSDMap::Incremental::decode(ceph::buffer::list::iterator&)+0x3f) [0x573b6f]
 11: (OSDMonitor::update_from_paxos()+0x7b0) [0x49a9c0]
 12: (PaxosService::_active()+0x39) [0x4933f9]
 13: (Context::complete(int)+0xa) [0x47c12a]
 14: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da8a]
 15: (Paxos::handle_lease(MMonPaxos*)+0x36d) [0x48aafd]
 16: (Paxos::dispatch(PaxosServiceMessage*)+0x21b) [0x48f4db]
 17: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b66f]
 18: (Monitor::ms_dispatch(Message*)+0x35) [0x486425]
 19: (SimpleMessenger::dispatch_entry()+0x84b) [0x583e8b]
 20: (SimpleMessenger::DispatchThread::entry()+0x1c) [0x46612c]
 21: (()+0x7efc) [0x7fce8b200efc]
 22: (clone()+0x6d) [0x7fce89a3589d]
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux