Hi, As you said, one of the osds crashed: ================= log ======================== 2011-05-10 21:46:38.990311 4bc90940 osd2 8 pg[3.13a( v 8'1 (0'0,8'1] n=1 ec=2 les=6 5/5/4) [2,3] r=0 mlcod 0'0 active+clean] oi.user_version=8'2 is_modify=0 2011-05-10 21:46:38.990386 4bc90940 osd2 8 pg[3.13a( v 8'1 (0'0,8'1] n=1 ec=2 les=6 5/5/4) [2,3] r=0 mlcod 0'0 active+clean] oi.user_version=8'2 is_modify=1 *** Caught signal (Segmentation fault) ** in thread 0x45382940 ========================================= I tried again, this time, i done "rbd create foo --size 1024" successfully, but when I run the code of testlibrbd.c, one of the osds crash again: ================= log ======================== 2011-05-10 22:08:20.008871 4c115940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean snaptrimq=[1~1]] dump_watchers testimg.rbd/head testimg.rbd/head(9'4 client4107.0:14 wrlock_by=unknown0.0:0) 2011-05-10 22:08:20.008903 4c115940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean snaptrimq=[1~1]] * obc->watcher: client4107 session=0xc80990 2011-05-10 22:08:20.008925 4c115940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean snaptrimq=[1~1]] * oi->watcher: client4107 cookie=2 2011-05-10 22:08:20.009232 4b914940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean] oi.user_version=10'5 is_modify=1 2011-05-10 22:08:20.009267 4b914940 expires 2011-05-10 23:08:19.890032 now 2011-05-10 22:08:20.009260 2011-05-10 22:08:20.009284 napshots_list 2011-05-10 22:08:20.009307 4b914940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean] oi.user_version=10'5 is_modify=0 2011-05-10 22:08:20.009375 4b914940 osd3 10 pg[4.1( v 9'4 (9'2,9'4] n=1 ec=9 les=9 9/9/9) [3,0] r=0 mlcod 9'3 active+clean] oi.user_version=10'5 is_modify=1 *** Caught signal (Segmentation fault) ** in thread 0x4eb1c940 ========================================= Thx! Simon 2011/5/10 Yehuda Sadeh Weinraub <yehudasa@xxxxxxxxx>: > On Tue, May 10, 2011 at 6:39 AM, Simon Tian <aixt2006@xxxxxxxxx> wrote: >> Sorry, I didn't learn the wiki carefullllllly.. >> >> There is another problem: >> ==================================================== >> [root@mon-00 ~]# rbd create foo --size 1024 >> 2011-05-11 02:37:15.400313 42d6a940 -- 10.250.6.98:0/1010326 >> >> 10.250.6.30:6801/3051 pipe(0x63f140 sd=6 pgs=0 cs=0 l=0).fault first >> fault > > This looks like your osd is not reachable, might have crashed? > >> terminate called after throwing an instance of 'ceph::buffer::end_of_buffer' >> Âwhat(): Âbuffer::end_of_buffer >> *** Caught signal (Aborted) ** > > This shouldn't happen anyway, but probably bad error handling. Can you > verify whether one of the daemons went down? > > Thanks, > Yehuda > -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html