On 03/25/2014 04:04 AM, Ilya Dryomov wrote: > On Tue, Mar 25, 2014 at 10:39 AM, Olivier Bonvalet <ceph.list@xxxxxxxxx> wrote: >> Hi, >> >> what can/should I do to help fix that problem ? >> >> for now, RBD kernel client hang on : >> Assertion failure in rbd_img_obj_callback() at line 2131: >> rbd_assert(which >= img_request->next_completion); If you can build your own kernel as Ilya says I'd like to see the values of which and img_request->next_completion here. If you can't, is there anything interesting about your workload that might help reproduce the problem? Thanks. -Alex >> >> or on : >> Assertion failure in rbd_img_obj_callback() at line 2127: >> rbd_assert(img_request != NULL); >> >> >> I have both case at least once per week, on latest 3.13.5 kernels. >> >> It seems that the problem occurs only on more loaded servers (I have 4 >> near same servers, and crash occurs on two of them. If I move the VM, >> crash follows...). > > Hi, > > I'm looking into this, but it doesn't reproduce on my boxes. This must > be some sort of race and the fact that there is nothing else in the > dmesg right before the splats doesn't help. > > Can you send me your 3.13.5 rbd.ko and libceph.ko? Also, just so > I know, can you build your own kernel? > > Thanks, > > Ilya > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html