Re: Ceph kRBD stuck after disk crash

Eric Eastman <eric0e@xxxxxxx> · Tue, 5 Aug 2014 10:17:41 -0400 (EDT)

This could be related to bug 8818.  See 
http://tracker.ceph.com/issues/8818

I've just experienced a disk crash in my ceph cluster. This seems to
have caused an error in the kernel since every I/O command sent to any
mapped RBD is defered indefinitely, even an hour later, the OSD is
already taken out of the cluster and ceph -s says HEALTH_OK.
...
A kernel dump is provided at this pastebin link: 
http://pastebin.com/WMqmiUsM
...
I'm running the linux kernel 3.15.7 and are using layered rbd devices.

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html