This could be related to bug 8818. See
http://tracker.ceph.com/issues/8818
I've just experienced a disk crash in my Ceph cluster. This seems to
have caused an error in the kernel: every I/O command sent to any
mapped RBD is deferred indefinitely. This is still the case an hour
later, even though the OSD has already been taken out of the cluster
and ceph -s reports HEALTH_OK.
...
A kernel dump is provided at this pastebin link:
http://pastebin.com/WMqmiUsM
...
I'm running Linux kernel 3.15.7 and am using layered RBD devices.