On 04/01/2015 05:04 PM, John Spray wrote:
On 01/04/2015 22:57, Mark Nelson wrote:
It seems to me that the OSD potentially would flash the LED on it's
way down if it thinks it's drive is dead/dying?
That's a good idea for the case where ceph-osd is proactively
identifying a failing drive. I'm also thinking about the case where we
come back from a reboot and a drive is sufficiently unreadable that
ceph-disk doesn't see the OSD partitions and ceph-osd never gets
started, or the OSD's local filesystem is unmountable. Because the
keyring lives on that local filesystem, OSDs couldn't phone home in that
case, even to report a failure.
If things are that bad, I think it should get picked up lower in the
stack. IE there should be some kind of daemon on the system that knows
when there are scsi errors or whatever and blinks drives that are that
far gone (in the case of RAID controllers, they may already do this anyway).
John
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html