Please review. If the monitor sees an osdmap go by where nodes go down (or up) it will scan its pg_map and mark any pg whose primary is down as 'stale'. If/when the pg recovers, that will get refreshed. If not, the admin will know something is up. We'll soon be adding the last_active, last_clean, and now last_unstale (?) fields so that bigger alarms can go off when the pg stays stale for more than a few seconds... sage -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html