On Mon, Jul 16, 2018 at 2:08 PM, John Spray <jspray@xxxxxxxxxx> wrote: > BTW there's already an interesting opportunity for someone to write a > chaosmonkey-type ceph-mgr module that periodically does things like > taking an OSD out and letting the cluster rebalance, randomly killing > an MDS from time to time, etc. What's the goal here? Are there tickets or something about this? I ask because anybody working on something like this should at least be aware of the thrasher code in https://github.com/ceph/ceph/blob/master/qa/tasks/ceph_manager.py#L98 I would not say it is, uh, *good*, but we use it extensively in teuthology testing. If we're going to build another one into the manager it might be nice to switch to relying on that instead, assuming it's feasible. Fewer code bases are generally better! (Of course, then we've got a great circular testing thing to work out too...) -Greg -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html