Things have stabilized here. In the end there were a few different issues: - We have a bug in teh paxos code that wasn't trimming old keys, causing the store to grow indefinitely. - leveldb does very poorly with "rolling key ranges", where regions of the keyspace are deleted and never touched again. This fails to trigger the automatic compaction causing disk space to grow. We've resolved this by triggering compaction explicitly every so often over the ranges of keys we've used, and doing that compaction in an async thread. This is controlled by the 'mon compact on trim' option, which defaults to true. - Over the course of debugging this issue we added several similar options to compact at various other times, but they should not be neceessary for any normal users. At this point we are in a pretty good position. There are still some hard to hit mon bugs present, but they are not dangerous (ceph-mon restart will do the trick). They are largely fixed in the master branch but we want to test further before risking any backports. Meanwhile, the next development release will be changing the way all the pg metadata in the monitor is stored to be much more efficient and to take advantage of leveldb's capabilities; this will be present in 0.66 (dumpling - 1). sage -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html