BTW, man cmds says "--journal_check", and cmds itself says

usage: cmds -i name [flags] [[--journal_check]|[--hot-standby][rank]]
<snip>
  --journal-check rank
        replay the journal for rank, then exit

with an underscore in the summary and a dash in the option details.

After trying all possible combinations I found the right syntax:

cmds -i node01 --journal-check 0

The problem wasn't dash versus underscore; both of them work. Omitting the rank is what doesn't work, so that's an error in the man page: [--journal_check [rank]] should be [--journal_check rank].

Anyway, running cmds --journal-check on both nodes didn't fix the problem. I also tried --journal-reset. It returned

2011-04-15 17:07:44.494238 7f372f185720 -- 0.0.0.0:6804/11266 messenger.start BUG: there are 1 child threads already started that will now die! call messenger.start() sooner.

but didn't fix the suicide either.

Z