2012/2/27 Székelyi Szabolcs <szekelyi@xxxxxxx>: > Hello, > > whenever I restart osd.0 I see a pair of messages like > > 2012-02-27 17:26:00.132666 mon.0 <osd_1_ip>:6789/0 106 : [INF] osd.0 > <osd_0_ip>:6801/29931 failed (by osd.1 <osd_1_ip>:6806/20125) > 2012-02-27 17:26:21.074926 osd.0 <osd_0_ip>:6801/29931 1 : [WRN] map e370 > wrongly marked me down or wrong addr To elaborate, if you look at the IP you'll see that it takes the form ip:port/random_number. That random number is to prevent exactly the situation you're asking about, where the new instance of a daemon gets stuck with stuff from its previous instance. We'll also probably want "debug osd = 10" to help diagnose what's going on, if you can get us full logs of it happening... :) -Greg -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html