On Tue, Jun 5, 2012 at 9:37 PM, Martin Wilderoth <martin.wilderoth@xxxxxxxxxx> wrote: > 0> 2012-06-06 05:38:37.200297 7f2d5ea08700 -1 mds/AnchorServer.cc: In function 'virtual void AnchorServer::handle_query(MMDSTableRequest*)' thread 7f2d5ea08700 time 2012-06-06 05:38:37.198981 > mds/AnchorServer.cc: 249: FAILED assert(anchor_map.count(curino) == 1) > > ceph version 0.46 (commit:cb7f1c9c7520848b0899b26440ac34a8acea58d1) > 1: (AnchorServer::handle_query(MMDSTableRequest*)+0x175) [0x6bdc95] > 2: (MDS::handle_deferrable_message(Message*)+0xd84) [0x4b0474] > 3: (MDS::_dispatch(Message*)+0xaf8) [0x4c50b8] ... > root@ceph1:~# ceph -v > ceph version 0.47.2 (commit:8bf9fde89bd6ebc4b0645b2fe02dadb1c17ad372) > root@ceph1:~# ssh ceph2 ceph -v > ceph version 0.47.2 (commit:8bf9fde89bd6ebc4b0645b2fe02dadb1c17ad372) > root@ceph1:~# ssh ceph3 ceph -v > ceph version 0.47.2 (commit:8bf9fde89bd6ebc4b0645b2fe02dadb1c17ad372) > root@ceph1:~# ssh ceph4 ceph -v > ceph version 0.47.2 (commit:8bf9fde89bd6ebc4b0645b2fe02dadb1c17ad372) > > is the 0.46 above reporting when the error occurred or am I running the wrong binaries > i use the debian packages ? That sounds weird. The ways I can see that being possible are 1. you had daemons still running from when you had 0.46 debs installed, and the package upgrade didn't restart them -- ceph.git uses "dh_installinit --no-start", so this might actually have happened? 2. at some point, you checked out the 0.46 source and ran make install, and the binaries from that are installed at a path that takes precedence over the 0.47.2 ones in /usr/sbin But having mixed versions in your cluster might explain the crashes. -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html