I had this also once. If you update all nodes and then systemctl restart 'ceph-osd@*' on all nodes, you should be fine. But first the monitors of course -----Original Message----- From: Thomas Gebhardt [mailto:gebhardt@xxxxxxxxxxxxxxxxxx] Sent: woensdag 30 augustus 2017 14:10 To: ceph-users@xxxxxxxxxxxxxx Subject: osd heartbeat protocol issue on upgrade v12.1.0 ->v12.2.0 Hello, when I upgraded (yet a single osd node) from v12.1.0 -> v12.2.0 its osds start flapping and finally got all marked as down. As far as I can see, this is due to an incompatibility of the osd heartbeat protocol between the two versions: v12.2.0 node: 7f4f7b6e6700 -1 osd.X 3879 heartbeat_check: no reply from x.x.x.x:yyyy osd.Y ever on either front or back, first ping sent ... v12.1.0 node: 7fd854ebf700 -1 failed to decode message of type 70 v4: buffer::malformed_input: void osd_peer_stat_t::decode(ceph::buffer::list::iterator&) no longer understand old encoding version 1 < struct_compat ( it is puzzling that the *older* v12.1.0 node complains about the *old* encoding version of the *newer* v12.2.0 node.) Any idea how I can go ahead? Kind regards, Thomas _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com