Re: osd heartbeat protocol issue on upgrade v12.1.0 ->v12.2.0

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



 
I had this also once. If you update all nodes and then systemctl restart 
'ceph-osd@*' on all nodes, you should be fine. But first the monitors of 
course



-----Original Message-----
From: Thomas Gebhardt [mailto:gebhardt@xxxxxxxxxxxxxxxxxx] 
Sent: woensdag 30 augustus 2017 14:10
To: ceph-users@xxxxxxxxxxxxxx
Subject:  osd heartbeat protocol issue on upgrade v12.1.0 
->v12.2.0

Hello,

when I upgraded (yet a single osd node) from v12.1.0 -> v12.2.0 its osds 
start flapping and finally got all marked as down.

As far as I can see, this is due to an incompatibility of the osd 
heartbeat protocol between the two versions:

v12.2.0 node:
7f4f7b6e6700 -1 osd.X 3879 heartbeat_check: no reply from x.x.x.x:yyyy 
osd.Y ever on either front or back, first ping sent ...

v12.1.0 node:
7fd854ebf700 -1 failed to decode message of type 70 v4:
buffer::malformed_input: void
osd_peer_stat_t::decode(ceph::buffer::list::iterator&) no longer 
understand old encoding version 1 < struct_compat

( it is puzzling that the *older* v12.1.0 node complains about the *old* 
encoding version of the *newer* v12.2.0 node.)

Any idea how I can go ahead?

Kind regards, Thomas
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com


_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux