Dear Everyone,
First of all, guys, seriously, thank you for Ceph.
Now to the problem: after upgrading Ceph from 0.94.6 (e832001feaf8c176593e0325c8298e3f16dfb403) to 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb), ceph-mon seems unable to upgrade its database. The problem is gone if I use --force-sync.
This is the message:
terminate called after throwing an instance of 'ceph::buffer::malformed_input'
what(): buffer::malformed_input: void object_stat_sum_t::decode(ceph::buffer::list::iterator&) decode past end of struct encoding
*** Caught signal (Aborted) **
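For context on what this kind of message generally means, here is a minimal Python sketch. It is not Ceph code. It assumes a length-prefixed, versioned layout (a version byte, a compat byte, and a 32-bit length ahead of the payload), which is my understanding of how Ceph frames encoded structs. The names encode_versioned, decode_fields and MalformedInput are made up for illustration, and the sketch says nothing about the actual root cause here. It only shows that a decoder expecting more fields than the stored length covers fails with this kind of error.

```python
import struct


class MalformedInput(Exception):
    """Stand-in for a 'malformed input' decode error (hypothetical name)."""


def encode_versioned(struct_v, compat_v, fields):
    # Payload: each field as a little-endian signed 64-bit integer.
    payload = b"".join(struct.pack("<q", f) for f in fields)
    # Header: version byte, compat byte, 32-bit payload length.
    return struct.pack("<BBI", struct_v, compat_v, len(payload)) + payload


def decode_fields(buf, expected_fields):
    # Read the header and work out where this struct's encoding ends.
    struct_v, _compat_v, length = struct.unpack_from("<BBI", buf, 0)
    off = struct.calcsize("<BBI")
    struct_end = off + length
    fields = []
    for _ in range(expected_fields):
        # Refuse to read beyond the end declared by the encoder.
        if off + 8 > struct_end:
            raise MalformedInput("decode past end of struct encoding")
        (value,) = struct.unpack_from("<q", buf, off)
        fields.append(value)
        off += 8
    return struct_v, fields


if __name__ == "__main__":
    blob = encode_versioned(1, 1, [10, 20, 30])
    print(decode_fields(blob, 3))      # decoder and encoder agree
    try:
        decode_fields(blob, 4)         # decoder expects one field too many
    except MalformedInput as err:
        print("error:", err)
```

In this toy model the failure does not depend on the bytes being corrupt: it is enough that the reader and the writer disagree about how many fields the struct contains.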
Attached is the full log, the output of:
ceph-mon --debug_mon 100 -i node-1 -d
---
Armin ranjbar
2019-07-22 19:17:54.429120 7f2064488f40 0 ceph version 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb) luminous (stable), process ceph-mon, pid 908122
2019-07-22 19:17:54.429229 7f2064488f40 0 pidfile_write: ignore empty --pid-file
2019-07-22 19:17:54.438472 7f2064488f40 0 load: jerasure load: lrc load: isa
2019-07-22 19:17:54.438908 7f2064488f40 1 leveldb: Recovering log #4402802
2019-07-22 19:17:54.489204 7f2064488f40 1 leveldb: Delete type=0 #4402802
2019-07-22 19:17:54.489263 7f2064488f40 1 leveldb: Delete type=3 #4402801
2019-07-22 19:17:54.489547 7f2064488f40 10 obtain_monmap
terminate called after throwing an instance of 'ceph::buffer::malformed_input'
what(): buffer::malformed_input: void object_stat_sum_t::decode(ceph::buffer::list::iterator&) decode past end of struct encoding
*** Caught signal (Aborted) **
in thread 7f2064488f40 thread_name:ceph-mon
2019-07-22 19:17:54.489654 7f2064488f40 10 obtain_monmap read last committed monmap ver 3
2019-07-22 19:17:54.490558 7f2064488f40 0 starting mon.node-1 rank 2 at public addr 192.168.1.16:6789/0 at bind addr 192.168.1.16:6789/0 mon_data /var/lib/ceph/mon/ceph-node-1 fsid cf635990-70fa-43ed-978d-96f92f9ccc92
2019-07-22 19:17:54.490737 7f2064488f40 0 starting mon.node-1 rank 2 at 192.168.1.16:6789/0 mon_data /var/lib/ceph/mon/ceph-node-1 fsid cf635990-70fa-43ed-978d-96f92f9ccc92
2019-07-22 19:17:54.491279 7f2064488f40 1 mon.node-1@-1(probing) e3 preinit fsid cf635990-70fa-43ed-978d-96f92f9ccc92
2019-07-22 19:17:54.491351 7f2064488f40 10 mon.node-1@-1(probing) e3 check_fsid cluster_uuid contains 'cf635990-70fa-43ed-978d-96f92f9ccc92'
2019-07-22 19:17:54.491363 7f2064488f40 10 mon.node-1@-1(probing) e3 features compat={},rocompat={},incompat={1=initial feature set (~v.18),3=single paxos with k/v store (v0.?),4=support erasure code pools,5=new-style osdmap encoding,6=support isa/lrc erasure code}
2019-07-22 19:17:54.491371 7f2064488f40 10 mon.node-1@-1(probing) e3 calc_quorum_requirements required_features 18416819765248
2019-07-22 19:17:54.491374 7f2064488f40 10 mon.node-1@-1(probing) e3 required_features 18416819765248
2019-07-22 19:17:54.491381 7f2064488f40 10 mon.node-1@-1(probing) e3 has_ever_joined = 1
2019-07-22 19:17:54.491411 7f2064488f40 10 mon.node-1@-1(probing) e3 sync_last_committed_floor 0
2019-07-22 19:17:54.491413 7f2064488f40 10 mon.node-1@-1(probing) e3 init_paxos
2019-07-22 19:17:54.491516 7f2064488f40 1 mon.node-1@-1(probing).mds e0 Unable to load 'last_metadata'
2019-07-22 19:17:54.491558 7f2064488f40 10 mon.node-1@-1(probing).health init
2019-07-22 19:17:54.491574 7f2064488f40 10 mon.node-1@-1(probing) e3 refresh_from_paxos
2019-07-22 19:17:54.491608 7f2064488f40 1 mon.node-1@-1(probing).paxosservice(pgmap 21727587..21728259) refresh upgraded, format 0 -> 1
2019-07-22 19:17:54.491612 7f2064488f40 1 mon.node-1@-1(probing).pg v0 on_upgrade discarding in-core PGMap
2019-07-22 19:17:54.491635 7f2064488f40 10 mon.node-1@-1(probing).pg v0 update_from_paxos v0, read_full
2019-07-22 19:17:54.491638 7f2064488f40 10 mon.node-1@-1(probing).pg v0 read_pgmap_meta
 ceph version 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb) luminous (stable)
 1: (()+0x96b249) [0x7f2063e73249]
 2: (()+0x10330) [0x7f20628a0330]
 3: (gsignal()+0x37) [0x7f2060e8bc37]
 4: (abort()+0x148) [0x7f2060e8f028]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x155) [0x7f206179a535]
 6: (()+0x5e6d6) [0x7f20617986d6]
 7: (()+0x5e703) [0x7f2061798703]
 8: (()+0x5e922) [0x7f2061798922]
 9: (object_stat_sum_t::decode(ceph::buffer::list::iterator&)+0x650) [0x7f2063c81be0]
 10: (object_stat_collection_t::decode(ceph::buffer::list::iterator&)+0x4f) [0x7f2063c9627f]
 11: (pg_stat_t::decode(ceph::buffer::list::iterator&)+0x1d5) [0x7f2063c96965]
 12: (PGMap::update_pg(pg_t, ceph::buffer::list&)+0xf4) [0x7f20639d93b4]
 13: (PGMonitor::read_pgmap_full()+0x161) [0x7f20639a8a81]
 14: (PGMonitor::update_from_paxos(bool*)+0x699) [0x7f20639b0479]
 15: (PaxosService::refresh(bool*)+0x1a3) [0x7f2063a55103]
 16: (Monitor::refresh_from_paxos(bool*)+0x183) [0x7f206390cd53]
 17: (Monitor::init_paxos()+0xfd) [0x7f206390d12d]
 18: (Monitor::preinit()+0xa7e) [0x7f206390dbee]
 19: (main()+0x3bf4) [0x7f206383cde4]
 20: (__libc_start_main()+0xf5) [0x7f2060e76f45]
 21: (()+0x3db4fe) [0x7f20638e34fe]
2019-07-22 19:17:54.495504 7f2064488f40 -1 *** Caught signal (Aborted) **
 in thread 7f2064488f40 thread_name:ceph-mon
 ceph version 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb) luminous (stable)
 1: (()+0x96b249) [0x7f2063e73249]
 2: (()+0x10330) [0x7f20628a0330]
 3: (gsignal()+0x37) [0x7f2060e8bc37]
 4: (abort()+0x148) [0x7f2060e8f028]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x155) [0x7f206179a535]
 6: (()+0x5e6d6) [0x7f20617986d6]
 7: (()+0x5e703) [0x7f2061798703]
 8: (()+0x5e922) [0x7f2061798922]
 9: (object_stat_sum_t::decode(ceph::buffer::list::iterator&)+0x650) [0x7f2063c81be0]
 10: (object_stat_collection_t::decode(ceph::buffer::list::iterator&)+0x4f) [0x7f2063c9627f]
 11: (pg_stat_t::decode(ceph::buffer::list::iterator&)+0x1d5) [0x7f2063c96965]
 12: (PGMap::update_pg(pg_t, ceph::buffer::list&)+0xf4) [0x7f20639d93b4]
 13: (PGMonitor::read_pgmap_full()+0x161) [0x7f20639a8a81]
 14: (PGMonitor::update_from_paxos(bool*)+0x699) [0x7f20639b0479]
 15: (PaxosService::refresh(bool*)+0x1a3) [0x7f2063a55103]
 16: (Monitor::refresh_from_paxos(bool*)+0x183) [0x7f206390cd53]
 17: (Monitor::init_paxos()+0xfd) [0x7f206390d12d]
 18: (Monitor::preinit()+0xa7e) [0x7f206390dbee]
 19: (main()+0x3bf4) [0x7f206383cde4]
 20: (__libc_start_main()+0xf5) [0x7f2060e76f45]
 21: (()+0x3db4fe) [0x7f20638e34fe]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
--- begin dump of recent events ---
 -61> 2019-07-22 19:17:54.421648 7f2064488f40 5 asok(0x7f206dcc6380) register_command perfcounters_dump hook 0x7f206dc52190
 -60> 2019-07-22 19:17:54.421691 7f2064488f40 5 asok(0x7f206dcc6380) register_command 1 hook 0x7f206dc52190
 -59> 2019-07-22 19:17:54.421697 7f2064488f40 5 asok(0x7f206dcc6380) register_command perf dump hook 0x7f206dc52190
 -58> 2019-07-22 19:17:54.421707 7f2064488f40 5 asok(0x7f206dcc6380) register_command perfcounters_schema hook 0x7f206dc52190
 -57> 2019-07-22 19:17:54.421710 7f2064488f40 5 asok(0x7f206dcc6380) register_command perf histogram dump hook 0x7f206dc52190
 -56> 2019-07-22 19:17:54.421715 7f2064488f40 5 asok(0x7f206dcc6380) register_command 2 hook 0x7f206dc52190
 -55> 2019-07-22 19:17:54.421723 7f2064488f40 5 asok(0x7f206dcc6380) register_command perf schema hook 0x7f206dc52190
 -54> 2019-07-22 19:17:54.421731 7f2064488f40 5 asok(0x7f206dcc6380) register_command perf histogram schema hook 0x7f206dc52190
 -53> 2019-07-22 19:17:54.421746 7f2064488f40 5 asok(0x7f206dcc6380) register_command perf reset hook 0x7f206dc52190
 -52> 2019-07-22 19:17:54.421750 7f2064488f40 5 asok(0x7f206dcc6380) register_command config show hook 0x7f206dc52190
 -51> 2019-07-22 19:17:54.421761 7f2064488f40 5 asok(0x7f206dcc6380) register_command config help hook 0x7f206dc52190
 -50> 2019-07-22 19:17:54.421773 7f2064488f40 5 asok(0x7f206dcc6380) register_command config set hook 0x7f206dc52190
 -49> 2019-07-22 19:17:54.421781 7f2064488f40 5 asok(0x7f206dcc6380) register_command config get hook 0x7f206dc52190
 -48> 2019-07-22 19:17:54.421785 7f2064488f40 5 asok(0x7f206dcc6380) register_command config diff hook 0x7f206dc52190
 -47> 2019-07-22 19:17:54.421791 7f2064488f40 5 asok(0x7f206dcc6380) register_command config diff get hook 0x7f206dc52190
 -46> 2019-07-22 19:17:54.421794 7f2064488f40 5 asok(0x7f206dcc6380) register_command log flush hook 0x7f206dc52190
 -45> 2019-07-22 19:17:54.421798 7f2064488f40 5 asok(0x7f206dcc6380) register_command log dump hook 0x7f206dc52190
 -44> 2019-07-22 19:17:54.421803 7f2064488f40 5 asok(0x7f206dcc6380) register_command log reopen hook 0x7f206dc52190
 -43> 2019-07-22 19:17:54.421821 7f2064488f40 5 asok(0x7f206dcc6380) register_command dump_mempools hook 0x7f206dc751e8
 -42> 2019-07-22 19:17:54.429120 7f2064488f40 0 ceph version 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb) luminous (stable), process ceph-mon, pid 908122
 -41> 2019-07-22 19:17:54.429229 7f2064488f40 0 pidfile_write: ignore empty --pid-file
 -40> 2019-07-22 19:17:54.431720 7f2064488f40 5 asok(0x7f206dcc6380) init /var/run/ceph/ceph-mon.node-1.asok
 -39> 2019-07-22 19:17:54.431751 7f2064488f40 5 asok(0x7f206dcc6380) bind_and_listen /var/run/ceph/ceph-mon.node-1.asok
 -38> 2019-07-22 19:17:54.431866 7f2064488f40 5 asok(0x7f206dcc6380) register_command 0 hook 0x7f206dc4e0c0
 -37> 2019-07-22 19:17:54.431876 7f2064488f40 5 asok(0x7f206dcc6380) register_command version hook 0x7f206dc4e0c0
 -36> 2019-07-22 19:17:54.431889 7f2064488f40 5 asok(0x7f206dcc6380) register_command git_version hook 0x7f206dc4e0c0
 -35> 2019-07-22 19:17:54.431895 7f2064488f40 5 asok(0x7f206dcc6380) register_command help hook 0x7f206dc521d0
 -34> 2019-07-22 19:17:54.431904 7f2064488f40 5 asok(0x7f206dcc6380) register_command get_command_descriptions hook 0x7f206dc522d0
 -33> 2019-07-22 19:17:54.431968 7f205ec61700 5 asok(0x7f206dcc6380) entry start
 -32> 2019-07-22 19:17:54.438472 7f2064488f40 0 load: jerasure load: lrc load: isa
 -31> 2019-07-22 19:17:54.438908 7f2064488f40 1 leveldb: Recovering log #4402802
 -30> 2019-07-22 19:17:54.489204 7f2064488f40 1 leveldb: Delete type=0 #4402802
 -29> 2019-07-22 19:17:54.489263 7f2064488f40 1 leveldb: Delete type=3 #4402801
 -28> 2019-07-22 19:17:54.489547 7f2064488f40 10 obtain_monmap
 -27> 2019-07-22 19:17:54.489654 7f2064488f40 10 obtain_monmap read last committed monmap ver 3
 -26> 2019-07-22 19:17:54.490318 7f205cf95700 2 Event(0x7f206dcc4080 nevent=5000 time_id=1).set_owner idx=0 owner=139776975525632
 -25> 2019-07-22 19:17:54.490397 7f205c794700 2 Event(0x7f206dcc5680 nevent=5000 time_id=1).set_owner idx=1 owner=139776967132928
 -24> 2019-07-22 19:17:54.490427 7f205bf93700 2 Event(0x7f206dcc5280 nevent=5000 time_id=1).set_owner idx=2 owner=139776958740224
 -23> 2019-07-22 19:17:54.490558 7f2064488f40 0 starting mon.node-1 rank 2 at public addr 192.168.1.16:6789/0 at bind addr 192.168.1.16:6789/0 mon_data /var/lib/ceph/mon/ceph-node-1 fsid cf635990-70fa-43ed-978d-96f92f9ccc92
 -22> 2019-07-22 19:17:54.490690 7f2064488f40 1 -- 192.168.1.16:6789/0 learned_addr learned my addr 192.168.1.16:6789/0
 -21> 2019-07-22 19:17:54.490696 7f2064488f40 1 -- 192.168.1.16:6789/0 _finish_bind bind my_inst.addr is 192.168.1.16:6789/0
 -20> 2019-07-22 19:17:54.490737 7f2064488f40 0 starting mon.node-1 rank 2 at 192.168.1.16:6789/0 mon_data /var/lib/ceph/mon/ceph-node-1 fsid cf635990-70fa-43ed-978d-96f92f9ccc92
 -19> 2019-07-22 19:17:54.490772 7f2064488f40 5 adding auth protocol: cephx
 -18> 2019-07-22 19:17:54.490773 7f2064488f40 5 adding auth protocol: cephx
 -17> 2019-07-22 19:17:54.490817 7f2064488f40 10 log_channel(cluster) update_config to_monitors: true to_syslog: false syslog_facility: daemon prio: info to_graylog: false graylog_host: 127.0.0.1 graylog_port: 12201)
 -16> 2019-07-22 19:17:54.490820 7f2064488f40 10 log_channel(audit) update_config to_monitors: true to_syslog: false syslog_facility: local0 prio: info to_graylog: false graylog_host: 127.0.0.1 graylog_port: 12201)
 -15> 2019-07-22 19:17:54.491279 7f2064488f40 1 mon.node-1@-1(probing) e3 preinit fsid cf635990-70fa-43ed-978d-96f92f9ccc92
 -14> 2019-07-22 19:17:54.491351 7f2064488f40 10 mon.node-1@-1(probing) e3 check_fsid cluster_uuid contains 'cf635990-70fa-43ed-978d-96f92f9ccc92'
 -13> 2019-07-22 19:17:54.491363 7f2064488f40 10 mon.node-1@-1(probing) e3 features compat={},rocompat={},incompat={1=initial feature set (~v.18),3=single paxos with k/v store (v0.?),4=support erasure code pools,5=new-style osdmap encoding,6=support isa/lrc erasure code}
 -12> 2019-07-22 19:17:54.491371 7f2064488f40 10 mon.node-1@-1(probing) e3 calc_quorum_requirements required_features 18416819765248
 -11> 2019-07-22 19:17:54.491374 7f2064488f40 10 mon.node-1@-1(probing) e3 required_features 18416819765248
 -10> 2019-07-22 19:17:54.491381 7f2064488f40 10 mon.node-1@-1(probing) e3 has_ever_joined = 1
 -9> 2019-07-22 19:17:54.491411 7f2064488f40 10 mon.node-1@-1(probing) e3 sync_last_committed_floor 0
 -8> 2019-07-22 19:17:54.491413 7f2064488f40 10 mon.node-1@-1(probing) e3 init_paxos
 -7> 2019-07-22 19:17:54.491516 7f2064488f40 1 mon.node-1@-1(probing).mds e0 Unable to load 'last_metadata'
 -6> 2019-07-22 19:17:54.491558 7f2064488f40 10 mon.node-1@-1(probing).health init
 -5> 2019-07-22 19:17:54.491574 7f2064488f40 10 mon.node-1@-1(probing) e3 refresh_from_paxos
 -4> 2019-07-22 19:17:54.491608 7f2064488f40 1 mon.node-1@-1(probing).paxosservice(pgmap 21727587..21728259) refresh upgraded, format 0 -> 1
 -3> 2019-07-22 19:17:54.491612 7f2064488f40 1 mon.node-1@-1(probing).pg v0 on_upgrade discarding in-core PGMap
 -2> 2019-07-22 19:17:54.491635 7f2064488f40 10 mon.node-1@-1(probing).pg v0 update_from_paxos v0, read_full
 -1> 2019-07-22 19:17:54.491638 7f2064488f40 10 mon.node-1@-1(probing).pg v0 read_pgmap_meta
 0> 2019-07-22 19:17:54.495504 7f2064488f40 -1 *** Caught signal (Aborted) **
 in thread 7f2064488f40 thread_name:ceph-mon
 ceph version 12.2.12-218-g9fd889f (9fd889fe09c652512ca78854702d5ad9bf3059bb) luminous (stable)
 1: (()+0x96b249) [0x7f2063e73249]
 2: (()+0x10330) [0x7f20628a0330]
 3: (gsignal()+0x37) [0x7f2060e8bc37]
 4: (abort()+0x148) [0x7f2060e8f028]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x155) [0x7f206179a535]
 6: (()+0x5e6d6) [0x7f20617986d6]
 7: (()+0x5e703) [0x7f2061798703]
 8: (()+0x5e922) [0x7f2061798922]
 9: (object_stat_sum_t::decode(ceph::buffer::list::iterator&)+0x650) [0x7f2063c81be0]
 10: (object_stat_collection_t::decode(ceph::buffer::list::iterator&)+0x4f) [0x7f2063c9627f]
 11: (pg_stat_t::decode(ceph::buffer::list::iterator&)+0x1d5) [0x7f2063c96965]
 12: (PGMap::update_pg(pg_t, ceph::buffer::list&)+0xf4) [0x7f20639d93b4]
 13: (PGMonitor::read_pgmap_full()+0x161) [0x7f20639a8a81]
 14: (PGMonitor::update_from_paxos(bool*)+0x699) [0x7f20639b0479]
 15: (PaxosService::refresh(bool*)+0x1a3) [0x7f2063a55103]
 16: (Monitor::refresh_from_paxos(bool*)+0x183) [0x7f206390cd53]
 17: (Monitor::init_paxos()+0xfd) [0x7f206390d12d]
 18: (Monitor::preinit()+0xa7e) [0x7f206390dbee]
 19: (main()+0x3bf4) [0x7f206383cde4]
 20: (__libc_start_main()+0xf5) [0x7f2060e76f45]
 21: (()+0x3db4fe) [0x7f20638e34fe]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
--- logging levels ---
 0/ 5 none
 0/ 1 lockdep
 0/ 1 context
 1/ 1 crush
 1/ 5 mds
 1/ 5 mds_balancer
 1/ 5 mds_locker
 1/ 5 mds_log
 1/ 5 mds_log_expire
 1/ 5 mds_migrator
 0/ 1 buffer
 0/ 1 timer
 0/ 1 filer
 0/ 1 striper
 0/ 1 objecter
 0/ 5 rados
 0/ 5 rbd
 0/ 5 rbd_mirror
 0/ 5 rbd_replay
 0/ 5 journaler
 0/ 5 objectcacher
 0/ 5 client
 1/ 5 osd
 0/ 5 optracker
 0/ 5 objclass
 1/ 3 filestore
 1/ 3 journal
 0/ 5 ms
 100/100 mon
 0/10 monc
 1/ 5 paxos
 0/ 5 tp
 1/ 5 auth
 1/ 5 crypto
 1/ 1 finisher
 1/ 1 reserver
 1/ 5 heartbeatmap
 1/ 5 perfcounter
 1/ 5 rgw
 1/10 civetweb
 1/ 5 javaclient
 1/ 5 asok
 1/ 1 throttle
 0/ 0 refs
 1/ 5 xio
 1/ 5 compressor
 1/ 5 bluestore
 1/ 5 bluefs
 1/ 3 bdev
 1/ 5 kstore
 4/ 5 rocksdb
 4/ 5 leveldb
 4/ 5 memdb
 1/ 5 kinetic
 1/ 5 fuse
 1/ 5 mgr
 1/ 5 mgrc
 1/ 5 dpdk
 1/ 5 eventtrace
 -2/-2 (syslog threshold)
 99/99 (stderr threshold)
 max_recent 10000
 max_new 1000
 log_file
--- end dump of recent events ---