I'm upgrading my cluster from luminous to mimic. I've upgraded my monitors and am attempting to upgrade the mgrs. Unfortunately, after an upgrade the mgr daemon exits immediately with error
code 1.
I've tried running ceph-mgr in debug mode to try to see what's happening but the output (below) is a bit cryptic for me. It looks like authentication might be failing but it was working prior
to the upgrade.
I do have "auth supported = cephx" in the global section of ceph.conf.
Thanks.
/usr/bin/ceph-mgr -f --cluster ceph --id 8 --setuser ceph --setgroup ceph -d --debug_ms 5
2019-01-04 07:01:38.457 7f808f83f700 2 Event(0x30c42c0 nevent=5000 time_id=1).set_owner idx=0 owner=140190140331776
2019-01-04 07:01:38.457 7f808f03e700 2 Event(0x30c4500 nevent=5000 time_id=1).set_owner idx=1 owner=140190131939072
2019-01-04 07:01:38.457 7f808e83d700 2 Event(0x30c4e00 nevent=5000 time_id=1).set_owner idx=2 owner=140190123546368
2019-01-04 07:01:38.457 7f809dd5b380 1 Processor -- start
2019-01-04 07:01:38.477 7f809dd5b380 1 -- - start start
2019-01-04 07:01:38.481 7f809dd5b380 1 -- - -->
192.168.253.147:6789/0 -- auth(proto 0 26 bytes epoch 0) v1 -- 0x32a6780 con 0
2019-01-04 07:01:38.481 7f809dd5b380 1 -- - -->
192.168.253.148:6789/0 -- auth(proto 0 26 bytes epoch 0) v1 -- 0x32a6a00 con 0
ked_seq 0 vs out_seq 0
ked_seq 0 vs out_seq 0
1 0x30c5440 mon_map magic: 0 v1
1 0x30c5680 mon_map magic: 0 v1
2 0x32a6780 auth_reply(proto 2 0 (0) Success) v1
2 0x32a6a00 auth_reply(proto 2 0 (0) Success) v1
ce00
d500
3 0x32a6f00 auth_reply(proto 2 -22 (22) Invalid argument) v1
0 con 0x332ce00
3 0x32a6780 auth_reply(proto 2 -22 (22) Invalid argument) v1
80 con 0x332d500
failed to fetch mon config (--no-mon-config to skip)