Hello,
I have a cluster of three nodes (two OSDs on each node). First I
updated one node: the OSDs are OK and running, but ceph-mon crashed.
cephus@ceph3:~$ sudo /usr/bin/ceph-mon
--cluster=ceph -i ceph3 -f --setuser ceph --setgroup ceph
--debug_mon 20
starting mon.ceph3 rank 2 at 192.168.49.103:6789/0 mon_data
/var/lib/ceph/mon/ceph-ceph3 fsid
3c58a184-bf27-4273-8000-405513006a7b
mds/FSMap.cc: In function 'void FSMap::sanity() const' thread
7fc9f74ac4c0 time 2016-11-09 14:57:03.743773
mds/FSMap.cc: 628: FAILED assert(i.second.state ==
MDSMap::STATE_STANDBY)
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x8b) [0x55c5ddd031eb]
2: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
3: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
4: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
5: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
6: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
7: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
8: (main()+0x236d) [0x55c5dd901e9d]
9: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
10: (()+0x26106a) [0x55c5dd95406a]
NOTE: a copy of the executable, or `objdump -rdS
<executable>` is needed to interpret this.
2016-11-09 14:57:03.748124 7fc9f74ac4c0 -1 mds/FSMap.cc: In
function 'void FSMap::sanity() const' thread 7fc9f74ac4c0 time
2016-11-09 14:57:03.743773
mds/FSMap.cc: 628: FAILED assert(i.second.state ==
MDSMap::STATE_STANDBY)
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x8b) [0x55c5ddd031eb]
2: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
3: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
4: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
5: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
6: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
7: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
8: (main()+0x236d) [0x55c5dd901e9d]
9: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
10: (()+0x26106a) [0x55c5dd95406a]
NOTE: a copy of the executable, or `objdump -rdS
<executable>` is needed to interpret this.
0> 2016-11-09 14:57:03.748124 7fc9f74ac4c0 -1
mds/FSMap.cc: In function 'void FSMap::sanity() const' thread
7fc9f74ac4c0 time 2016-11-09 14:57:03.743773
mds/FSMap.cc: 628: FAILED assert(i.second.state ==
MDSMap::STATE_STANDBY)
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x8b) [0x55c5ddd031eb]
2: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
3: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
4: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
5: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
6: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
7: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
8: (main()+0x236d) [0x55c5dd901e9d]
9: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
10: (()+0x26106a) [0x55c5dd95406a]
NOTE: a copy of the executable, or `objdump -rdS
<executable>` is needed to interpret this.
*** Caught signal (Aborted) **
in thread 7fc9f74ac4c0 thread_name:ceph-mon
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (()+0x4f6222) [0x55c5ddbe9222]
2: (()+0x10330) [0x7fc9f67ba330]
3: (gsignal()+0x37) [0x7fc9f4a40c37]
4: (abort()+0x148) [0x7fc9f4a44028]
5: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x265) [0x55c5ddd033c5]
6: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
7: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
8: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
9: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
10: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
11: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
12: (main()+0x236d) [0x55c5dd901e9d]
13: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
14: (()+0x26106a) [0x55c5dd95406a]
2016-11-09 14:57:03.757446 7fc9f74ac4c0 -1 *** Caught signal
(Aborted) **
in thread 7fc9f74ac4c0 thread_name:ceph-mon
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (()+0x4f6222) [0x55c5ddbe9222]
2: (()+0x10330) [0x7fc9f67ba330]
3: (gsignal()+0x37) [0x7fc9f4a40c37]
4: (abort()+0x148) [0x7fc9f4a44028]
5: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x265) [0x55c5ddd033c5]
6: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
7: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
8: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
9: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
10: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
11: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
12: (main()+0x236d) [0x55c5dd901e9d]
13: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
14: (()+0x26106a) [0x55c5dd95406a]
NOTE: a copy of the executable, or `objdump -rdS
<executable>` is needed to interpret this.
0> 2016-11-09 14:57:03.757446 7fc9f74ac4c0 -1 *** Caught
signal (Aborted) **
in thread 7fc9f74ac4c0 thread_name:ceph-mon
ceph version 10.2.3 (ecc23778eb545d8dd55e2e4735b53cc93f92e65b)
1: (()+0x4f6222) [0x55c5ddbe9222]
2: (()+0x10330) [0x7fc9f67ba330]
3: (gsignal()+0x37) [0x7fc9f4a40c37]
4: (abort()+0x148) [0x7fc9f4a44028]
5: (ceph::__ceph_assert_fail(char const*, char const*, int,
char const*)+0x265) [0x55c5ddd033c5]
6: (FSMap::sanity() const+0x932) [0x55c5ddc28112]
7: (MDSMonitor::update_from_paxos(bool*)+0x450)
[0x55c5dda53160]
8: (PaxosService::refresh(bool*)+0x19a) [0x55c5dd9c6b4a]
9: (Monitor::refresh_from_paxos(bool*)+0x143) [0x55c5dd963433]
10: (Monitor::init_paxos()+0x85) [0x55c5dd963845]
11: (Monitor::preinit()+0x925) [0x55c5dd973ec5]
12: (main()+0x236d) [0x55c5dd901e9d]
13: (__libc_start_main()+0xf5) [0x7fc9f4a2bf45]
14: (()+0x26106a) [0x55c5dd95406a]
NOTE: a copy of the executable, or `objdump -rdS
<executable>` is needed to interpret this.
Ticket on tracker.ceph.com:
http://tracker.ceph.com/issues/17837
Best regards,
Alex
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com