Hi (again), meanwhile I tried "ceph-bluestore-tool fsck --path /var/lib/ceph/osd/ceph-0" but that resulted in a segfault (please see attached console log). Regards Martin Am 10.12.2017 um 14:34 schrieb Martin Preuss: > Hi, > > I'm new to Ceph. I started a ceph cluster from scratch on DEbian 9, > consisting of 3 hosts, each host has 3-4 OSDs (using 4TB hdds, currently > totalling 10 hdds). > > Right from the start I always received random scrub errors telling me > that some checksums didn't match the expected value, fixable with "ceph > pg repair". > > I looked at the ceph-osd logfiles on each of the hosts and compared with > the corresponding syslogs. I never found any hardware error, so there > was no problem reading or writing a sector hardware-wise. Also there was > never any other suspicious syslog entry around the time of checksum > error reporting. > > When I looked at the checksum error entries I found that the reported > bad checksum always was "0x6706be76". > > Could someone please tell me where to look further for the source of the > problem? > > I appended an excerpt of the osd logs. > > > Kind regards > Martin > > > > > _______________________________________________ > ceph-users mailing list > ceph-users@xxxxxxxxxxxxxx > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > -- "Things are only impossible until they're not"
# ceph-bluestore-tool fsck --path /var/lib/ceph/osd/ceph-0 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count) ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2] 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 3: (main()+0xf9e) [0x55e6d51e064e] 4: (__libc_start_main()+0xf1) [0x7f05400742b1] 5: (_start()+0x2a) [0x55e6d5261dea] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. 2017-12-10 19:38:42.636097 7f054c5b8e00 -1 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count) ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2] 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 3: (main()+0xf9e) [0x55e6d51e064e] 4: (__libc_start_main()+0xf1) [0x7f05400742b1] 5: (_start()+0x2a) [0x55e6d5261dea] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. 0> 2017-12-10 19:38:42.636097 7f054c5b8e00 -1 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count) ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2] 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 3: (main()+0xf9e) [0x55e6d51e064e] 4: (__libc_start_main()+0xf1) [0x7f05400742b1] 5: (_start()+0x2a) [0x55e6d5261dea] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. *** Caught signal (Aborted) ** in thread 7f054c5b8e00 thread_name:ceph-bluestore- ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (()+0x410f74) [0x55e6d54c8f74] 2: (()+0x110c0) [0x7f05416f10c0] 3: (gsignal()+0xcf) [0x7f0540086fcf] 4: (abort()+0x16a) [0x7f05400883fa] 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e] 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 7: (main()+0xf9e) [0x55e6d51e064e] 8: (__libc_start_main()+0xf1) [0x7f05400742b1] 9: (_start()+0x2a) [0x55e6d5261dea] 2017-12-10 19:38:42.636699 7f054c5b8e00 -1 *** Caught signal (Aborted) ** in thread 7f054c5b8e00 thread_name:ceph-bluestore- ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (()+0x410f74) [0x55e6d54c8f74] 2: (()+0x110c0) [0x7f05416f10c0] 3: (gsignal()+0xcf) [0x7f0540086fcf] 4: (abort()+0x16a) [0x7f05400883fa] 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e] 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 7: (main()+0xf9e) [0x55e6d51e064e] 8: (__libc_start_main()+0xf1) [0x7f05400742b1] 9: (_start()+0x2a) [0x55e6d5261dea] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. 0> 2017-12-10 19:38:42.636699 7f054c5b8e00 -1 *** Caught signal (Aborted) ** in thread 7f054c5b8e00 thread_name:ceph-bluestore- ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable) 1: (()+0x410f74) [0x55e6d54c8f74] 2: (()+0x110c0) [0x7f05416f10c0] 3: (gsignal()+0xcf) [0x7f0540086fcf] 4: (abort()+0x16a) [0x7f05400883fa] 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e] 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19] 7: (main()+0xf9e) [0x55e6d51e064e] 8: (__libc_start_main()+0xf1) [0x7f05400742b1] 9: (_start()+0x2a) [0x55e6d5261dea] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. Abgebrochen
Attachment:
signature.asc
Description: OpenPGP digital signature
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com