Re: Random checksum errors (bluestore on Luminous)

Hi (again),

In the meantime I tried

"ceph-bluestore-tool fsck --path /var/lib/ceph/osd/ceph-0"

but that aborted on a failed assertion instead of completing (please see the attached console log).


Regards
Martin


On 10.12.2017 at 14:34, Martin Preuss wrote:
> Hi,
> 
> I'm new to Ceph. I set up a Ceph cluster from scratch on Debian 9. It
> consists of 3 hosts with 3-4 OSDs each (4 TB HDDs, currently 10 HDDs
> in total).
> 
> Right from the start I have been getting random scrub errors reporting
> that some checksums did not match the expected value. They can be fixed
> with "ceph pg repair".
> 
> I looked at the ceph-osd logfiles on each of the hosts and compared them
> with the corresponding syslogs. I found no hardware errors, so there
> seems to have been no problem reading or writing sectors at the hardware
> level. There were also no other suspicious syslog entries around the
> times the checksum errors were reported.
> 
> When I looked at the checksum error entries I found that the reported
> bad checksum was always "0x6706be76".
> 
> Could someone please tell me where to look further for the source of the
> problem?
> 
> I have attached an excerpt of the OSD logs.
> 
> 
> Kind regards
> Martin
> 
> 
> 
> 
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
> 
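
A note on the recurring value 0x6706be76 from the quoted message: a bad checksum that is always identical suggests that the data being read is always identical too. One hypothesis worth checking is that it is the crc32c of a zero-filled 4 KiB block, which would mean the reads returned zeros rather than randomly corrupted data. The sketch below tests this under assumptions that are not stated in this thread: that the checksum type is crc32c, that it is computed over 4096-byte blocks, and that the register starts at 0xffffffff with no final inversion. The function name crc32c_raw is only an illustration and not a Ceph API.

```python
# Minimal sketch: compare the checksum reported in the scrub errors with the
# crc32c of an all-zero block. Block size, seed and the missing final XOR are
# assumptions about how the checksum is computed, not facts from this thread.

REPORTED = 0x6706BE76          # value quoted in the original message
BLOCK_SIZE = 4096              # assumed checksum block size (4 KiB)
POLY_REFLECTED = 0x82F63B78    # CRC-32C (Castagnoli) polynomial, bit-reflected


def crc32c_raw(data: bytes, seed: int = 0xFFFFFFFF) -> int:
    """Bitwise CRC-32C that returns the raw register (no final XOR)."""
    crc = seed
    for byte in data:
        crc ^= byte
        for _ in range(8):
            # Shift right by one bit; apply the polynomial if a 1 fell out.
            crc = (crc >> 1) ^ (POLY_REFLECTED if crc & 1 else 0)
    return crc & 0xFFFFFFFF


zero_block_crc = crc32c_raw(bytes(BLOCK_SIZE))
print(f"crc32c of {BLOCK_SIZE} zero bytes: {zero_block_crc:#010x}")
print(f"reported bad checksum:       {REPORTED:#010x}")
print("match" if zero_block_crc == REPORTED else "no match")
```

If the two values match, the next place to look would be why reads come back as zeros (controller, cabling, memory, kernel) rather than the on-disk data itself. If they do not match, the assumptions above about seed or block size may simply be wrong for this setup.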


-- 
"Things are only impossible until they're not"
# ceph-bluestore-tool fsck --path /var/lib/ceph/osd/ceph-0
/build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360
/build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count)
 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2]
 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 3: (main()+0xf9e) [0x55e6d51e064e]
 4: (__libc_start_main()+0xf1) [0x7f05400742b1]
 5: (_start()+0x2a) [0x55e6d5261dea]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
2017-12-10 19:38:42.636097 7f054c5b8e00 -1 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360
/build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count)

 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2]
 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 3: (main()+0xf9e) [0x55e6d51e064e]
 4: (__libc_start_main()+0xf1) [0x7f05400742b1]
 5: (_start()+0x2a) [0x55e6d5261dea]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

     0> 2017-12-10 19:38:42.636097 7f054c5b8e00 -1 /build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: In function 'int BlueStore::_fsck(bool, bool)' thread 7f054c5b8e00 time 2017-12-10 19:38:42.634360
/build/ceph-12.2.2/src/os/bluestore/BlueStore.cc: 6122: FAILED assert(used_blocks.size() > count)

 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x7f0542e198b2]
 2: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 3: (main()+0xf9e) [0x55e6d51e064e]
 4: (__libc_start_main()+0xf1) [0x7f05400742b1]
 5: (_start()+0x2a) [0x55e6d5261dea]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

*** Caught signal (Aborted) **
 in thread 7f054c5b8e00 thread_name:ceph-bluestore-
 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (()+0x410f74) [0x55e6d54c8f74]
 2: (()+0x110c0) [0x7f05416f10c0]
 3: (gsignal()+0xcf) [0x7f0540086fcf]
 4: (abort()+0x16a) [0x7f05400883fa]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e]
 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 7: (main()+0xf9e) [0x55e6d51e064e]
 8: (__libc_start_main()+0xf1) [0x7f05400742b1]
 9: (_start()+0x2a) [0x55e6d5261dea]
2017-12-10 19:38:42.636699 7f054c5b8e00 -1 *** Caught signal (Aborted) **
 in thread 7f054c5b8e00 thread_name:ceph-bluestore-

 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (()+0x410f74) [0x55e6d54c8f74]
 2: (()+0x110c0) [0x7f05416f10c0]
 3: (gsignal()+0xcf) [0x7f0540086fcf]
 4: (abort()+0x16a) [0x7f05400883fa]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e]
 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 7: (main()+0xf9e) [0x55e6d51e064e]
 8: (__libc_start_main()+0xf1) [0x7f05400742b1]
 9: (_start()+0x2a) [0x55e6d5261dea]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

     0> 2017-12-10 19:38:42.636699 7f054c5b8e00 -1 *** Caught signal (Aborted) **
 in thread 7f054c5b8e00 thread_name:ceph-bluestore-

 ceph version 12.2.2 (cf0baeeeeba3b47f9427c6c97e2144b094b7e5ba) luminous (stable)
 1: (()+0x410f74) [0x55e6d54c8f74]
 2: (()+0x110c0) [0x7f05416f10c0]
 3: (gsignal()+0xcf) [0x7f0540086fcf]
 4: (abort()+0x16a) [0x7f05400883fa]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x7f0542e19a3e]
 6: (BlueStore::_fsck(bool, bool)+0x7c09) [0x55e6d5322f19]
 7: (main()+0xf9e) [0x55e6d51e064e]
 8: (__libc_start_main()+0xf1) [0x7f05400742b1]
 9: (_start()+0x2a) [0x55e6d5261dea]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Aborted
