rocksdb report bluestore corruption

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

    Just got this assert from one of our cluster, the drive is an SSD
and looks fine through smart.   Ceph cluster is running 11.2.0 with
bluestore with some tunings:

bluestore_min_alloc_size_ssd = 65536
bluestore_cache_size = 3221225472
bluestore_rocksdb_options =
compression=kNoCompression,max_write_buffer_number=32,min_write_buffer_number_to_merge=2,recycle_log_file_num=32,compaction_style=kCompactionStyleLevel,write_buffer_size=536870912,target_file_size_base=67108864,max_background_compactions=31,level0_file_num_compaction_trigger=8,level0_slowdown_writes_trigger=32,level0_stop_writes_trigger=64,num_levels=7,max_bytes_for_level_base=536870912,max_bytes_for_level_multiplier=8,compaction_threads=32,flusher_threads=8,compaction_readahead_size=2097152


    The OSD and the data are still keeping for further debugging, more
than happy to provide more details.

   -14> 2017-08-06 04:03:38.356707 7f349df00700  4 rocksdb: (Original
Log Time 2017/08/06-04:03:38.356624) EVENT_LOG_v1 {"time_micros":
1502017418356373, "job": 3, "event": "compaction_finished",
"compaction_time_micros": 3608342, "output_level": 1,
"num_output_files": 3, "total_output_size": 204943307,
"num_input_records": 1888691, "num_output_records": 784200,
"num_subcompactions": 1, "num_single_delete_mismatches": 0,
"num_single_delete_fallthrough": 0, "lsm_state": [8, 8, 1, 0, 0, 0,
0]}
   -13> 2017-08-06 04:03:38.356711 7f349df00700  2 rocksdb: Waiting
after background compaction error: Corruption: block checksum
mismatch, Accumulated background error counts: 1
   -12> 2017-08-06 04:03:39.357038 7f349df00700  5 rocksdb: [JOB 3]
Delete db/000474.sst type=2 #474 -- OK

   -11> 2017-08-06 04:03:39.357068 7f349df00700  4 rocksdb:
EVENT_LOG_v1 {"time_micros": 1502017419357061, "job": 3, "event":
"table_file_deletion", "file_number": 474}
   -10> 2017-08-06 04:03:39.357085 7f349df00700  5 rocksdb: [JOB 3]
Delete db/000473.sst type=2 #473 -- OK

    -9> 2017-08-06 04:03:39.357091 7f349df00700  4 rocksdb:
EVENT_LOG_v1 {"time_micros": 1502017419357089, "job": 3, "event":
"table_file_deletion", "file_number": 473}
    -8> 2017-08-06 04:03:39.357129 7f349df00700  5 rocksdb: [JOB 3]
Delete db/000472.sst type=2 #472 -- OK

    -7> 2017-08-06 04:03:39.357138 7f349df00700  4 rocksdb:
EVENT_LOG_v1 {"time_micros": 1502017419357136, "job": 3, "event":
"table_file_deletion", "file_number": 472}
    -6> 2017-08-06 04:03:42.188494 7f34c0d86a40  0 osd.31 2168 load_pgs
    -5> 2017-08-06 04:03:42.191759 7f34c0d86a40  5 osd.31 pg_epoch:
2168 pg[2.fb3(unlocked)] enter Initial
    -4> 2017-08-06 04:03:42.395360 7f34c0d86a40  5 osd.31 pg_epoch:
2168 pg[2.fb3( v 1813'28879 (615'24904,1813'28879] local-les=2168
n=522 ec=156 les/c/f 2168/2168/0 2166/2167/156) [92,82,31] r=2 lpr=0
pi=156-2166/3 crt=1813'28879 lcod 0'0 inactive NOTIFY NIBBLEWISE] exit
Initial 0.203606 0 0.000000
    -3> 2017-08-06 04:03:42.395432 7f34c0d86a40  5 osd.31 pg_epoch:
2168 pg[2.fb3( v 1813'28879 (615'24904,1813'28879] local-les=2168
n=522 ec=156 les/c/f 2168/2168/0 2166/2167/156) [92,82,31] r=2 lpr=0
pi=156-2166/3 crt=1813'28879 lcod 0'0 inactive NOTIFY NIBBLEWISE]
enter Reset
    -2> 2017-08-06 04:03:42.395457 7f34c0d86a40  5
write_log_and_missing with: dirty_to: 0'0, dirty_from:
4294967295'18446744073709551615, writeout_from:
4294967295'18446744073709551615, trimmed: , clear_divergent_priors: 0
    -1> 2017-08-06 04:03:42.395816 7f34926e9700 -1 rocksdb:
submit_transaction error: Corruption: block checksum mismatch code = 2
Rocksdb transaction:
Put( Prefix = O key =
0x7f8000000000000002cdf0000021213dfffffffffffffffeffffffffffffffff'o'
Value size = 29)
Put( Prefix = S key = 'nid_max' Value size = 8)
Put( Prefix = S key = 'blobid_max' Value size = 8)
     0> 2017-08-06 04:03:42.399704 7f34926e9700 -1
/tmp/buildd/ceph-11.2.0/src/os/bluestore/BlueStore.cc: In function
'void BlueStore::_kv_sync_thread()' thread 7f34926e9700 time
2017-08-06 04:03:42.395856
/tmp/buildd/ceph-11.2.0/src/os/bluestore/BlueStore.cc: 6708: FAILED
assert(r == 0)

 ceph version 11.2.0 (f223e27eeb35991352ebc1f67423d4ebc252adb7)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char
const*)+0x80) [0x55b6dd7b9380]
 2: (BlueStore::_kv_sync_thread()+0x1563) [0x55b6dd4d4e03]
 3: (BlueStore::KVSyncThread::entry()+0xd) [0x55b6dd50e9dd]
 4: (()+0x76ba) [0x7f34bfe1a6ba]
 5: (clone()+0x6d) [0x7f34be4e83dd]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux