Latency spikes in OSD's based on bluestore

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




Hi all, 

I'm trying to get an better understanding regarding the "latency spikes" we see on *every* SSD-OSD (it may very well be on HDD's as well but to be honest I haven't bothered looking there since they are of no interest at the moment). 

Anyway, we are running a cluster with version 'ceph version 12.2.8-89.el7cp (2f66ab2fa63b2879913db6d6cf314572a83fd1f0) luminous (stable)' and recently upgraded all our OSD's to bluestore. Unfortunately the way the upgrade was done, earlier graphs / historical data of the cluster was lost (something about when switching from collectd to the promethius approach) - none the less, I can't say if the spikes were there before switching to bluestore.

I seem to have narrowed down the spikes to when the OSD's are doing 'compaction_started' and 'compaction_finished'. During that time (which of course is different on every OSD) we see spikes in the area between ~20ms - ~80 ms. Which seems worrying. Looking at the average latency it varies between ~0.2ms - ~9ms. 

I've tried to understand the underlying mechanism's involved here and which parameters to tune, but I find it extremely hard. 

The current config of an OSD regarding bluestore looks like the following, 

$ > ceph -n osd.88 --show-config | grep bluestore
debug_bluestore = 1/5
bluestore_2q_cache_kin_ratio = 0.500000
bluestore_2q_cache_kout_ratio = 0.500000
bluestore_allocator = stupid
bluestore_bitmapallocator_blocks_per_zone = 1024
bluestore_bitmapallocator_span_size = 1024
bluestore_blobid_prealloc = 10240
bluestore_block_create = true
bluestore_block_db_create = false
bluestore_block_db_path =
bluestore_block_db_size = 0
bluestore_block_path =
bluestore_block_preallocate_file = false
bluestore_block_size = 10737418240
bluestore_block_wal_create = false
bluestore_block_wal_path =
bluestore_block_wal_size = 100663296
bluestore_bluefs = true
bluestore_bluefs_balance_interval = 1.000000
bluestore_bluefs_env_mirror = false
bluestore_bluefs_gift_ratio = 0.020000
bluestore_bluefs_max_ratio = 0.900000
bluestore_bluefs_min = 1073741824
bluestore_bluefs_min_free = 1073741824
bluestore_bluefs_min_ratio = 0.020000
bluestore_bluefs_reclaim_ratio = 0.200000
bluestore_cache_autotune = true
bluestore_cache_autotune_chunk_size = 33554432
bluestore_cache_autotune_interval = 5.000000
bluestore_cache_kv_ratio = 0.400000
bluestore_cache_meta_ratio = 0.400000
bluestore_cache_size = 0
bluestore_cache_size_hdd = 1073741824
bluestore_cache_size_ssd = 3221225472
bluestore_cache_trim_interval = 0.050000
bluestore_cache_trim_max_skip_pinned = 64
bluestore_cache_type = 2q
bluestore_clone_cow = true
bluestore_compression_algorithm = snappy
bluestore_compression_max_blob_size = 0
bluestore_compression_max_blob_size_hdd = 524288
bluestore_compression_max_blob_size_ssd = 65536
bluestore_compression_min_blob_size = 0
bluestore_compression_min_blob_size_hdd = 131072
bluestore_compression_min_blob_size_ssd = 8192
bluestore_compression_mode = none
bluestore_compression_required_ratio = 0.875000
bluestore_csum_max_block = 65536
bluestore_csum_min_block = 4096
bluestore_csum_type = crc32c
bluestore_debug_freelist = false
bluestore_debug_fsck_abort = false
bluestore_debug_inject_read_err = false
bluestore_debug_misc = false
bluestore_debug_no_reuse_blocks = false
bluestore_debug_omit_block_device_write = false
bluestore_debug_omit_kv_commit = false
bluestore_debug_permit_any_bdev_label = false
bluestore_debug_prefill = 0.000000
bluestore_debug_prefragment_max = 1048576
bluestore_debug_random_read_err = 0.000000
bluestore_debug_randomize_serial_transaction = 0
bluestore_debug_small_allocations = 0
bluestore_default_buffered_read = true
bluestore_default_buffered_write = false
bluestore_deferred_batch_ops = 0
bluestore_deferred_batch_ops_hdd = 64
bluestore_deferred_batch_ops_ssd = 16
bluestore_extent_map_inline_shard_prealloc_size = 256
bluestore_extent_map_shard_max_size = 1200
bluestore_extent_map_shard_min_size = 150
bluestore_extent_map_shard_target_size = 500
bluestore_extent_map_shard_target_size_slop = 0.200000
bluestore_freelist_blocks_per_key = 128
bluestore_fsck_on_mkfs = true
bluestore_fsck_on_mkfs_deep = false
bluestore_fsck_on_mount = false
bluestore_fsck_on_mount_deep = true
bluestore_fsck_on_umount = false
bluestore_fsck_on_umount_deep = true
bluestore_gc_enable_blob_threshold = 0
bluestore_gc_enable_total_threshold = 0
bluestore_kvbackend = rocksdb
bluestore_max_alloc_size = 0
bluestore_max_blob_size = 0
bluestore_max_blob_size_hdd = 524288
bluestore_max_blob_size_ssd = 65536
bluestore_max_deferred_txc = 32
bluestore_min_alloc_size = 0
bluestore_min_alloc_size_hdd = 65536
bluestore_min_alloc_size_ssd = 16384
bluestore_nid_prealloc = 1024
bluestore_prefer_deferred_size = 0
bluestore_prefer_deferred_size_hdd = 32768
bluestore_prefer_deferred_size_ssd = 0
bluestore_rocksdb_options = compression=kNoCompression,max_write_buffer_number=4,min_write_buffer_number_to_merge=1,recycle_log_file_num=4,write_buffer_size=268435456,writable_file_max_buffer_size=0,compaction_readahead_size=2097152
bluestore_shard_finishers = false
bluestore_spdk_coremask = 0x3
bluestore_spdk_max_io_completion = 0
bluestore_spdk_mem = 512
bluestore_sync_submit_transaction = false
bluestore_throttle_bytes = 67108864
bluestore_throttle_cost_per_io = 0
bluestore_throttle_cost_per_io_hdd = 670000
bluestore_throttle_cost_per_io_ssd = 4000
bluestore_throttle_deferred_bytes = 134217728
mon_debug_no_require_bluestore_for_ec_overwrites = false


The logs from a specific OSD during the timeperiod where one "spike occurs" looks like this (the graph suggests the time period around 2019-04-07 04:08), 

= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = 

2019-04-07 04:01:33.393575 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:684] reusing log 17652 from recycle list

2019-04-07 04:01:33.393726 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:725] [default] New memtable created with log file: #17656. Immutable memtables: 0.

2019-04-07 04:01:33.394191 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:01:33.393750) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:1158] Calling FlushMemTableToOutputFile with column family [default], flush slots available 1, compaction slots allowed 1, compaction slots scheduled 1
2019-04-07 04:01:33.394200 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:264] [default] [JOB 6774] Flushing memtable with next log file: 17656

2019-04-07 04:01:33.394211 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602493394204, "job": 6774, "event": "flush_started", "num_memtables": 1, "num_entries": 328448, "num_deletes": 72449, "memory_usage": 260051496}
2019-04-07 04:01:33.394213 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:293] [default] [JOB 6774] Level-0 flush table #17657: started
2019-04-07 04:01:33.540589 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602493540576, "cf_name": "default", "job": 6774, "event": "table_file_creation", "file_number": 17657, "file_size": 12268280, "table_properties": {"data_size": 11712643, "index_size": 178205, "filter_size": 376443, "raw_key_size": 7426084, "raw_average_key_size": 53, "raw_value_size": 9209707, "raw_average_value_size": 65, "num_data_blocks": 2909, "num_entries": 139996, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "72424", "kMergeOperands": "2671"}}
2019-04-07 04:01:33.540599 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:319] [default] [JOB 6774] Level-0 flush table #17657: 12268280 bytes OK
2019-04-07 04:01:33.541237 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:242] adding log 17654 to recycle list

2019-04-07 04:01:33.541243 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:01:33.540606) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:360] [default] Level-0 commit table #17657 started
2019-04-07 04:01:33.541246 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:01:33.541206) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:383] [default] Level-0 commit table #17657: memtable #1 done
2019-04-07 04:01:33.541248 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:01:33.541216) EVENT_LOG_v1 {"time_micros": 1554602493541211, "job": 6774, "event": "flush_finished", "lsm_state": [3, 1, 9, 0, 0, 0, 0], "immutable_memtables": 0}
2019-04-07 04:01:33.541250 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:01:33.541230) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:132] [default] Level summary: base level 1 max bytes base 268435456 files[3 1 9 0 0 0 0] max score 0.75

2019-04-07 04:01:33.541255 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:388] [JOB 6774] Try to delete WAL files size 253619970, prev total WAL file size 253778896, number of live WAL files 2.

2019-04-07 04:07:38.822080 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:684] reusing log 17654 from recycle list

2019-04-07 04:07:38.822307 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:725] [default] New memtable created with log file: #17658. Immutable memtables: 0.

2019-04-07 04:07:38.823004 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:07:38.822334) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:1158] Calling FlushMemTableToOutputFile with column family [default], flush slots available 1, compaction slots allowed 1, compaction slots scheduled 1
2019-04-07 04:07:38.823012 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:264] [default] [JOB 6775] Flushing memtable with next log file: 17658

2019-04-07 04:07:38.823023 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602858823017, "job": 6775, "event": "flush_started", "num_memtables": 1, "num_entries": 319003, "num_deletes": 70609, "memory_usage": 260049200}
2019-04-07 04:07:38.823037 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:293] [default] [JOB 6775] Level-0 flush table #17659: started
2019-04-07 04:07:38.972070 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602858972057, "cf_name": "default", "job": 6775, "event": "table_file_creation", "file_number": 17659, "file_size": 10264801, "table_properties": {"data_size": 9777447, "index_size": 148893, "filter_size": 337472, "raw_key_size": 6697519, "raw_average_key_size": 53, "raw_value_size": 7537516, "raw_average_value_size": 59, "num_data_blocks": 2436, "num_entries": 126115, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "70615", "kMergeOperands": "1900"}}
2019-04-07 04:07:38.972080 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:319] [default] [JOB 6775] Level-0 flush table #17659: 10264801 bytes OK
2019-04-07 04:07:38.972562 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:242] adding log 17656 to recycle list

2019-04-07 04:07:38.972570 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:07:38.972087) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:360] [default] Level-0 commit table #17659 started
2019-04-07 04:07:38.972574 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:07:38.972525) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:383] [default] Level-0 commit table #17659: memtable #1 done
2019-04-07 04:07:38.972576 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:07:38.972536) EVENT_LOG_v1 {"time_micros": 1554602858972532, "job": 6775, "event": "flush_finished", "lsm_state": [4, 1, 9, 0, 0, 0, 0], "immutable_memtables": 0}
2019-04-07 04:07:38.972578 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:07:38.972555) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:132] [default] Level summary: base level 1 max bytes base 268435456 files[4 1 9 0 0 0 0] max score 1.00

2019-04-07 04:07:38.972583 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:388] [JOB 6775] Try to delete WAL files size 253805042, prev total WAL file size 253877756, number of live WAL files 2.

2019-04-07 04:07:38.972728 7fba230c1700  3 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl.cc:447] ------- DUMPING STATS -------
2019-04-07 04:07:38.972734 7fba230c1700  3 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl.cc:448]
** DB Stats **
Uptime(secs): 1155670.4 total, 1463.9 interval
Cumulative writes: 332M writes, 1661M keys, 332M commit groups, 1.0 writes per commit group, ingest: 1194.88 GB, 1.06 MB/s
Cumulative WAL: 332M writes, 150M syncs, 2.21 writes per sync, written: 1194.88 GB, 1.06 MB/s
Cumulative stall: 00:00:0.000 H:M:S, 0.0 percent
Interval writes: 277K writes, 1291K keys, 277K commit groups, 1.0 writes per commit group, ingest: 967.91 MB, 0.66 MB/s
Interval WAL: 277K writes, 135K syncs, 2.06 writes per sync, written: 0.95 MB, 0.66 MB/s
Interval stall: 00:00:0.000 H:M:S, 0.0 percent

** Compaction Stats [default] **
Level    Files   Size     Score Read(GB)  Rn(GB) Rnp1(GB) Write(GB) Wnew(GB) Moved(GB) W-Amp Rd(MB/s) Wr(MB/s) Comp(sec) Comp(cnt) Avg(sec) KeyIn KeyDrop
----------------------------------------------------------------------------------------------------------------------------------------------------------
  L0      4/0   41.90 MB   1.0      0.0     0.0      0.0      54.6     54.6       0.0   1.0      0.0     73.3       763      5061    0.151       0      0
  L1      1/0   26.39 MB   0.7     75.9    54.6     21.3      37.3     16.0       0.0   0.7     75.4     37.1      1030      1265    0.814   1073M   221M
  L2      9/0   542.29 MB   0.2    234.9    16.0    218.9     219.0      0.1       0.0  13.7    104.5     97.4      2302       448    5.139   1202M   411M
 Sum     14/0   610.57 MB   0.0    310.8    70.6    240.2     310.9     70.8       0.0   5.7     77.7     77.7      4096      6774    0.605   2275M   633M
 Int      0/0    0.00 KB   0.0      0.1     0.0      0.0       0.1      0.0       0.0   1.6     46.4     53.7         1         5    0.254    889K   206K
Uptime(secs): 1155670.4 total, 1155670.4 interval
Flush(GB): cumulative 54.628, interval 0.041
AddFile(GB): cumulative 0.000, interval 0.000
AddFile(Total Files): cumulative 0, interval 0
AddFile(L0 Files): cumulative 0, interval 0
AddFile(Keys): cumulative 0, interval 0
Cumulative compaction: 310.92 GB write, 0.28 MB/s write, 310.78 GB read, 0.28 MB/s read, 4095.7 seconds
Interval compaction: 0.07 GB write, 0.00 MB/s write, 0.06 GB read, 0.00 MB/s read, 1.3 seconds
Stalls(count): 0 level0_slowdown, 0 level0_slowdown_with_compaction, 0 level0_numfiles, 0 level0_numfiles_with_compaction, 0 stop for pending_compaction_bytes, 0 slowdown for pending_compaction_bytes, 0 memtable_compaction, 0 memtable_slowdown, interval 0 total count

** File Read Latency Histogram By Level [default] **

** Compaction Stats [default] **
Level    Files   Size     Score Read(GB)  Rn(GB) Rnp1(GB) Write(GB) Wnew(GB) Moved(GB) W-Amp Rd(MB/s) Wr(MB/s) Comp(sec) Comp(cnt) Avg(sec) KeyIn KeyDrop
----------------------------------------------------------------------------------------------------------------------------------------------------------
  L0      4/0   41.90 MB   1.0      0.0     0.0      0.0      54.6     54.6       0.0   1.0      0.0     73.3       763      5061    0.151       0      0
  L1      1/0   26.39 MB   0.7     75.9    54.6     21.3      37.3     16.0       0.0   0.7     75.4     37.1      1030      1265    0.814   1073M   221M
  L2      9/0   542.29 MB   0.2    234.9    16.0    218.9     219.0      0.1       0.0  13.7    104.5     97.4      2302       448    5.139   1202M   411M
 Sum     14/0   610.57 MB   0.0    310.8    70.6    240.2     310.9     70.8       0.0   5.7     77.7     77.7      4096      6774    0.605   2275M   633M
 Int      0/0    0.00 KB   0.0      0.0     0.0      0.0       0.0      0.0       0.0   0.0      0.0      0.0         0         0    0.000       0      0
Uptime(secs): 1155670.4 total, 1155670.4 interval
Flush(GB): cumulative 54.628, interval 0.000
AddFile(GB): cumulative 0.000, interval 0.000
AddFile(Total Files): cumulative 0, interval 0
AddFile(L0 Files): cumulative 0, interval 0
AddFile(Keys): cumulative 0, interval 0
Cumulative compaction: 310.92 GB write, 0.28 MB/s write, 310.78 GB read, 0.28 MB/s read, 4095.7 seconds
Interval compaction: 0.00 GB write, 0.00 MB/s write, 0.00 GB read, 0.00 MB/s read, 0.0 seconds
Stalls(count): 0 level0_slowdown, 0 level0_slowdown_with_compaction, 0 level0_numfiles, 0 level0_numfiles_with_compaction, 0 stop for pending_compaction_bytes, 0 slowdown for pending_compaction_bytes, 0 memtable_compaction, 0 memtable_slowdown, interval 0 total count

** File Read Latency Histogram By Level [default] **

2019-04-07 04:07:38.972770 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1403] [default] [JOB 6776] Compacting 4@0 + 1@1 files to L1, score 1.00
2019-04-07 04:07:38.972777 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1407] [default] Compaction start summary: Base version 6775 Base level 0, inputs: [17659(10024KB) 17657(11MB) 17655(10096KB) 17653(10MB)], [17651(26MB)]

2019-04-07 04:07:38.972795 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602858972780, "job": 6776, "event": "compaction_started", "files_L0": [17659, 17657, 17655, 17653], "files_L1": [17651], "score": 1, "input_data_size": 71601163}
2019-04-07 04:07:39.921386 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6776] Generated table #17660: 967828 keys, 33787488 bytes
2019-04-07 04:07:39.921410 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859921397, "cf_name": "default", "job": 6776, "event": "table_file_creation", "file_number": 17660, "file_size": 33787488, "table_properties": {"data_size": 30798279, "index_size": 498454, "filter_size": 2489763, "raw_key_size": 50167310, "raw_average_key_size": 51, "raw_value_size": 15631664, "raw_average_value_size": 16, "num_data_blocks": 7644, "num_entries": 967828, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "849562", "kMergeOperands": "16907"}}
2019-04-07 04:07:39.921478 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1173] [default] [JOB 6776] Compacted 4@0 + 1@1 files to L1 => 33787488 bytes
2019-04-07 04:07:39.922027 7fba230c1700  4 rocksdb: (Original Log Time 2019/04/07-04:07:39.921926) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:621] [default] compacted to: base level 1 max bytes base 268435456 files[0 1 9 0 0 0 0] max score 1.15, MB/sec: 75.5 rd, 35.6 wr, level 1, files in(4, 1) out(1) MB in(41.9, 26.4) out(32.2), read-write-amplify(2.4) write-amplify(0.8) OK, records in: 1203386, records dropped: 229905

2019-04-07 04:07:39.922035 7fba230c1700  4 rocksdb: (Original Log Time 2019/04/07-04:07:39.921943) EVENT_LOG_v1 {"time_micros": 1554602859921935, "job": 6776, "event": "compaction_finished", "compaction_time_micros": 948672, "output_level": 1, "num_output_files": 1, "total_output_size": 33787488, "num_input_records": 1197733, "num_output_records": 967828, "num_subcompactions": 1, "num_single_delete_mismatches": 0, "num_single_delete_fallthrough": 0, "lsm_state": [0, 1, 9, 0, 0, 0, 0]}
2019-04-07 04:07:39.922062 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922059, "job": 6776, "event": "table_file_deletion", "file_number": 17659}
2019-04-07 04:07:39.922073 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922072, "job": 6776, "event": "table_file_deletion", "file_number": 17657}
2019-04-07 04:07:39.922082 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922080, "job": 6776, "event": "table_file_deletion", "file_number": 17655}
2019-04-07 04:07:39.922089 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922088, "job": 6776, "event": "table_file_deletion", "file_number": 17653}
2019-04-07 04:07:39.922099 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922097, "job": 6776, "event": "table_file_deletion", "file_number": 17651}
2019-04-07 04:07:39.922129 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1403] [default] [JOB 6777] Compacting 1@1 + 9@2 files to L2, score 1.15
2019-04-07 04:07:39.922141 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1407] [default] Compaction start summary: Base version 6776 Base level 1, inputs: [17660(32MB)], [17625(66MB) 17626(66MB) 17627(66MB) 17628(66MB) 17629(66MB) 17630(65MB) 17631(65MB) 17632(66MB) 17633(14MB)]

2019-04-07 04:07:39.922156 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602859922146, "job": 6777, "event": "compaction_started", "files_L1": [17660], "files_L2": [17625, 17626, 17627, 17628, 17629, 17630, 17631, 17632, 17633], "score": 1.1502, "input_data_size": 602419380}
2019-04-07 04:07:40.744829 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17661: 276977 keys, 69274076 bytes
2019-04-07 04:07:40.744852 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602860744839, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17661, "file_size": 69274076, "table_properties": {"data_size": 67111211, "index_size": 1318246, "filter_size": 843632, "raw_key_size": 18534383, "raw_average_key_size": 66, "raw_value_size": 60026009, "raw_average_value_size": 216, "num_data_blocks": 17264, "num_entries": 276977, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:41.511959 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17662: 140524 keys, 69244885 bytes
2019-04-07 04:07:41.511981 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602861511969, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17662, "file_size": 69244885, "table_properties": {"data_size": 67111835, "index_size": 1631726, "filter_size": 500337, "raw_key_size": 12306449, "raw_average_key_size": 87, "raw_value_size": 62685069, "raw_average_value_size": 446, "num_data_blocks": 17643, "num_entries": 140524, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:42.069833 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17663: 140480 keys, 69251237 bytes
2019-04-07 04:07:42.069869 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602862069856, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17663, "file_size": 69251237, "table_properties": {"data_size": 67109985, "index_size": 1640069, "filter_size": 500196, "raw_key_size": 12302449, "raw_average_key_size": 87, "raw_value_size": 62675074, "raw_average_value_size": 446, "num_data_blocks": 17640, "num_entries": 140480, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:42.642029 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17664: 139272 keys, 69253218 bytes
2019-04-07 04:07:42.642052 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602862642040, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17664, "file_size": 69253218, "table_properties": {"data_size": 67110463, "index_size": 1643899, "filter_size": 497869, "raw_key_size": 12204956, "raw_average_key_size": 87, "raw_value_size": 62802099, "raw_average_value_size": 450, "num_data_blocks": 17411, "num_entries": 139272, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:43.193906 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17665: 137243 keys, 69246759 bytes
2019-04-07 04:07:43.193929 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602863193917, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17665, "file_size": 69246759, "table_properties": {"data_size": 67109536, "index_size": 1642532, "filter_size": 493704, "raw_key_size": 12037721, "raw_average_key_size": 87, "raw_value_size": 62982021, "raw_average_value_size": 458, "num_data_blocks": 17097, "num_entries": 137243, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:43.761758 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17666: 135162 keys, 69051020 bytes
2019-04-07 04:07:43.761787 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602863761769, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17666, "file_size": 69051020, "table_properties": {"data_size": 67109890, "index_size": 1450482, "filter_size": 489661, "raw_key_size": 12376157, "raw_average_key_size": 91, "raw_value_size": 61429503, "raw_average_value_size": 454, "num_data_blocks": 16988, "num_entries": 135162, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:44.327604 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17667: 133378 keys, 69020961 bytes
2019-04-07 04:07:44.327626 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602864327612, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17667, "file_size": 69020961, "table_properties": {"data_size": 67110737, "index_size": 1425244, "filter_size": 483993, "raw_key_size": 12534775, "raw_average_key_size": 93, "raw_value_size": 61140547, "raw_average_value_size": 458, "num_data_blocks": 17387, "num_entries": 133378, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:45.044919 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17668: 396785 keys, 69611983 bytes
2019-04-07 04:07:45.044942 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865044930, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17668, "file_size": 69611983, "table_properties": {"data_size": 67109377, "index_size": 1359121, "filter_size": 1142498, "raw_key_size": 16160193, "raw_average_key_size": 40, "raw_value_size": 57883184, "raw_average_value_size": 145, "num_data_blocks": 16787, "num_entries": 396785, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "0"}}
2019-04-07 04:07:45.363895 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1116] [default] [JOB 6777] Generated table #17669: 438661 keys, 14797153 bytes
2019-04-07 04:07:45.363918 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865363906, "cf_name": "default", "job": 6777, "event": "table_file_creation", "file_number": 17669, "file_size": 14797153, "table_properties": {"data_size": 13573795, "index_size": 95878, "filter_size": 1126494, "raw_key_size": 7895898, "raw_average_key_size": 18, "raw_value_size": 7018576, "raw_average_value_size": 16, "num_data_blocks": 3324, "num_entries": 438661, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "0", "kMergeOperands": "1"}}
2019-04-07 04:07:45.363942 7fba230c1700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:1173] [default] [JOB 6777] Compacted 1@1 + 9@2 files to L2 => 568751292 bytes
2019-04-07 04:07:45.364584 7fba230c1700  4 rocksdb: (Original Log Time 2019/04/07-04:07:45.364412) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/compaction_job.cc:621] [default] compacted to: base level 1 max bytes base 268435456 files[0 0 9 0 0 0 0] max score 0.21, MB/sec: 110.7 rd, 104.5 wr, level 2, files in(1, 9) out(9) MB in(32.2, 542.3) out(542.4), read-write-amplify(34.7) write-amplify(16.8) OK, records in: 2906082, records dropped: 950676

2019-04-07 04:07:45.364591 7fba230c1700  4 rocksdb: (Original Log Time 2019/04/07-04:07:45.364429) EVENT_LOG_v1 {"time_micros": 1554602865364421, "job": 6777, "event": "compaction_finished", "compaction_time_micros": 5441771, "output_level": 2, "num_output_files": 9, "total_output_size": 568751292, "num_input_records": 2889158, "num_output_records": 1938482, "num_subcompactions": 1, "num_single_delete_mismatches": 0, "num_single_delete_fallthrough": 0, "lsm_state": [0, 0, 9, 0, 0, 0, 0]}
2019-04-07 04:07:45.364621 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364618, "job": 6777, "event": "table_file_deletion", "file_number": 17660}
2019-04-07 04:07:45.364634 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364632, "job": 6777, "event": "table_file_deletion", "file_number": 17633}
2019-04-07 04:07:45.364641 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364640, "job": 6777, "event": "table_file_deletion", "file_number": 17632}
2019-04-07 04:07:45.364648 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364646, "job": 6777, "event": "table_file_deletion", "file_number": 17631}
2019-04-07 04:07:45.364655 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364653, "job": 6777, "event": "table_file_deletion", "file_number": 17630}
2019-04-07 04:07:45.364662 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364661, "job": 6777, "event": "table_file_deletion", "file_number": 17629}
2019-04-07 04:07:45.364668 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364666, "job": 6777, "event": "table_file_deletion", "file_number": 17628}
2019-04-07 04:07:45.364673 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364672, "job": 6777, "event": "table_file_deletion", "file_number": 17627}
2019-04-07 04:07:45.364682 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364681, "job": 6777, "event": "table_file_deletion", "file_number": 17626}
2019-04-07 04:07:45.364690 7fba230c1700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554602865364688, "job": 6777, "event": "table_file_deletion", "file_number": 17625}
2019-04-07 04:13:57.436121 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:684] reusing log 17656 from recycle list

2019-04-07 04:13:57.436193 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:725] [default] New memtable created with log file: #17670. Immutable memtables: 0.

2019-04-07 04:13:57.436823 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:13:57.436219) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:1158] Calling FlushMemTableToOutputFile with column family [default], flush slots available 1, compaction slots allowed 1, compaction slots scheduled 1
2019-04-07 04:13:57.436830 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:264] [default] [JOB 6778] Flushing memtable with next log file: 17670

2019-04-07 04:13:57.436841 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554603237436835, "job": 6778, "event": "flush_started", "num_memtables": 1, "num_entries": 324345, "num_deletes": 71811, "memory_usage": 260051432}
2019-04-07 04:13:57.436844 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:293] [default] [JOB 6778] Level-0 flush table #17671: started
2019-04-07 04:13:57.576432 7fba0e097700  4 rocksdb: EVENT_LOG_v1 {"time_micros": 1554603237576418, "cf_name": "default", "job": 6778, "event": "table_file_creation", "file_number": 17671, "file_size": 10994576, "table_properties": {"data_size": 10476335, "index_size": 160495, "filter_size": 356757, "raw_key_size": 7048685, "raw_average_key_size": 52, "raw_value_size": 8119636, "raw_average_value_size": 60, "num_data_blocks": 2614, "num_entries": 133225, "filter_policy_name": "rocksdb.BuiltinBloomFilter", "kDeletedKeys": "71818", "kMergeOperands": "2631"}}
2019-04-07 04:13:57.576442 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/flush_job.cc:319] [default] [JOB 6778] Level-0 flush table #17671: 10994576 bytes OK
2019-04-07 04:13:57.576892 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:242] adding log 17658 to recycle list

2019-04-07 04:13:57.576898 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:13:57.576448) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:360] [default] Level-0 commit table #17671 started
2019-04-07 04:13:57.576901 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:13:57.576861) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/memtable_list.cc:383] [default] Level-0 commit table #17671: memtable #1 done
2019-04-07 04:13:57.576903 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:13:57.576870) EVENT_LOG_v1 {"time_micros": 1554603237576865, "job": 6778, "event": "flush_finished", "lsm_state": [1, 0, 9, 0, 0, 0, 0], "immutable_memtables": 0}
2019-04-07 04:13:57.576905 7fba0e097700  4 rocksdb: (Original Log Time 2019/04/07-04:13:57.576885) [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_compaction_flush.cc:132] [default] Level summary: base level 1 max bytes base 268435456 files[1 0 9 0 0 0 0] max score 0.25

2019-04-07 04:13:57.576908 7fba0e097700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_files.cc:388] [JOB 6778] Try to delete WAL files size 253685940, prev total WAL file size 253731998, number of live WAL files 2.

2019-04-07 04:21:13.590940 7fba248c4700  4 rocksdb: [/builddir/build/BUILD/ceph-12.2.8/src/rocksdb/db/db_impl_write.cc:684] reusing log 17658 from recycle list

= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = 


And finally,  the "spikes" I'm talking about looks like this, https://imgur.com/rAbUycf and is gathered with the following query in grafana, 
"""" max( irate(node_disk_write_time_ms{device=~"dm-.*",instance=~"[[osd_servers]].*"}[30s]) / irate(node_disk_writes_completed{device=~"dm-.*",instance=~"[[osd_servers]].*"}[30s]) ) by(device) """" 


Is this "normal" and thus something one not should worry about, or should one try to tune some parameters (and if so which one's) to get rid of them ? 
Non the less it's a bit "worrying" with these latency spikes.  

Any insight on this would be valuable. 

Ps. Sorry if this has been posted or asked before. I couldn't find any current threads/issues that seemed spot on to me.

Best Regards, 
Patrik Martinsson - Linux System Administrator
Sweden


_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux