Re: Possibly a bug on rocksdb

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Samuel,

You can use https://tracker.ceph.com/issues/41211 to provide the
information that Brad requested, along with debug_osd=20, using
debug_rocksdb=20 and debug_bluestore=20 might be useful.

Thanks,
Neha



On Sun, Aug 11, 2019 at 4:18 PM Brad Hubbard <bhubbard@xxxxxxxxxx> wrote:
>
> Could you create a tracker for this?
>
> Also, if you can reproduce this could you gather a log with
> debug_osd=20 ? That should show us the superblock it was trying to
> decode as well as additional details.
>
> On Mon, Aug 12, 2019 at 6:29 AM huxiaoyu@xxxxxxxxxxxx
> <huxiaoyu@xxxxxxxxxxxx> wrote:
> >
> > Dear folks,
> >
> > I had an OSD down, not because of a bad disk, but most likely a bug hit on Rockdb. Any one had similar issue?
> >
> > I am using Luminous 12.2.12 version. Log attached below
> >
> > thanks,
> > Samuel
> >
> > ******************************************************************************
> > [root@horeb72 ceph]# head -400 ceph-osd.4.log
> > 2019-08-11 07:30:02.186519 7f69bd020700  0 -- 192.168.10.72:6805/5915 >> 192.168.10.73:6801/4096 conn(0x56549cfc0800 :6805 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 15 vs existing csq=15 existing_state=STATE_STANDBY
> > 2019-08-11 07:30:02.186871 7f69bd020700  0 -- 192.168.10.72:6805/5915 >> 192.168.10.73:6801/4096 conn(0x56549cfc0800 :6805 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 16 vs existing csq=15 existing_state=STATE_STANDBY
> > 2019-08-11 07:30:02.242291 7f69bc81f700  0 -- 192.168.10.72:6805/5915 >> 192.168.10.71:6805/5046 conn(0x5654b93ed000 :6805 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 15 vs existing csq=15 existing_state=STATE_STANDBY
> > 2019-08-11 07:30:02.242554 7f69bc81f700  0 -- 192.168.10.72:6805/5915 >> 192.168.10.71:6805/5046 conn(0x5654b93ed000 :6805 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 16 vs existing csq=15 existing_state=STATE_STANDBY
> > 2019-08-11 07:30:02.260295 7f69bc81f700  0 -- 192.168.10.72:6805/5915 >> 192.168.10.73:6806/4864 conn(0x56544de16800 :6805 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 15 vs existing csq=15 existing_state=STATE_CONNECTING_WAIT_CONNECT_REPLY
> > 2019-08-11 17:11:01.968247 7ff4822f1d80 -1 WARNING: the following dangerous and experimental features are enabled: bluestore,rocksdb
> > 2019-08-11 17:11:01.968333 7ff4822f1d80  0 ceph version 12.2.12 (1436006594665279fe734b4c15d7e08c13ebd777) luminous (stable), process ceph-osd, pid 1048682
> > 2019-08-11 17:11:01.970611 7ff4822f1d80  0 pidfile_write: ignore empty --pid-file
> > 2019-08-11 17:11:01.991542 7ff4822f1d80 -1 WARNING: the following dangerous and experimental features are enabled: bluestore,rocksdb
> > 2019-08-11 17:11:01.997597 7ff4822f1d80  0 load: jerasure load: lrc load: isa
> > 2019-08-11 17:11:01.997710 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> > 2019-08-11 17:11:01.997723 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> > 2019-08-11 17:11:01.998127 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> > 2019-08-11 17:11:01.998231 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) close
> > 2019-08-11 17:11:02.265144 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> > 2019-08-11 17:11:02.265177 7ff4822f1d80  1 bdev(0x564774658a00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> > 2019-08-11 17:11:02.265695 7ff4822f1d80  1 bdev(0x564774658a00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> > 2019-08-11 17:11:02.266233 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block.db type kernel
> > 2019-08-11 17:11:02.266256 7ff4822f1d80  1 bdev(0x564774589a00 /var/lib/ceph/osd/ceph-4/block.db) open path /var/lib/ceph/osd/ceph-4/block.db
> > 2019-08-11 17:11:02.266812 7ff4822f1d80  1 bdev(0x564774589a00 /var/lib/ceph/osd/ceph-4/block.db) open size 29999759360 (0x6fc200000, 27.9GiB) block_size 4096 (4KiB) non-rotational
> > 2019-08-11 17:11:02.266998 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> > 2019-08-11 17:11:02.267015 7ff4822f1d80  1 bdev(0x564774659a00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> > 2019-08-11 17:11:02.267412 7ff4822f1d80  1 bdev(0x564774659a00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> > 2019-08-11 17:11:02.298355 7ff4822f1d80  0  set rocksdb option compaction_readahead_size = 2MB
> > 2019-08-11 17:11:02.298368 7ff4822f1d80  0  set rocksdb option compaction_style = kCompactionStyleLevel
> > 2019-08-11 17:11:02.299628 7ff4822f1d80  0  set rocksdb option compaction_threads = 32
> > 2019-08-11 17:11:02.299648 7ff4822f1d80  0  set rocksdb option compression = kNoCompression
> > 2019-08-11 17:11:02.299993 7ff4822f1d80  0  set rocksdb option flusher_threads = 8
> > 2019-08-11 17:11:02.300006 7ff4822f1d80  0  set rocksdb option level0_file_num_compaction_trigger = 64
> > 2019-08-11 17:11:02.300011 7ff4822f1d80  0  set rocksdb option level0_slowdown_writes_trigger = 128
> > 2019-08-11 17:11:02.300017 7ff4822f1d80  0  set rocksdb option level0_stop_writes_trigger = 256
> > 2019-08-11 17:11:02.300022 7ff4822f1d80  0  set rocksdb option max_background_compactions = 64
> > 2019-08-11 17:11:02.300027 7ff4822f1d80  0  set rocksdb option max_bytes_for_level_base = 2GB
> > 2019-08-11 17:11:02.300034 7ff4822f1d80  0  set rocksdb option max_write_buffer_number = 64
> > 2019-08-11 17:11:02.300039 7ff4822f1d80  0  set rocksdb option min_write_buffer_number_to_merge = 32
> > 2019-08-11 17:11:02.300044 7ff4822f1d80  0  set rocksdb option recycle_log_file_num = 64
> > 2019-08-11 17:11:02.300048 7ff4822f1d80  0  set rocksdb option target_file_size_base = 4MB
> > 2019-08-11 17:11:02.300053 7ff4822f1d80  0  set rocksdb option write_buffer_size = 4MB
> > 2019-08-11 17:11:02.300093 7ff4822f1d80  0  set rocksdb option compaction_readahead_size = 2MB
> > 2019-08-11 17:11:02.300103 7ff4822f1d80  0  set rocksdb option compaction_style = kCompactionStyleLevel
> > 2019-08-11 17:11:02.300110 7ff4822f1d80  0  set rocksdb option compaction_threads = 32
> > 2019-08-11 17:11:02.300121 7ff4822f1d80  0  set rocksdb option compression = kNoCompression
> > 2019-08-11 17:11:02.300129 7ff4822f1d80  0  set rocksdb option flusher_threads = 8
> > 2019-08-11 17:11:02.300135 7ff4822f1d80  0  set rocksdb option level0_file_num_compaction_trigger = 64
> > 2019-08-11 17:11:02.300142 7ff4822f1d80  0  set rocksdb option level0_slowdown_writes_trigger = 128
> > 2019-08-11 17:11:02.300146 7ff4822f1d80  0  set rocksdb option level0_stop_writes_trigger = 256
> > 2019-08-11 17:11:02.300150 7ff4822f1d80  0  set rocksdb option max_background_compactions = 64
> > 2019-08-11 17:11:02.300155 7ff4822f1d80  0  set rocksdb option max_bytes_for_level_base = 2GB
> > 2019-08-11 17:11:02.300159 7ff4822f1d80  0  set rocksdb option max_write_buffer_number = 64
> > 2019-08-11 17:11:02.300166 7ff4822f1d80  0  set rocksdb option min_write_buffer_number_to_merge = 32
> > 2019-08-11 17:11:02.300176 7ff4822f1d80  0  set rocksdb option recycle_log_file_num = 64
> > 2019-08-11 17:11:02.300185 7ff4822f1d80  0  set rocksdb option target_file_size_base = 4MB
> > 2019-08-11 17:11:02.300193 7ff4822f1d80  0  set rocksdb option write_buffer_size = 4MB
> > 2019-08-11 17:11:02.819067 7ff4822f1d80 -1 *** Caught signal (Aborted) **
> >  in thread 7ff4822f1d80 thread_name:ceph-osd
> >
> >  ceph version 12.2.12 (1436006594665279fe734b4c15d7e08c13ebd777) luminous (stable)
> >  1: (()+0xa64ee1) [0x56476aafbee1]
> >  2: (()+0xf6d0) [0x7ff47f5a16d0]
> >  3: (gsignal()+0x37) [0x7ff47e5c2277]
> >  4: (abort()+0x148) [0x7ff47e5c3968]
> >  5: (__gnu_cxx::__verbose_terminate_handler()+0x165) [0x7ff47eed17d5]
> >  6: (()+0x5e746) [0x7ff47eecf746]
> >  7: (()+0x5e773) [0x7ff47eecf773]
> >  8: (()+0x5e993) [0x7ff47eecf993]
> >  9: (()+0xa6f149) [0x56476ab06149]
> >  10: (decode(std::string&, ceph::buffer::list::iterator&)+0x53) [0x56476a765313]
> >  11: (OSDSuperblock::decode(ceph::buffer::list::iterator&)+0x70) [0x56476a7f23e0]
> >  12: (OSD::read_superblock()+0x193) [0x56476a567943]
> >  13: (OSD::init()+0x773) [0x56476a5b4eb3]
> >  14: (main()+0x2d07) [0x56476a4b7ef7]
> >  15: (__libc_start_main()+0xf5) [0x7ff47e5ae445]
> >  16: (()+0x4c0dc3) [0x56476a557dc3]
> >  NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
> >
> > --- begin dump of recent events ---
> >    -68> 2019-08-11 17:11:01.963739 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perfcounters_dump hook 0x5647742e4180
> >    -67> 2019-08-11 17:11:01.963761 7ff4822f1d80  5 asok(0x56477431e1c0) register_command 1 hook 0x5647742e4180
> >    -66> 2019-08-11 17:11:01.963765 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perf dump hook 0x5647742e4180
> >    -65> 2019-08-11 17:11:01.963767 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perfcounters_schema hook 0x5647742e4180
> >    -64> 2019-08-11 17:11:01.963772 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perf histogram dump hook 0x5647742e4180
> >    -63> 2019-08-11 17:11:01.963775 7ff4822f1d80  5 asok(0x56477431e1c0) register_command 2 hook 0x5647742e4180
> >    -62> 2019-08-11 17:11:01.963777 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perf schema hook 0x5647742e4180
> >    -61> 2019-08-11 17:11:01.963780 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perf histogram schema hook 0x5647742e4180
> >    -60> 2019-08-11 17:11:01.963788 7ff4822f1d80  5 asok(0x56477431e1c0) register_command perf reset hook 0x5647742e4180
> >    -59> 2019-08-11 17:11:01.963792 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config show hook 0x5647742e4180
> >    -58> 2019-08-11 17:11:01.963795 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config help hook 0x5647742e4180
> >    -57> 2019-08-11 17:11:01.963799 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config set hook 0x5647742e4180
> >    -56> 2019-08-11 17:11:01.963803 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config get hook 0x5647742e4180
> >    -55> 2019-08-11 17:11:01.963806 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config diff hook 0x5647742e4180
> >    -54> 2019-08-11 17:11:01.963809 7ff4822f1d80  5 asok(0x56477431e1c0) register_command config diff get hook 0x5647742e4180
> >    -53> 2019-08-11 17:11:01.963812 7ff4822f1d80  5 asok(0x56477431e1c0) register_command log flush hook 0x5647742e4180
> >    -52> 2019-08-11 17:11:01.963815 7ff4822f1d80  5 asok(0x56477431e1c0) register_command log dump hook 0x5647742e4180
> >    -51> 2019-08-11 17:11:01.963818 7ff4822f1d80  5 asok(0x56477431e1c0) register_command log reopen hook 0x5647742e4180
> >    -50> 2019-08-11 17:11:01.963827 7ff4822f1d80  5 asok(0x56477431e1c0) register_command dump_mempools hook 0x564774307ea8
> >    -49> 2019-08-11 17:11:01.968040 7ff4822f1d80 -1 WARNING: the following dangerous and experimental features are enabled: bluestore,rocksdb
> >    -48> 2019-08-11 17:11:01.968247 7ff4822f1d80 -1 WARNING: the following dangerous and experimental features are enabled: bluestore,rocksdb
> >    -47> 2019-08-11 17:11:01.968333 7ff4822f1d80  0 ceph version 12.2.12 (1436006594665279fe734b4c15d7e08c13ebd777) luminous (stable), process ceph-osd, pid 1048682
> >    -46> 2019-08-11 17:11:01.970611 7ff4822f1d80  0 pidfile_write: ignore empty --pid-file
> >    -45> 2019-08-11 17:11:01.991542 7ff4822f1d80 -1 WARNING: the following dangerous and experimental features are enabled: bluestore,rocksdb
> >    -44> 2019-08-11 17:11:01.997597 7ff4822f1d80  0 load: jerasure load: lrc load: isa
> >    -43> 2019-08-11 17:11:01.997710 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> >    -42> 2019-08-11 17:11:01.997723 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> >    -41> 2019-08-11 17:11:01.998127 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> >    -40> 2019-08-11 17:11:01.998231 7ff4822f1d80  1 bdev(0x564774656c00 /var/lib/ceph/osd/ceph-4/block) close
> >    -39> 2019-08-11 17:11:02.265144 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> >    -38> 2019-08-11 17:11:02.265177 7ff4822f1d80  1 bdev(0x564774658a00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> >    -37> 2019-08-11 17:11:02.265695 7ff4822f1d80  1 bdev(0x564774658a00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> >    -36> 2019-08-11 17:11:02.266233 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block.db type kernel
> >    -35> 2019-08-11 17:11:02.266256 7ff4822f1d80  1 bdev(0x564774589a00 /var/lib/ceph/osd/ceph-4/block.db) open path /var/lib/ceph/osd/ceph-4/block.db
> >    -34> 2019-08-11 17:11:02.266812 7ff4822f1d80  1 bdev(0x564774589a00 /var/lib/ceph/osd/ceph-4/block.db) open size 29999759360 (0x6fc200000, 27.9GiB) block_size 4096 (4KiB) non-rotational
> >    -33> 2019-08-11 17:11:02.266998 7ff4822f1d80  1 bdev create path /var/lib/ceph/osd/ceph-4/block type kernel
> >    -32> 2019-08-11 17:11:02.267015 7ff4822f1d80  1 bdev(0x564774659a00 /var/lib/ceph/osd/ceph-4/block) open path /var/lib/ceph/osd/ceph-4/block
> >    -31> 2019-08-11 17:11:02.267412 7ff4822f1d80  1 bdev(0x564774659a00 /var/lib/ceph/osd/ceph-4/block) open size 858887553024 (0xc7f9b00000, 800GiB) block_size 4096 (4KiB) non-rotational
> >    -30> 2019-08-11 17:11:02.298355 7ff4822f1d80  0  set rocksdb option compaction_readahead_size = 2MB
> >    -29> 2019-08-11 17:11:02.298368 7ff4822f1d80  0  set rocksdb option compaction_style = kCompactionStyleLevel
> >    -28> 2019-08-11 17:11:02.299628 7ff4822f1d80  0  set rocksdb option compaction_threads = 32
> >    -27> 2019-08-11 17:11:02.299648 7ff4822f1d80  0  set rocksdb option compression = kNoCompression
> >    -26> 2019-08-11 17:11:02.299993 7ff4822f1d80  0  set rocksdb option flusher_threads = 8
> >    -25> 2019-08-11 17:11:02.300006 7ff4822f1d80  0  set rocksdb option level0_file_num_compaction_trigger = 64
> >    -24> 2019-08-11 17:11:02.300011 7ff4822f1d80  0  set rocksdb option level0_slowdown_writes_trigger = 128
> >    -23> 2019-08-11 17:11:02.300017 7ff4822f1d80  0  set rocksdb option level0_stop_writes_trigger = 256
> >    -22> 2019-08-11 17:11:02.300022 7ff4822f1d80  0  set rocksdb option max_background_compactions = 64
> >    -21> 2019-08-11 17:11:02.300027 7ff4822f1d80  0  set rocksdb option max_bytes_for_level_base = 2GB
> >    -20> 2019-08-11 17:11:02.300034 7ff4822f1d80  0  set rocksdb option max_write_buffer_number = 64
> >    -19> 2019-08-11 17:11:02.300039 7ff4822f1d80  0  set rocksdb option min_write_buffer_number_to_merge = 32
> >    -18> 2019-08-11 17:11:02.300044 7ff4822f1d80  0  set rocksdb option recycle_log_file_num = 64
> >    -17> 2019-08-11 17:11:02.300048 7ff4822f1d80  0  set rocksdb option target_file_size_base = 4MB
> >    -16> 2019-08-11 17:11:02.300053 7ff4822f1d80  0  set rocksdb option write_buffer_size = 4MB
> >    -15> 2019-08-11 17:11:02.300093 7ff4822f1d80  0  set rocksdb option compaction_readahead_size = 2MB
> >    -14> 2019-08-11 17:11:02.300103 7ff4822f1d80  0  set rocksdb option compaction_style = kCompactionStyleLevel
> >    -13> 2019-08-11 17:11:02.300110 7ff4822f1d80  0  set rocksdb option compaction_threads = 32
> >    -12> 2019-08-11 17:11:02.300121 7ff4822f1d80  0  set rocksdb option compression = kNoCompression
> >    -11> 2019-08-11 17:11:02.300129 7ff4822f1d80  0  set rocksdb option flusher_threads = 8
> >    -10> 2019-08-11 17:11:02.300135 7ff4822f1d80  0  set rocksdb option level0_file_num_compaction_trigger = 64
> >     -9> 2019-08-11 17:11:02.300142 7ff4822f1d80  0  set rocksdb option level0_slowdown_writes_trigger = 128
> >     -8> 2019-08-11 17:11:02.300146 7ff4822f1d80  0  set rocksdb option level0_stop_writes_trigger = 256
> >     -7> 2019-08-11 17:11:02.300150 7ff4822f1d80  0  set rocksdb option max_background_compactions = 64
> >     -6> 2019-08-11 17:11:02.300155 7ff4822f1d80  0  set rocksdb option max_bytes_for_level_base = 2GB
> >     -5> 2019-08-11 17:11:02.300159 7ff4822f1d80  0  set rocksdb option max_write_buffer_number = 64
> >     -4> 2019-08-11 17:11:02.300166 7ff4822f1d80  0  set rocksdb option min_write_buffer_number_to_merge = 32
> >     -3> 2019-08-11 17:11:02.300176 7ff4822f1d80  0  set rocksdb option recycle_log_file_num = 64
> >     -2> 2019-08-11 17:11:02.300185 7ff4822f1d80  0  set rocksdb option target_file_size_base = 4MB
> >     -1> 2019-08-11 17:11:02.300193 7ff4822f1d80  0  set rocksdb option write_buffer_size = 4MB
> >      0> 2019-08-11 17:11:02.819067 7ff4822f1d80 -1 *** Caught signal (Aborted) **
> >  in thread 7ff4822f1d80 thread_name:ceph-osd
> > ________________________________
> > huxiaoyu@xxxxxxxxxxxx
> > _______________________________________________
> > ceph-users mailing list
> > ceph-users@xxxxxxxxxxxxxx
> > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
>
>
> --
> Cheers,
> Brad
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux