Luminous OSD high mem usage cause OS die

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

    We just have a node die during a long recovery.   The
recovery(backfill) is from 3 Host , each host with 8 OSDs to 1 Host,
with 24 OSDs(crush chooseleaf type OSD), almost all data need to be
move and it already last for ~40 Hours before the target host crash.
Till that it is about 50% done, total data set is ~ 7 TB * 3 Replica.

    Despite the slow recovery speed (which is tracked in another
mailing thread), this time the target host (10.153.55.105) died due to
page allocation failure.  It looks like we run out all the memory and
some OSD on this Host died, which cause even more recovery /peering
and eaten more memory.

--------Some OSD log--------

2017-09-20 01:34:33.827924 7f82c127d700  1 osd.66 pg_epoch: 183092
pg[25.111d( v 170496'124722 (159408'121639,170496'124722]
local-lis/les=182588/182652 n=330 ec=112674/112674 lis/c 182588/181473
les/c/f 182652/181490/0 183092/183092/183092) [123,66,311] r=1
lpr=183092 pi=[181473,183092)/3 crt=170496'124722 lcod 0'0 unknown
NOTIFY] state<Start>: transitioning to Stray
2017-09-20 01:34:34.981430 7f82c6a88700  0 osd.66 183093 crush map has
features 720595732696154112, adjusting msgr requires for osds
2017-09-20 01:34:37.398096 7f82c6a88700  0 osd.66 183095 crush map has
features 720595732696154112, adjusting msgr requires for osds
2017-09-20 01:34:47.271077 7f82c6a88700  0 osd.66 183096 crush map has
features 720595732696154112, adjusting msgr requires for osds
2017-09-20 01:34:49.573938 7f82c6a88700  0 osd.66 183097 crush map has
features 720595732696154112, adjusting msgr requires for osds
2017-09-20 01:34:49.677852 7f82d4a50700  0 --
10.153.55.105:6820/5012957 >> 10.153.55.105:6870/2013287
conn(0x7f8424875800 :6820 s=STATE_ACCEPTING_WAIT_CONNECT_MSG_AUTH
pgs=0 cs=0 l=0).handle_connect_msg accept connect_seq 0 vs existing
csq=0 existing_state=STATE_CONNECTING
2017-09-20 01:34:50.879427 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.135:6840 osd.65 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879452 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.52.40:6885 osd.67 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879461 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6875 osd.105 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879467 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6822 osd.256 since back
2017-09-20 01:34:39.380397 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879473 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6835 osd.308 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879478 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6884 osd.311 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879482 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6845 osd.313 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879486 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6885 osd.318 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879491 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6841 osd.323 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879495 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6858 osd.329 since back
2017-09-20 01:34:49.183115 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879500 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6888 osd.332 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879506 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6896 osd.343 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879510 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6881 osd.414 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879515 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6851 osd.420 since back
2017-09-20 01:34:43.481058 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879521 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6893 osd.440 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:30.879423)
2017-09-20 01:34:50.879534 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6869 osd.542 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:30.879423)
2017-09-20 01:34:51.879795 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.135:6840 osd.65 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879828 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.52.40:6885 osd.67 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879832 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6875 osd.105 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879836 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6822 osd.256 since back
2017-09-20 01:34:39.380397 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879841 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6835 osd.308 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879844 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6884 osd.311 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879849 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6845 osd.313 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879853 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6885 osd.318 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879856 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6841 osd.323 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879859 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6858 osd.329 since back
2017-09-20 01:34:49.183115 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879863 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6888 osd.332 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879866 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6896 osd.343 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879869 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6881 osd.414 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879872 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6851 osd.420 since back
2017-09-20 01:34:43.481058 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879876 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6893 osd.440 ever on
either front or back, first ping sent 2017-09-20 01:34:30.677800
(cutoff 2017-09-20 01:34:31.879794)
2017-09-20 01:34:51.879879 7f82d129d700 -1 osd.66 183097
heartbeat_check: no reply from 10.153.55.105:6869 osd.542 since back
2017-09-20 01:34:30.677800 front 2017-09-20 01:34:30.677800 (cutoff
2017-09-20 01:34:31.879794)
2017-09-20 01:34:52.859141 7f82c6a88700  0 osd.66 183098 crush map has
features 720595732696154112, adjusting msgr requires for osds
2017-09-20 01:34:52.861665 7f82c1a7e700  1 osd.66 pg_epoch: 183098
pg[25.6fc( v 172737'68180 (157545'65180,172737'68180] lb
25:3f604bd7:::rbd_data.02058c74dc44db.0000000000001ace:head (bitwise)
local-lis/les=182580/182586 n=8 ec=112668/112668 lis/c 182824/172974
les/c/f 182825/172978/0 183098/183098/182910)
[323,123,268]/[323,137,321] r=-1 lpr=183098 pi=[172974,183098)/2
crt=172737'68180 remapped NOTIFY] start_peering_interval up [323,123]
-> [323,123,268], acting [323,137,321] -> [323,137,321],
acting_primary 323 -> 323, up_primary 323 -> 323, role -1 -> -1,
features acting 2305244844532236283 upacting 2305244844532236283
2017-09-20 01:34:52.862009 7f82c1a7e700  1 osd.66 pg_epoch: 183098
pg[25.6fc( v 172737'68180 (157545'65180,172737'68180] lb
25:3f604bd7:::rbd_data.02058c74dc44db.0000000000001ace:head (bitwise)
local-lis/les=182580/182586 n=8 ec=112668/112668 lis/c 182824/172974
les/c/f 182825/172978/0 183098/183098/182910)
[323,123,268]/[323,137,321] r=-1 lpr=183098 pi=[172974,183098)/2
crt=172737'68180 remapped NOTIFY] state<Start>: transitioning to Stray
2017-09-20 01:34:52.862660 7f82c127d700  1 osd.66 pg_epoch: 183098
pg[25.1765( v 170313'232833 (159408'229833,170313'232833] lb MIN
(bitwise) local-lis/les=182705/182719 n=0 ec=112676/112676 lis/c
182119/151043 les/c/f 182120/151044/0 183098/183098/159530)
[123,323,268]/[163,243,313] r=-1 lpr=183098 pi=[151043,183098)/2
crt=170313'232833 lcod 0'0 remapped NOTIFY] start_peering_interval up
[123,323] -> [123,323,268], acting [163,243,313] -> [163,243,313],
acting_primary 163 -> 163, up_primary 123 -> 123, role -1 -> -1,
features acting 2305244844532236283 upacting 2305244844532236283
2017-09-20 01:34:52.862718 7f82c127d700  1 osd.66 pg_epoch: 183098
pg[25.1765( v 170313'232833 (159408'229833,170313'232833] lb MIN
(bitwise) local-lis/les=182705/182719 n=0 ec=112676/112676 lis/c
182119/151043 les/c/f 182120/151044/0 183098/183098/159530)
[123,323,268]/[163,243,313] r=-1 lpr=183098 pi=[151043,183098)/2
crt=170313'232833 lcod 0'0 remapped NOTIFY] state<Start>:
transitioning to Stray
2017-09-20 01:34:52.865002 7f82c127d700  1 osd.66 pg_epoch: 183098
pg[25.1277( v 170886'428486 (164923'425486,170886'428486] lb
25:ee485f19:::rbd_data.02079821c85ca0.0000000000001a60:head (bitwise)
local-lis/les=182474/182495 n=17 ec=112674/112674 lis/c 183087/174035
les/c/f 183089/174039/0 183098/183098/183067) [414,268,66]/[414,111]
r=-1 lpr=183098 pi=[174035,183098)/3 luod=0'0 crt=170886'428486
active+remapped] start_peering_interval up [414,66] -> [414,268,66],
acting [414,111] -> [414,111], acting_primary 414 -> 414, up_primary
414 -> 414, role -1 -> -1, features acting 2305244844532236283
upacting 2305244844532236283
2017-09-20 01:34:52.865190 7f82c1a7e700  1 osd.66 pg_epoch: 183098
pg[25.35a( v 170244'21010 (147357'18010,170244'21010] lb
25:5ac00a15:::rbd_data.0209c81be3e81c.00000000000005e8:8cf (bitwise)
local-lis/les=176548/176551 n=1 ec=112666/112666 lis/c 176796/157467
les/c/f 176798/157468/0 183098/183098/159530) [318,313,268]/[131,275]
r=-1 lpr=183098 pi=[157467,183098)/1 crt=170244'21010 lcod 0'0
remapped NOTIFY] start_peering_interval up [318,313] -> [318,313,268],
acting [131,275] -> [131,275], acting_primary 131 -> 131, up_primary
318 -> 318, role -1 -> -1, features acting 2305244844532236283
upacting 2305244844532236283
2017-09-20 01:34:52.865289 7f82c1a7e700  1 osd.66 pg_epoch: 183098
pg[25.35a( v 170244'21010 (147357'18010,170244'21010] lb
25:5ac00a15:::rbd_data.0209c81be3e81c.00000000000005e8:8cf (bitwise)
local-lis/les=176548/176551 n=1 ec=112666/112666 lis/c 176796/157467
les/c/f 176798/157468/0 183098/183098/159530) [318,313,268]/[131,275]
r=-1 lpr=183098 pi=[157467,183098)/1 crt=170244'21010 lcod 0'0
remapped NOTIFY] state<Start>: transitioning to Stray
2017-09-20 01:34:52.865295 7f82c127d700  1 osd.66 pg_epoch: 183098
pg[25.1277( v 170886'428486 (164923'425486,170886'428486] lb
25:ee485f19:::rbd_data.02079821c85ca0.0000000000001a60:head (bitwise)
local-lis/les=182474/182495 n=17 ec=112674/112674 lis/c 183087/174035
les/c/f 183089/174039/0 183098/183098/183067) [414,268,66]/[414,111]
r=-1 lpr=183098 pi=[174035,183098)/3 crt=170886'428486 remapped
NOTIFY] state<Start>: transitioning to Stray
2017-09-20 01:34:52.866938 7f82c127d700  1 osd.66 pg_epoch: 183098
pg[25.27c( v 170291'47116 (158296'44074,170291'47116]
local-lis/les=169174/169175 n=258 ec=112666/112666 lis/c 169174/169174
les/c/f 169175/169188/0 183087/183087/183087) [66,440] r=0 lpr=183087
pi=[169174,183087)/8 crt=170291'47116 lcod 0'0 mlcod 0'0 down]
state<Started/Primary/Peering>: Peering, affected_by_map, going to
Reset




------Kern.log ----------------
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779332] swapper/28: page
allocation failure: order:0, mode:0x20
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779339] CPU: 28 PID: 0
Comm: swapper/28 Not tainted 3.13.0-24-generic #47-Ubuntu
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779341] Hardware name:
Dell DCS8420G/0C6G21, BIOS 1.1.3 02/06/2015
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779343]
0000000000000000 ffff88207fd83b58 ffffffff81715ac4 0000000000000020
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779350]
ffff88207fd83be0 ffffffff81153aab 00000000ffffffff ffff88207fff9e08
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779354]
0000000200000001 ffff88207fd977e0 ffff88207fd977f0 ffff88207fff9e00
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779359] Call Trace:
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779361]  <IRQ>
[<ffffffff81715ac4>] dump_stack+0x45/0x56
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779379]
[<ffffffff81153aab>] warn_alloc_failed+0xeb/0x140
Sep 19 22:45:10 slx03c-6rqx kernel: [16500140.779385]
[<ffffffff81158105>] __alloc_pages_nodemask+0x865/0xa20
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779389]
[<ffffffff81195413>] alloc_pages_current+0xa3/0x160
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779395]
[<ffffffff8160966c>] __netdev_alloc_frag+0x8c/0x140
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779399]
[<ffffffff8160b707>] __netdev_alloc_skb+0x77/0xc0
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779419]
[<ffffffffa007b9e4>] ixgbe_clean_rx_irq+0xf4/0xb10 [ixgbe]
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779425]
[<ffffffffa007d140>] ixgbe_poll+0x460/0x800 [ixgbe]
Sep 19 22:45:11 slx03c-6rqx kernel: [16500140.779431]
[<ffffffff8161aee2>] net_rx_action+0x152/0x250
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779439]
[<ffffffff8106caec>] __do_softirq+0xec/0x2c0
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779443]
[<ffffffff8106d035>] irq_exit+0x105/0x110
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779452]
[<ffffffff817287d6>] do_IRQ+0x56/0xc0
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779461]
[<ffffffff8171df6d>] common_interrupt+0x6d/0x6d
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779462]  <EOI>
[<ffffffff815c95e2>] ? cpuidle_enter_state+0x52/0xc0
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779476]
[<ffffffff815c9709>] cpuidle_idle_call+0xb9/0x1f0
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779484]
[<ffffffff8101ceae>] arch_cpu_idle+0xe/0x30
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779489]
[<ffffffff810beb85>] cpu_startup_entry+0xc5/0x290
Sep 19 22:45:12 slx03c-6rqx kernel: [16500140.779497]
[<ffffffff81040fc8>] start_secondary+0x218/0x2c0
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux