Re: question on bluestore wal io behavior

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



thanks, the result with `bluefs preextend wal files = true` is as follow:

bstore_kv_sync 31622 [000] 279011.804497: block:block_rq_issue: 8,112
WS 0 () 31744816 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279011.817726: block:block_rq_issue: 8,112
WS 0 () 31744824 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279011.896340: block:block_rq_issue: 8,112
WS 0 () 31744824 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279011.913248: block:block_rq_issue: 8,112
WS 0 () 31744832 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279011.922135: block:block_rq_issue: 8,112
WS 0 () 31744840 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [009] 279012.066157: block:block_rq_issue: 8,112
WS 0 () 31744840 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.091929: block:block_rq_issue: 8,112
WS 0 () 31744848 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.104784: block:block_rq_issue: 8,112
WS 0 () 31744848 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.119474: block:block_rq_issue: 8,112
WS 0 () 31744856 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.177620: block:block_rq_issue: 8,112
WS 0 () 31744864 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.189050: block:block_rq_issue: 8,112
WS 0 () 31744864 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.222644: block:block_rq_issue: 8,112
WS 0 () 31744872 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.390916: block:block_rq_issue: 8,112
WS 0 () 31744880 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.405258: block:block_rq_issue: 8,112
WS 0 () 31744880 + 24 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.415339: block:block_rq_issue: 8,112
WS 0 () 31744896 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279012.477234: block:block_rq_issue: 8,112
WS 0 () 31744896 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [001] 279012.488351: block:block_rq_issue: 8,112
WS 0 () 31744904 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [002] 279012.519237: block:block_rq_issue: 8,112
WS 0 () 31744912 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [002] 279012.572239: block:block_rq_issue: 8,112
WS 0 () 31744912 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [002] 279012.588633: block:block_rq_issue: 8,112
WS 0 () 31744920 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279012.612379: block:block_rq_issue: 8,112
WS 0 () 31744928 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279012.966390: block:block_rq_issue: 8,112
WS 0 () 31744936 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [008] 279012.980747: block:block_rq_issue: 8,112
WS 0 () 31744936 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [001] 279012.990396: block:block_rq_issue: 8,112
WS 0 () 31744944 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [002] 279013.261043: block:block_rq_issue: 8,112
WS 0 () 31744952 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [002] 279013.272202: block:block_rq_issue: 8,112
WS 0 () 31744952 + 24 [bstore_kv_sync]
bstore_kv_sync 31622 [009] 279013.281706: block:block_rq_issue: 8,112
WS 0 () 31744968 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [009] 279013.342029: block:block_rq_issue: 8,112
WS 0 () 31744968 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [009] 279013.355062: block:block_rq_issue: 8,112
WS 0 () 31744976 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279013.375022: block:block_rq_issue: 8,112
WS 0 () 31744984 + 8 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279013.626828: block:block_rq_issue: 8,112
WS 0 () 31744984 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279013.647313: block:block_rq_issue: 8,112
WS 0 () 31744992 + 16 [bstore_kv_sync]
bstore_kv_sync 31622 [000] 279013.661472: block:block_rq_issue: 8,112
WS 0 () 31745000 + 8 [bstore_kv_sync]

2017-08-28 21:26 GMT+08:00 Sage Weil <sage@xxxxxxxxxxxx>:
> On Mon, 28 Aug 2017, zengran zhang wrote:
>> hi,
>>
>> we are testing bluestore and rocksdb, we observed that the wal block's
>> io is some odd... the iostate show that the wal block's `avgrq-sz` is
>> arountd 10, the perf record of the `block_rq_issue` captured as
>> follow:
>>
>> bstore_kv_sync 26357 [005] 177034.327414: block:block_rq_issue: 8,96
>> WS 0 () 53904888 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [006] 177034.348115: block:block_rq_issue: 8,96
>> WS 0 () 77472296 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [006] 177034.359633: block:block_rq_issue: 8,96
>> WS 0 () 53904888 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [003] 177034.364816: block:block_rq_issue: 8,96
>> WS 0 () 77472304 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [009] 177034.550656: block:block_rq_issue: 8,96
>> WS 0 () 53904896 + 80 [bstore_kv_sync]
>> bstore_kv_sync 26357 [009] 177034.556734: block:block_rq_issue: 8,96
>> WS 0 () 77472320 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [008] 177034.592900: block:block_rq_issue: 8,96
>> WS 0 () 53904968 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177034.598323: block:block_rq_issue: 8,96
>> WS 0 () 77472328 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [011] 177034.635053: block:block_rq_issue: 8,96
>> WS 0 () 53904976 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [012] 177034.640036: block:block_rq_issue: 8,96
>> WS 0 () 77472336 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [014] 177034.670053: block:block_rq_issue: 8,96
>> WS 0 () 53904976 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [015] 177034.681708: block:block_rq_issue: 8,96
>> WS 0 () 77472344 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [015] 177034.692890: block:block_rq_issue: 8,96
>> WS 0 () 53904976 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [015] 177034.698320: block:block_rq_issue: 8,96
>> WS 0 () 77472352 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [016] 177034.710399: block:block_rq_issue: 8,96
>> WS 0 () 53904984 + 32 [bstore_kv_sync]
>> bstore_kv_sync 26357 [016] 177034.740153: block:block_rq_issue: 8,96
>> WS 0 () 77472360 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [013] 177034.760271: block:block_rq_issue: 8,96
>> WS 0 () 53905008 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [015] 177034.931293: block:block_rq_issue: 8,96
>> WS 0 () 53905016 + 56 [bstore_kv_sync]
>> bstore_kv_sync 26357 [017] 177034.973679: block:block_rq_issue: 8,96
>> WS 0 () 77472376 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [018] 177035.001517: block:block_rq_issue: 8,96
>> WS 0 () 53905064 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [010] 177035.006956: block:block_rq_issue: 8,96
>> WS 0 () 77472384 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [019] 177035.018570: block:block_rq_issue: 8,96
>> WS 0 () 53905072 + 24 [bstore_kv_sync]
>> bstore_kv_sync 26357 [010] 177035.032255: block:block_rq_issue: 8,96
>> WS 0 () 77472392 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [012] 177035.157029: block:block_rq_issue: 8,96
>> WS 0 () 77472400 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [013] 177035.172142: block:block_rq_issue: 8,96
>> WS 0 () 53905096 + 48 [bstore_kv_sync]
>> bstore_kv_sync 26357 [014] 177035.182179: block:block_rq_issue: 8,96
>> WS 0 () 77472408 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [015] 177035.209644: block:block_rq_issue: 8,96
>> WS 0 () 53905136 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [016] 177035.215506: block:block_rq_issue: 8,96
>> WS 0 () 77472416 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [017] 177035.226028: block:block_rq_issue: 8,96
>> WS 0 () 53905144 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [018] 177035.232176: block:block_rq_issue: 8,96
>> WS 0 () 77472424 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.246060: block:block_rq_issue: 8,96
>> WS 0 () 53905144 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.257132: block:block_rq_issue: 8,96
>> WS 0 () 77472432 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.268880: block:block_rq_issue: 8,96
>> WS 0 () 53905144 + 24 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.344242: block:block_rq_issue: 8,96
>> WS 0 () 53905160 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.382172: block:block_rq_issue: 8,96
>> WS 0 () 77472448 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.393062: block:block_rq_issue: 8,96
>> WS 0 () 53905168 + 16 [bstore_kv_sync]
>> bstore_kv_sync 26357 [000] 177035.398896: block:block_rq_issue: 8,96
>> WS 0 () 77472456 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [017] 177035.420992: block:block_rq_issue: 8,96
>> WS 0 () 53905176 + 8 [bstore_kv_sync]
>> bstore_kv_sync 26357 [018] 177035.432239: block:block_rq_issue: 8,96
>> WS 0 () 77472464 + 8 [bstore_kv_sync]
>>
>> the io request on LBA 77xxx is always 8 sectors, and the the io
>> request on LBA 53xxx had some overlay areas...
>> my understand of the two sequent address (77xxx and 53xxx) is bluefs's
>> log and rocksdb'wal, why they alternate one by one so strictly...
>>
>> we think this behavior may hurt the performance, hope anyone who could
>> give an analysis, thanks so much!
>
> This should go away (the bluefs log) once the wal files have been written
> through once.  There is an experimental bluefs option
>
>  bluefs preextend wal files = true
>
> that will avoid this for the purposes of benchmarking, but do not use it
> on a production OSD because it is not fully crash safe.
>
> sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux