Yes, for now I'd personally prefer 60-64 GB per DB volume unless
one is unable to allocate 300+ GB.
This is 2x times larger than your DBs keep right now (and which
is pretty inline with RocksDB Level 3 max size).
Thanks,
Igor
On 6/18/2019 9:30 PM, Brett Chancellor
wrote:
Thanks Igor. I'm fine turning the warnings off, but
it's curious that only this cluster is showing the alerts. Is
there any value in rebuilding the with smaller SSD meta data
volumes? Say 60GB or 30GB?
-Brett
Hi Brett,
this issue has been with you long before upgrade to
14.2.1. This upgrade just brought corresponding alert
visible.
You can turn the alert off by setting
bluestore_warn_on_bluefs_spillover=false.
But generally this warning shows DB data layout
inefficiency - some data is kept at slow device - which
might has some negative performance impact.
Unfortunately that's a know issue with current
RocksDB/BlueStore interaction - spillovers to slow device
might take place even when there is plenty of free space
at fast one.
Thanks,
Igor
On
6/18/2019 8:46 PM, Brett Chancellor wrote:
Does anybody have a fix for BlueFS
spillover detected? This started happening 2 days after
an upgrade to 14.2.1 and has increased from 3 OSDs to
118 in the last 4 days. I read you could fix it by
rebuilding the OSDs, but rebuilding the 264 OSDs on this
cluster will take months of rebalancing.
$ sudo ceph
health detail
HEALTH_WARN BlueFS spillover detected on 118 OSD(s)
BLUEFS_SPILLOVER BlueFS spillover detected on 118
OSD(s)
osd.0 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.1 spilled over 103 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.5 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.6 spilled over 64 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.11 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.13 spilled over 23 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.21 spilled over 102 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.22 spilled over 103 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.23 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.24 spilled over 25 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.25 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.26 spilled over 64 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.27 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.30 spilled over 65 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.32 spilled over 21 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.34 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.42 spilled over 25 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.45 spilled over 103 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.46 spilled over 24 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.47 spilled over 63 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.48 spilled over 63 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.49 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.50 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.52 spilled over 140 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.53 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.54 spilled over 59 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.55 spilled over 134 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.56 spilled over 19 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.57 spilled over 61 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.58 spilled over 66 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.59 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.61 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.62 spilled over 59 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.65 spilled over 19 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.67 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.69 spilled over 20 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.71 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.73 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.74 spilled over 17 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.75 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.76 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.78 spilled over 64 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.80 spilled over 100 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.81 spilled over 63 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.82 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.83 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.84 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.85 spilled over 19 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.87 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.89 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.93 spilled over 102 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.95 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.98 spilled over 19 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.101 spilled over 20 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.103 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.108 spilled over 20 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.110 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.112 spilled over 65 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.113 spilled over 64 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.115 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.117 spilled over 23 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.118 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.119 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.120 spilled over 65 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.121 spilled over 101 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.122 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.126 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.127 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.128 spilled over 23 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.129 spilled over 67 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.132 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.133 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.137 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.138 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.139 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.142 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.143 spilled over 27 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.144 spilled over 25 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.147 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.148 spilled over 96 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.157 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.158 spilled over 64 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.160 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.161 spilled over 61 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.167 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.177 spilled over 23 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.180 spilled over 140 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.185 spilled over 20 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.189 spilled over 62 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.190 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.192 spilled over 19 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.193 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.202 spilled over 21 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.207 spilled over 27 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.216 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.219 spilled over 59 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.220 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.221 spilled over 176 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.223 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.225 spilled over 22 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.226 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.228 spilled over 59 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.236 spilled over 65 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.237 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.238 spilled over 23 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.239 spilled over 103 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.240 spilled over 25 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.241 spilled over 26 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.242 spilled over 60 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.243 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.244 spilled over 103 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.245 spilled over 144 GiB metadata from 'db'
device (29 GiB used of 148 GiB) to slow device
osd.246 spilled over 25 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.247 spilled over 24 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.251 spilled over 106 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.252 spilled over 105 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.261 spilled over 20 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
osd.262 spilled over 22 GiB metadata from 'db'
device (28 GiB used of 148 GiB) to slow device
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
|
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com