Re: Ceph capacity versus pool replicated size discrepancy?

Konstantin Shalygin <k0ste@xxxxxxxx> · Wed, 14 Aug 2019 10:05:36 +0700

        Hey guys, this is probably a really silly question, but I’m trying to reconcile where all of my space has gone in one cluster that I am responsible for.

The cluster is made up of 36 2TB SSDs across 3 nodes (12 OSDs per node), all using FileStore on XFS.  We are running Ceph Luminous 12.2.8 on this particular cluster. The only pool where data is heavily stored is the “rbd” pool, of which 7.09TiB is consumed.  With a replication of “3”, I would expect that the raw used to be close to 21TiB, but it’s actually closer to 35TiB.  Some additional details are below.  Any thoughts?

[cluster] root at dashboard:~# ceph df
GLOBAL:
    SIZE        AVAIL       RAW USED     %RAW USED
    62.8TiB     27.8TiB      35.1TiB         55.81
POOLS:
    NAME                           ID     USED        %USED     MAX AVAIL     OBJECTS
    rbd                            0      7.09TiB     53.76       6.10TiB     3056783
    data                           3      29.4GiB      0.47       6.10TiB        7918
    metadata                       4      57.2MiB         0       6.10TiB          95
    .rgw.root                      5      1.09KiB         0       6.10TiB           4
    default.rgw.control            6           0B         0       6.10TiB           8
    default.rgw.meta               7           0B         0       6.10TiB           0
    default.rgw.log                8           0B         0       6.10TiB         207
    default.rgw.buckets.index      9           0B         0       6.10TiB           0
    default.rgw.buckets.data       10          0B         0       6.10TiB           0
    default.rgw.buckets.non-ec     11          0B         0       6.10TiB           0

[cluster] root at dashboard:~# ceph --version
ceph version 12.2.8 (ae699615bac534ea496ee965ac6192cb7e0e07c0) luminous (stable)

[cluster] root at dashboard:~# ceph osd dump | grep 'replicated size'
pool 0 'rbd' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 682 pgp_num 682 last_change 414873 flags hashpspool min_write_recency_for_promote 1 stripe_width 0 application rbd
pool 3 'data' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 682 pgp_num 682 last_change 409614 flags hashpspool crash_replay_interval 45 min_write_recency_for_promote 1 stripe_width 0 application cephfs
pool 4 'metadata' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 682 pgp_num 682 last_change 409617 flags hashpspool min_write_recency_for_promote 1 stripe_width 0 application cephfs
pool 5 '.rgw.root' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409710 lfor 0/336229 flags hashpspool stripe_width 0 application rgw
pool 6 'default.rgw.control' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409711 lfor 0/336232 flags hashpspool stripe_width 0 application rgw
pool 7 'default.rgw.meta' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409713 lfor 0/336235 flags hashpspool stripe_width 0 application rgw
pool 8 'default.rgw.log' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409712 lfor 0/336238 flags hashpspool stripe_width 0 application rgw
pool 9 'default.rgw.buckets.index' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409714 lfor 0/336241 flags hashpspool stripe_width 0 application rgw
pool 10 'default.rgw.buckets.data' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409715 lfor 0/336244 flags hashpspool stripe_width 0 application rgw
pool 11 'default.rgw.buckets.non-ec' replicated size 3 min_size 1 crush_rule 0 object_hash rjenkins pg_num 409 pgp_num 409 last_change 409716 lfor 0/336247 flags hashpspool stripe_width 0 application rgw

[cluster] root at dashboard:~# ceph osd lspools
0 rbd,3 data,4 metadata,5 .rgw.root,6 default.rgw.control,7 default.rgw.meta,8 default.rgw.log,9 default.rgw.buckets.index,10 default.rgw.buckets.data,11 default.rgw.buckets.non-ec,

[cluster] root at dashboard:~# rados df
POOL_NAME                  USED    OBJECTS CLONES  COPIES  MISSING_ON_PRIMARY UNFOUND DEGRADED RD_OPS      RD      WR_OPS      WR
.rgw.root                  1.09KiB       4       0      12                  0       0        0          12    8KiB           0      0B
data                       29.4GiB    7918       0   23754                  0       0        0     1414777 3.74TiB     3524833 4.54TiB
default.rgw.buckets.data        0B       0       0       0                  0       0        0           0      0B           0      0B
default.rgw.buckets.index       0B       0       0       0                  0       0        0           0      0B           0      0B
default.rgw.buckets.non-ec      0B       0       0       0                  0       0        0           0      0B           0      0B
default.rgw.control             0B       8       0      24                  0       0        0           0      0B           0      0B
default.rgw.log                 0B     207       0     621                  0       0        0    21644149 20.6GiB    14422618      0B
default.rgw.meta                0B       0       0       0                  0       0        0           0      0B           0      0B
metadata                   57.2MiB      95       0     285                  0       0        0         780  189MiB       86885  476MiB
rbd                        7.09TiB 3053998 1539909 9161994                  0       0        0 23432304830 1.07PiB 11174458128  232TiB

total_objects    3062230
total_used       35.0TiB
total_avail      27.8TiB
total_space      62.8TiB

[cluster] root at dashboard:~# for pool in `rados lspools`; do echo $pool; ceph osd pool get $pool size; echo; done
rbd
size: 3
data
size: 3
metadata
size: 3
.rgw.root
size: 3
default.rgw.control
size: 3
default.rgw.meta
size: 3
default.rgw.log
size: 3
default.rgw.buckets.index
size: 3
default.rgw.buckets.data
size: 3
default.rgw.buckets.non-ec
size: 3

    Your rbd pool have clones. Lookup to rbd
        snapshots.

    k

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com