Size of RBD images

nicolasc <nicolas.canceill@xxxxxxxxxxx> · Tue, 19 Nov 2013 15:20:14 +0100

Hi every one,

In the course of playing with RBD, I noticed a few things:

* The RBD images are so thin-provisioned, you can create arbitrarily 
large ones.
  On my 0.72.1 freshly-installed empty 200TB cluster, I was able to 
create a 1PB image:

  $ rbd create --image-format 2 --size 1073741824 test_img

  This command is successful, and I can check the image status:

  $ rbd info test
  rbd image 'test':
    size 1024 TB in 268435456 objects
    order 22 (4096 kB objects)
    block_name_prefix: rbd_data.19f76b8b4567
    format: 2
    features: layering

* Such an oversized image seems unmountable on my 3.2.46 kernel.
  However the error message is not very explicit:

  $ rbd map test_img
  rbd: add failed: (2) No such file or directory

  There is no error or explanation to be seen anywhere in the logs.
  dmesg reports the connection to the cluster through RBD as usual, and 
that's it.
  Using the exact same commands with image size 32GB will successfully 
map the device.

* Such an oversize image takes an awfully long time to shrink or remove.
  However, the image has just been created and is empty.
  In RADOS, I only see the corresponding rbd_id and rbd_header, but no 
data object at all.
  Still, removing the 1PB image takes roughly 8 hours.

Cluster config:
3 mons, 8nodes * 72osds, about 4800pgs (2400pgs in pool "rbd")
cluster and public network are 10GbE, each node has 8 cores and 64GB mem

Oh, so my questions:
- why is it possible to create an image five times the size of the 
cluster without warning?
- where could this "No such file" error come from?
- why does it take long to shrink/delete a 
large-but-empty-and-thin-provisioned image?

I know that 1PB is oversized ("No such file" when trying to map), and 
32GB is not, so I am currently looking for the oversize threshold. More 
info coming soon.

Best regards,

Nicolas Canceill
Scalable Storage Systems
SURFsara (Amsterdam, NL)

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com