ceph-dis prepare : UUID=00000000-0000-0000-0000-000000000000

SCHAER Frederic <frederic.schaer@xxxxxx> · Thu, 9 Oct 2014 10:24:13 +0000

Hi,

I am setting up a test ceph cluster, on decommissioned  hardware (hence : not optimal, I know).
I have installed CentOS7, installed and setup ceph mons and OSD machines using puppet, and now I’m trying to add OSDs with the servers OSD disks… and I have issues (of course ;) )
I used the Ceph RHEL7 RPMs (ceph-0.80.6-0.el7.x86_64)

When I run “ceph-disk prepare” for a disk, I most of the time (but not always) get the partitions created, but not activated :

[root@ceph4 ~]# ceph-disk list|grep sdh
WARNING:ceph-disk:Old blkid does not support ID_PART_ENTRY_* fields, trying sgdisk; may not correctly identify ceph volumes with dmcrypt
/dev/sdh :
/dev/sdh1 ceph data, prepared, cluster ceph, journal /dev/sdh2
/dev/sdh2 ceph journal, for /dev/sdh1

I tried to debug udev rules thinking they were not launched to activate the OSD, but they are, and they fail on this error :

+ ln -sf ../../sdh2 /dev/disk/by-partuuid/5b3bde8f-ccad-4093-a8a5-ad6413ae8931
+ mkdir -p /dev/disk/by-parttypeuuid
+ ln -sf ../../sdh2 /dev/disk/by-parttypeuuid/45b0969e-9b03-4f30-b4c6-b4b80ceff106.5b3bde8f-ccad-4093-a8a5-ad6413ae8931
+ case $ID_PART_ENTRY_TYPE in
+ /usr/sbin/ceph-disk -v activate-journal /dev/sdh2
INFO:ceph-disk:Running command: /usr/bin/ceph-osd -i 0 --get-journal-uuid --osd-journal /dev/sdh2
SG_IO: bad/missing sense data, sb[]:  70 00 05 00 00 00 00 0b 00 00 00 00 20 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
DEBUG:ceph-disk:Journal /dev/sdh2 has OSD UUID 00000000-0000-0000-0000-000000000000
INFO:ceph-disk:Running command: /sbin/blkid -p -s TYPE -ovalue -- /dev/disk/by-partuuid/00000000-0000-0000-0000-000000000000
error: /dev/disk/by-partuuid/00000000-0000-0000-0000-000000000000: No such file or directory
ceph-disk: Cannot discover filesystem type: device /dev/disk/by-partuuid/00000000-0000-0000-0000-000000000000: Command '/sbin/blkid' returned non-zero exit status 2
+ exit
+ exec

You’ll notice the zeroed UUID…
Because of this, I looked at the output of ceph-disk prepare, and saw that partx complains at the end (this is the partx –a command) :

Warning: The kernel is still using the old partition table.
The new table will be used at the next reboot.
The operation has completed successfully.
partx: /dev/sdh: error adding partitions 1-2

And indeed, running “partx –a /dev/sdh” does not change anything.
But I just discovered that running “partx –u /dev/sdh” will fix everything ….????
I.e : right after I send this update command to the kernel, my debug logs show that the udev rule does everything fine and the OSD starts up.

I’m therefore wondering what I did wrong ?
is this CentOS 7 that is misbehaving, or the kernel, or…?
Any reason why partx –a is used instead of partx –u ?

I’d be glad to hear others advice on this !
Thanks && regards

Frederic Schaer

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com