Using the "manual" method of creating an OSD on RHEL 7.1 with Ceph 94.2 turns up an issue with the ondisk fsid of the journal device. From a quick web search I've found reference to this exact same issue from earlier this year. Is there a version of Ceph that works with RHEL 7.1??? [root@ceph0 ceph]# ceph-disk-prepare --cluster ceph --cluster-uuid b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs /dev/sdc /dev/sdb1 WARNING:ceph-disk:OSD will not be hot-swappable if journal is not the same device as the osd data The operation has completed successfully. partx: /dev/sdc: error adding partition 1 meta-data=/dev/sdc1 isize=2048 agcount=4, agsize=244188597 blks = sectsz=512 attr=2, projid32bit=1 = crc=0 finobt=0 data = bsize=4096 blocks=976754385, imaxpct=5 = sunit=0 swidth=0 blks naming =version 2 bsize=4096 ascii-ci=0 ftype=0 log =internal log bsize=4096 blocks=476930, version=2 = sectsz=512 sunit=0 blks, lazy-count=1 realtime =none extsz=4096 blocks=0, rtextents=0 The operation has completed successfully. partx: /dev/sdc: error adding partition 1 [root@ceph0 ceph]# mkdir /var/lib/ceph/osd/ceph-0 [root@ceph0 ceph]# ll /var/lib/ceph/osd/ total 0 drwxr-xr-x. 2 root root 6 Jun 29 12:01 ceph-0 [root@ceph0 ceph]# mount -t xfs /dev/sdc1 /var/lib/ceph/osd/ceph-0/ [root@ceph0 ceph]# mount proc on /proc type proc (rw,nosuid,nodev,noexec,relatime) sysfs on /sys type sysfs (rw,nosuid,nodev,noexec,relatime,seclabel) devtmpfs on /dev type devtmpfs (rw,nosuid,seclabel,size=57648336k,nr_inodes=14412084,mode=755) securityfs on /sys/kernel/security type securityfs (rw,nosuid,nodev,noexec,relatime) tmpfs on /dev/shm type tmpfs (rw,nosuid,nodev,seclabel) devpts on /dev/pts type devpts (rw,nosuid,noexec,relatime,seclabel,gid=5,mode=620,ptmxmode=000) tmpfs on /run type tmpfs (rw,nosuid,nodev,seclabel,mode=755) tmpfs on /sys/fs/cgroup type tmpfs (rw,nosuid,nodev,noexec,seclabel,mode=755) cgroup on /sys/fs/cgroup/systemd type cgroup (rw,nosuid,nodev,noexec,relatime,xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd) pstore on /sys/fs/pstore type pstore (rw,nosuid,nodev,noexec,relatime) cgroup on /sys/fs/cgroup/cpuset type cgroup (rw,nosuid,nodev,noexec,relatime,cpuset) cgroup on /sys/fs/cgroup/cpu,cpuacct type cgroup (rw,nosuid,nodev,noexec,relatime,cpuacct,cpu) cgroup on /sys/fs/cgroup/memory type cgroup (rw,nosuid,nodev,noexec,relatime,memory) cgroup on /sys/fs/cgroup/devices type cgroup (rw,nosuid,nodev,noexec,relatime,devices) cgroup on /sys/fs/cgroup/freezer type cgroup (rw,nosuid,nodev,noexec,relatime,freezer) cgroup on /sys/fs/cgroup/net_cls type cgroup (rw,nosuid,nodev,noexec,relatime,net_cls) cgroup on /sys/fs/cgroup/blkio type cgroup (rw,nosuid,nodev,noexec,relatime,blkio) cgroup on /sys/fs/cgroup/perf_event type cgroup (rw,nosuid,nodev,noexec,relatime,perf_event) cgroup on /sys/fs/cgroup/hugetlb type cgroup (rw,nosuid,nodev,noexec,relatime,hugetlb) configfs on /sys/kernel/config type configfs (rw,relatime) /dev/mapper/rhel_ceph0-root on / type xfs (rw,relatime,seclabel,attr2,inode64,noquota) selinuxfs on /sys/fs/selinux type selinuxfs (rw,relatime) systemd-1 on /proc/sys/fs/binfmt_misc type autofs (rw,relatime,fd=35,pgrp=1,timeout=300,minproto=5,maxproto=5,direct) debugfs on /sys/kernel/debug type debugfs (rw,relatime) mqueue on /dev/mqueue type mqueue (rw,relatime,seclabel) hugetlbfs on /dev/hugepages type hugetlbfs (rw,relatime,seclabel) /dev/mapper/rhel_ceph0-home on /home type xfs (rw,relatime,seclabel,attr2,inode64,noquota) /dev/sda2 on /boot type xfs (rw,relatime,seclabel,attr2,inode64,noquota) binfmt_misc on /proc/sys/fs/binfmt_misc type binfmt_misc (rw,relatime) fusectl on /sys/fs/fuse/connections type fusectl (rw,relatime) /dev/sdc1 on /var/lib/ceph/osd/ceph-0 type xfs (rw,relatime,seclabel,attr2,inode64,noquota) [root@ceph0 ceph]# ceph-osd -i=0 --mkfs 2015-06-29 12:02:47.702808 7f2fb4625880 -1 journal FileJournal::_open: disabling aio for non-block journal. Use journal_force_aio to force use of aio anyway 2015-06-29 12:02:47.702851 7f2fb4625880 -1 journal check: ondisk fsid 00000000-0000-0000-0000-000000000000 doesn't match expected 7e792d5e-a5c6-40cd-a361-0457875ea92c, invalid (someone else's?) journal 2015-06-29 12:02:47.702876 7f2fb4625880 -1 filestore(/var/lib/ceph/osd/ceph-0) mkjournal error creating journal on /var/lib/ceph/osd/ceph-0/journal: (22) Invalid argument 2015-06-29 12:02:47.702890 7f2fb4625880 -1 OSD::mkfs: ObjectStore::mkfs failed with error -22 2015-06-29 12:02:47.702928 7f2fb4625880 -1 ** ERROR: error creating empty object store in /var/lib/ceph/osd/ceph-0: (22) Invalid argument [root@ceph0 ceph]# > -----Original Message----- > From: Bruce McFarland > Sent: Monday, June 29, 2015 11:39 AM > To: 'Loic Dachary'; ceph-users@xxxxxxxxxxxxxx > Subject: RE: RHEL 7.1 ceph-disk failures creating OSD with ver > 0.94.2 > > It doesn't appear to be related to using wwn's for the drive id. The verbose > output shows ceph converting from wwn to sd letter. I ran with verbose on > and used sd letters for the data drive and the journal and get the same > failures. I'm attempting to create OSD's manually now. > > [root@ceph0 ceph]# ceph-disk -v prepare --cluster ceph --cluster-uuid > b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs --zap-disk /dev/sdc > /dev/sdb1 DEBUG:ceph-disk:Zapping partition table on /dev/sdc INFO:ceph- > disk:Running command: /usr/sbin/sgdisk --zap-all -- /dev/sdc > Caution: invalid backup GPT header, but valid main header; regenerating > backup header from main header. > > Warning! Main and backup partition tables differ! Use the 'c' and 'e' options > on the recovery & transformation menu to examine the two tables. > > Warning! One or more CRCs don't match. You should repair the disk! > > ************************************************************** > ************** > Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT, but > disk verification and recovery are STRONGLY recommended. > ************************************************************** > ************** > GPT data structures destroyed! You may now partition the disk using fdisk or > other utilities. > INFO:ceph-disk:Running command: /usr/sbin/sgdisk --clear --mbrtogpt -- > /dev/sdc Creating new GPT entries. > The operation has completed successfully. > INFO:ceph-disk:calling partx on zapped device /dev/sdc INFO:ceph-disk:re- > reading known partitions will display errors INFO:ceph-disk:Running > command: /usr/sbin/partx -d /dev/sdc > partx: specified range <1:0> does not make sense INFO:ceph-disk:Running > command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup > osd_mkfs_options_xfs INFO:ceph-disk:Running command: /usr/bin/ceph- > conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs > INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph -- > name=osd. --lookup osd_mount_options_xfs INFO:ceph-disk:Running > command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup > osd_fs_mount_options_xfs INFO:ceph-disk:Running command: > /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size > INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph -- > name=osd. --lookup osd_cryptsetup_parameters INFO:ceph-disk:Running > command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup > osd_dmcrypt_key_size INFO:ceph-disk:Running command: /usr/bin/ceph- > conf --cluster=ceph --name=osd. --lookup osd_dmcrypt_type DEBUG:ceph- > disk:Journal is file /dev/sdb1 WARNING:ceph-disk:OSD will not be hot- > swappable if journal is not the same device as the osd data DEBUG:ceph- > disk:Creating osd partition on /dev/sdc INFO:ceph-disk:Running command: > /usr/sbin/sgdisk --largest-new=1 --change-name=1:ceph data --partition- > guid=1:6d05612e-5cc0-422c-9228-4e53ee0f27ac --typecode=1:89c57f98- > 2fe5-4dc0-89c1-f3ad0ceff2be -- /dev/sdc The operation has completed > successfully. > INFO:ceph-disk:calling partx on created device /dev/sdc INFO:ceph-disk:re- > reading known partitions will display errors INFO:ceph-disk:Running > command: /usr/sbin/partx -a /dev/sdc > partx: /dev/sdc: error adding partition 1 INFO:ceph-disk:Running command: > /usr/bin/udevadm settle DEBUG:ceph-disk:Creating xfs fs on /dev/sdc1 > INFO:ceph-disk:Running command: /usr/sbin/mkfs -t xfs -f -i size=2048 -- > /dev/sdc1 > meta-data=/dev/sdc1 isize=2048 agcount=4, agsize=244188597 > blks > = sectsz=512 attr=2, projid32bit=1 > = crc=0 finobt=0 > data = bsize=4096 blocks=976754385, imaxpct=5 > = sunit=0 swidth=0 blks > naming =version 2 bsize=4096 ascii-ci=0 ftype=0 > log =internal log bsize=4096 blocks=476930, version=2 > = sectsz=512 sunit=0 blks, lazy-count=1 > realtime =none extsz=4096 blocks=0, rtextents=0 > DEBUG:ceph-disk:Mounting /dev/sdc1 on /var/lib/ceph/tmp/mnt.DQ8nOj > with options noatime,inode64 INFO:ceph-disk:Running command: > /usr/bin/mount -t xfs -o noatime,inode64 -- /dev/sdc1 > /var/lib/ceph/tmp/mnt.DQ8nOj DEBUG:ceph-disk:Preparing osd data dir > /var/lib/ceph/tmp/mnt.DQ8nOj DEBUG:ceph-disk:Creating symlink > /var/lib/ceph/tmp/mnt.DQ8nOj/journal -> /dev/sdb1 DEBUG:ceph- > disk:Unmounting /var/lib/ceph/tmp/mnt.DQ8nOj INFO:ceph-disk:Running > command: /bin/umount -- /var/lib/ceph/tmp/mnt.DQ8nOj INFO:ceph- > disk:Running command: /usr/sbin/sgdisk --typecode=1:4fbd7e29-9d25-41b8- > afd0-062c0ceff05d -- /dev/sdc The operation has completed successfully. > INFO:ceph-disk:calling partx on prepared device /dev/sdc INFO:ceph-disk:re- > reading known partitions will display errors INFO:ceph-disk:Running > command: /usr/sbin/partx -a /dev/sdc > partx: /dev/sdc: error adding partition 1 > [root@ceph0 ceph]# > > > -----Original Message----- > > From: Loic Dachary [mailto:loic@xxxxxxxxxxx] > > Sent: Saturday, June 27, 2015 1:08 AM > > To: Bruce McFarland; ceph-users@xxxxxxxxxxxxxx > > Subject: Re: RHEL 7.1 ceph-disk failures creating OSD > > > > Hi Bruce, > > > > I think the problem comes from using /dev/disk/by-id/wwn- > > 0x500003959bd02f56 instead of /dev/sdw for the data disk, because > > ceph- disk has a device name parsing logic that works with /dev/XXX. > > Could you run the ceph-disk prepare command again with --verbose to > > confirm ? If that's the case there should be an error instead of what > > appears to be something that only does part of the work. > > > > Cheers > > > > On 26/06/2015 18:56, Bruce McFarland wrote: > > > Loic, > > > Thank you very much for the partprobe workaround. I rebuilt the > > > cluster > > using 94.2. > > > > > > I've created partitions on the journal SSDs with parted and then use > > > ceph- > > disk prepare as below. I'm not seeing all of the disks with the tmp > > mounts when I check 'mount' but I also don't see any of the mount > > directory mount points at /var/lib/ceph/osd. I'm see the following output > from prepare. > > When I attempt to 'activate' it errors out saying the devices don't exist. > > > > > > ceph-disk prepare --cluster ceph --cluster-uuid > > > b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs --zap-disk > > > /dev/disk/by-id/wwn-0x500003959bd02f56 > > > /dev/disk/by-id/wwn-0x500080d91010024b-part1 > > > Caution: invalid backup GPT header, but valid main header; > > > regenerating backup header from main header. > > > > > > > > > ************************************************************** > > ******** > > > ****** > > > Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT, > > > but disk verification and recovery are STRONGLY recommended. > > > > > > ************************************************************** > > ******** > > > ****** GPT data structures destroyed! You may now partition the disk > > > using fdisk or other utilities. > > > Creating new GPT entries. > > > The operation has completed successfully. > > > partx: specified range <1:0> does not make sense WARNING:ceph- > > disk:OSD > > > will not be hot-swappable if journal is not the same device as the > > > osd data WARNING:ceph-disk:Journal /dev/disk/by-id/wwn- > > 0x500080d91010024b-part1 was not prepared with ceph-disk. Symlinking > > directly. > > > The operation has completed successfully. > > > partx: /dev/disk/by-id/wwn-0x500003959bd02f56: error adding partition > 1 > > > meta-data=/dev/sdw1 isize=2048 agcount=4, > agsize=244188597 > > blks > > > = sectsz=512 attr=2, projid32bit=1 > > > = crc=0 finobt=0 > > > data = bsize=4096 blocks=976754385, imaxpct=5 > > > = sunit=0 swidth=0 blks > > > naming =version 2 bsize=4096 ascii-ci=0 ftype=0 > > > log =internal log bsize=4096 blocks=476930, version=2 > > > = sectsz=512 sunit=0 blks, lazy-count=1 > > > realtime =none extsz=4096 blocks=0, rtextents=0 > > > The operation has completed successfully. > > > partx: /dev/disk/by-id/wwn-0x500003959bd02f56: error adding > > > partition > > > 1 > > > > > > > > > [root@ceph0 ceph]# ceph -v > > > ceph version 0.94.2 (5fb85614ca8f354284c713a2f9c610860720bbf3) > > > [root@ceph0 ceph]# rpm -qa | grep ceph > > > ceph-radosgw-0.94.2-0.el7.x86_64 > > > libcephfs1-0.94.2-0.el7.x86_64 > > > ceph-common-0.94.2-0.el7.x86_64 > > > python-cephfs-0.94.2-0.el7.x86_64 > > > ceph-0.94.2-0.el7.x86_64 > > > [root@ceph0 ceph]# > > > > > > > > > > > >> -----Original Message----- > > >> From: Loic Dachary [mailto:loic@xxxxxxxxxxx] > > >> Sent: Friday, June 26, 2015 3:29 PM > > >> To: Bruce McFarland; ceph-users@xxxxxxxxxxxxxx > > >> Subject: Re: RHEL 7.1 ceph-disk failures creating OSD > > >> > > >> Hi, > > >> > > >> Prior to firefly v0.80.8 ceph-disk zap did not call partprobe and > > >> that was causing the kind of problems you're experiencing. It was > > >> fixed by > > >> > > > https://github.com/ceph/ceph/commit/e70a81464b906b9a304c29f474e672 > > >> 6762b63a7c and is described in more details at > > >> http://tracker.ceph.com/issues/9665. Rebooting the machine ensures > > >> the partition table is up to date and that's what you probably want > > >> to do after that kind of failure. You can however avoid the failure > > >> by > > running: > > >> > > >> * ceph-disk zap > > >> * partproble > > >> * ceph-disk prepare > > >> > > >> Cheers > > >> > > >> P.S. The "partx: /dev/disk/by-id/wwn-0x500003959ba80a4e: error > > >> adding partition 1" can be ignored, it does not actually matter. A > > >> message was added later to avoid confusion with a real error. > > >> . > > >> On 26/06/2015 17:09, Bruce McFarland wrote: > > >>> I have moved storage nodes to RHEL 7.1 and used the basic server > > >>> install. I > > >> installed ceph-deploy and used the ceph.repo/epel.repo for > > >> installation of ceph 80.7. I have tried ceph-disk with issuing "zap" > > >> on the same command line as "prepare" and on a separate command > > line > > >> immediately before the ceph-disk prepare. I consistently run into > > >> the partition errors and am unable to create OSD's on RHEL 7.1. > > >>> > > >>> > > >>> > > >>> ceph-disk prepare --cluster ceph --cluster-uuid > > >>> 373a09f7-2070-4d20-8504- > > >> c8653fb6db80 --fs-type xfs --zap-disk /dev/disk/by-id/wwn- > > >> 0x500003959ba80a4e /dev/disk/by-id/wwn-0x500080d9101001d6- > part1 > > >>> > > >>> Caution: invalid backup GPT header, but valid main header; > > >>> regenerating > > >>> > > >>> backup header from main header. > > >>> > > >>> > > >>> > > >>> > > >> > > > ************************************************************** > > >> ************** > > >>> > > >>> Caution: Found protective or hybrid MBR and corrupt GPT. Using > > >>> GPT, but > > >> disk > > >>> > > >>> verification and recovery are STRONGLY recommended. > > >>> > > >>> > > >> > > > ************************************************************** > > >> ************** > > >>> > > >>> GPT data structures destroyed! You may now partition the disk > > >>> using fdisk > > >> or > > >>> > > >>> other utilities. > > >>> > > >>> The operation has completed successfully. > > >>> > > >>> WARNING:ceph-disk:OSD will not be hot-swappable if journal is not > > >>> the > > >> same device as the osd data > > >>> > > >>> The operation has completed successfully. > > >>> > > >>> meta-data=/dev/sdc1 isize=2048 agcount=4, > > agsize=244188597 > > >> blks > > >>> > > >>> = sectsz=512 attr=2, projid32bit=1 > > >>> > > >>> = crc=0 finobt=0 > > >>> > > >>> data = bsize=4096 blocks=976754385, imaxpct=5 > > >>> > > >>> = sunit=0 swidth=0 blks > > >>> > > >>> naming =version 2 bsize=4096 ascii-ci=0 ftype=0 > > >>> > > >>> log =internal log bsize=4096 blocks=476930, version=2 > > >>> > > >>> = sectsz=512 sunit=0 blks, lazy-count=1 > > >>> > > >>> realtime =none extsz=4096 blocks=0, rtextents=0 > > >>> > > >>> The operation has completed successfully. > > >>> > > >>> partx: /dev/disk/by-id/wwn-0x500003959ba80a4e: error adding > > >>> partition 1 > > >>> > > >>> > > >>> > > >>> _______________________________________________ > > >>> ceph-users mailing list > > >>> ceph-users@xxxxxxxxxxxxxx > > >>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > > >>> > > >> > > >> -- > > >> Loïc Dachary, Artisan Logiciel Libre > > > > > > > -- > > Loïc Dachary, Artisan Logiciel Libre _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com