Re: RHEL 7.1 ceph-disk failures creating OSD with ver 0.94.2

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Using the "manual" method of creating an OSD on RHEL 7.1 with Ceph 94.2 turns up an issue with the ondisk fsid of the journal device. From a quick web search I've found reference to this exact same issue from earlier this year. Is there a version of Ceph that works with RHEL 7.1??? 

[root@ceph0 ceph]# ceph-disk-prepare --cluster ceph --cluster-uuid b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs /dev/sdc /dev/sdb1
WARNING:ceph-disk:OSD will not be hot-swappable if journal is not the same device as the osd data
The operation has completed successfully.
partx: /dev/sdc: error adding partition 1
meta-data=/dev/sdc1              isize=2048   agcount=4, agsize=244188597 blks
         =                       sectsz=512   attr=2, projid32bit=1
         =                       crc=0        finobt=0
data     =                       bsize=4096   blocks=976754385, imaxpct=5
         =                       sunit=0      swidth=0 blks
naming   =version 2              bsize=4096   ascii-ci=0 ftype=0
log      =internal log           bsize=4096   blocks=476930, version=2
         =                       sectsz=512   sunit=0 blks, lazy-count=1
realtime =none                   extsz=4096   blocks=0, rtextents=0
The operation has completed successfully.
partx: /dev/sdc: error adding partition 1
[root@ceph0 ceph]# mkdir /var/lib/ceph/osd/ceph-0
[root@ceph0 ceph]# ll /var/lib/ceph/osd/
total 0
drwxr-xr-x. 2 root root 6 Jun 29 12:01 ceph-0
[root@ceph0 ceph]# mount -t xfs /dev/sdc1 /var/lib/ceph/osd/ceph-0/
[root@ceph0 ceph]# mount
proc on /proc type proc (rw,nosuid,nodev,noexec,relatime)
sysfs on /sys type sysfs (rw,nosuid,nodev,noexec,relatime,seclabel)
devtmpfs on /dev type devtmpfs (rw,nosuid,seclabel,size=57648336k,nr_inodes=14412084,mode=755)
securityfs on /sys/kernel/security type securityfs (rw,nosuid,nodev,noexec,relatime)
tmpfs on /dev/shm type tmpfs (rw,nosuid,nodev,seclabel)
devpts on /dev/pts type devpts (rw,nosuid,noexec,relatime,seclabel,gid=5,mode=620,ptmxmode=000)
tmpfs on /run type tmpfs (rw,nosuid,nodev,seclabel,mode=755)
tmpfs on /sys/fs/cgroup type tmpfs (rw,nosuid,nodev,noexec,seclabel,mode=755)
cgroup on /sys/fs/cgroup/systemd type cgroup (rw,nosuid,nodev,noexec,relatime,xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd)
pstore on /sys/fs/pstore type pstore (rw,nosuid,nodev,noexec,relatime)
cgroup on /sys/fs/cgroup/cpuset type cgroup (rw,nosuid,nodev,noexec,relatime,cpuset)
cgroup on /sys/fs/cgroup/cpu,cpuacct type cgroup (rw,nosuid,nodev,noexec,relatime,cpuacct,cpu)
cgroup on /sys/fs/cgroup/memory type cgroup (rw,nosuid,nodev,noexec,relatime,memory)
cgroup on /sys/fs/cgroup/devices type cgroup (rw,nosuid,nodev,noexec,relatime,devices)
cgroup on /sys/fs/cgroup/freezer type cgroup (rw,nosuid,nodev,noexec,relatime,freezer)
cgroup on /sys/fs/cgroup/net_cls type cgroup (rw,nosuid,nodev,noexec,relatime,net_cls)
cgroup on /sys/fs/cgroup/blkio type cgroup (rw,nosuid,nodev,noexec,relatime,blkio)
cgroup on /sys/fs/cgroup/perf_event type cgroup (rw,nosuid,nodev,noexec,relatime,perf_event)
cgroup on /sys/fs/cgroup/hugetlb type cgroup (rw,nosuid,nodev,noexec,relatime,hugetlb)
configfs on /sys/kernel/config type configfs (rw,relatime)
/dev/mapper/rhel_ceph0-root on / type xfs (rw,relatime,seclabel,attr2,inode64,noquota)
selinuxfs on /sys/fs/selinux type selinuxfs (rw,relatime)
systemd-1 on /proc/sys/fs/binfmt_misc type autofs (rw,relatime,fd=35,pgrp=1,timeout=300,minproto=5,maxproto=5,direct)
debugfs on /sys/kernel/debug type debugfs (rw,relatime)
mqueue on /dev/mqueue type mqueue (rw,relatime,seclabel)
hugetlbfs on /dev/hugepages type hugetlbfs (rw,relatime,seclabel)
/dev/mapper/rhel_ceph0-home on /home type xfs (rw,relatime,seclabel,attr2,inode64,noquota)
/dev/sda2 on /boot type xfs (rw,relatime,seclabel,attr2,inode64,noquota)
binfmt_misc on /proc/sys/fs/binfmt_misc type binfmt_misc (rw,relatime)
fusectl on /sys/fs/fuse/connections type fusectl (rw,relatime)
/dev/sdc1 on /var/lib/ceph/osd/ceph-0 type xfs (rw,relatime,seclabel,attr2,inode64,noquota)
[root@ceph0 ceph]# ceph-osd -i=0 --mkfs 
2015-06-29 12:02:47.702808 7f2fb4625880 -1 journal FileJournal::_open: disabling aio for non-block journal.  Use journal_force_aio to force use of aio anyway
2015-06-29 12:02:47.702851 7f2fb4625880 -1 journal check: ondisk fsid 00000000-0000-0000-0000-000000000000 doesn't match expected 7e792d5e-a5c6-40cd-a361-0457875ea92c, invalid (someone else's?) journal
2015-06-29 12:02:47.702876 7f2fb4625880 -1 filestore(/var/lib/ceph/osd/ceph-0) mkjournal error creating journal on /var/lib/ceph/osd/ceph-0/journal: (22) Invalid argument
2015-06-29 12:02:47.702890 7f2fb4625880 -1 OSD::mkfs: ObjectStore::mkfs failed with error -22
2015-06-29 12:02:47.702928 7f2fb4625880 -1  ** ERROR: error creating empty object store in /var/lib/ceph/osd/ceph-0: (22) Invalid argument
[root@ceph0 ceph]#

> -----Original Message-----
> From: Bruce McFarland
> Sent: Monday, June 29, 2015 11:39 AM
> To: 'Loic Dachary'; ceph-users@xxxxxxxxxxxxxx
> Subject: RE:  RHEL 7.1 ceph-disk failures creating OSD with ver
> 0.94.2
> 
> It doesn't appear to be related to using wwn's for the drive id. The verbose
> output shows ceph converting from wwn to sd letter. I ran with verbose on
> and used sd letters for the data drive and the journal and get the same
> failures. I'm attempting to create OSD's manually now.
> 
> [root@ceph0 ceph]# ceph-disk -v prepare --cluster ceph --cluster-uuid
> b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs --zap-disk /dev/sdc
> /dev/sdb1 DEBUG:ceph-disk:Zapping partition table on /dev/sdc INFO:ceph-
> disk:Running command: /usr/sbin/sgdisk --zap-all -- /dev/sdc
> Caution: invalid backup GPT header, but valid main header; regenerating
> backup header from main header.
> 
> Warning! Main and backup partition tables differ! Use the 'c' and 'e' options
> on the recovery & transformation menu to examine the two tables.
> 
> Warning! One or more CRCs don't match. You should repair the disk!
> 
> **************************************************************
> **************
> Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT, but
> disk verification and recovery are STRONGLY recommended.
> **************************************************************
> **************
> GPT data structures destroyed! You may now partition the disk using fdisk or
> other utilities.
> INFO:ceph-disk:Running command: /usr/sbin/sgdisk --clear --mbrtogpt --
> /dev/sdc Creating new GPT entries.
> The operation has completed successfully.
> INFO:ceph-disk:calling partx on zapped device /dev/sdc INFO:ceph-disk:re-
> reading known partitions will display errors INFO:ceph-disk:Running
> command: /usr/sbin/partx -d /dev/sdc
> partx: specified range <1:0> does not make sense INFO:ceph-disk:Running
> command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup
> osd_mkfs_options_xfs INFO:ceph-disk:Running command: /usr/bin/ceph-
> conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
> INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --
> name=osd. --lookup osd_mount_options_xfs INFO:ceph-disk:Running
> command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup
> osd_fs_mount_options_xfs INFO:ceph-disk:Running command:
> /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
> INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --
> name=osd. --lookup osd_cryptsetup_parameters INFO:ceph-disk:Running
> command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup
> osd_dmcrypt_key_size INFO:ceph-disk:Running command: /usr/bin/ceph-
> conf --cluster=ceph --name=osd. --lookup osd_dmcrypt_type DEBUG:ceph-
> disk:Journal is file /dev/sdb1 WARNING:ceph-disk:OSD will not be hot-
> swappable if journal is not the same device as the osd data DEBUG:ceph-
> disk:Creating osd partition on /dev/sdc INFO:ceph-disk:Running command:
> /usr/sbin/sgdisk --largest-new=1 --change-name=1:ceph data --partition-
> guid=1:6d05612e-5cc0-422c-9228-4e53ee0f27ac --typecode=1:89c57f98-
> 2fe5-4dc0-89c1-f3ad0ceff2be -- /dev/sdc The operation has completed
> successfully.
> INFO:ceph-disk:calling partx on created device /dev/sdc INFO:ceph-disk:re-
> reading known partitions will display errors INFO:ceph-disk:Running
> command: /usr/sbin/partx -a /dev/sdc
> partx: /dev/sdc: error adding partition 1 INFO:ceph-disk:Running command:
> /usr/bin/udevadm settle DEBUG:ceph-disk:Creating xfs fs on /dev/sdc1
> INFO:ceph-disk:Running command: /usr/sbin/mkfs -t xfs -f -i size=2048 --
> /dev/sdc1
> meta-data=/dev/sdc1              isize=2048   agcount=4, agsize=244188597
> blks
>          =                       sectsz=512   attr=2, projid32bit=1
>          =                       crc=0        finobt=0
> data     =                       bsize=4096   blocks=976754385, imaxpct=5
>          =                       sunit=0      swidth=0 blks
> naming   =version 2              bsize=4096   ascii-ci=0 ftype=0
> log      =internal log           bsize=4096   blocks=476930, version=2
>          =                       sectsz=512   sunit=0 blks, lazy-count=1
> realtime =none                   extsz=4096   blocks=0, rtextents=0
> DEBUG:ceph-disk:Mounting /dev/sdc1 on /var/lib/ceph/tmp/mnt.DQ8nOj
> with options noatime,inode64 INFO:ceph-disk:Running command:
> /usr/bin/mount -t xfs -o noatime,inode64 -- /dev/sdc1
> /var/lib/ceph/tmp/mnt.DQ8nOj DEBUG:ceph-disk:Preparing osd data dir
> /var/lib/ceph/tmp/mnt.DQ8nOj DEBUG:ceph-disk:Creating symlink
> /var/lib/ceph/tmp/mnt.DQ8nOj/journal -> /dev/sdb1 DEBUG:ceph-
> disk:Unmounting /var/lib/ceph/tmp/mnt.DQ8nOj INFO:ceph-disk:Running
> command: /bin/umount -- /var/lib/ceph/tmp/mnt.DQ8nOj INFO:ceph-
> disk:Running command: /usr/sbin/sgdisk --typecode=1:4fbd7e29-9d25-41b8-
> afd0-062c0ceff05d -- /dev/sdc The operation has completed successfully.
> INFO:ceph-disk:calling partx on prepared device /dev/sdc INFO:ceph-disk:re-
> reading known partitions will display errors INFO:ceph-disk:Running
> command: /usr/sbin/partx -a /dev/sdc
> partx: /dev/sdc: error adding partition 1
> [root@ceph0 ceph]#
> 
> > -----Original Message-----
> > From: Loic Dachary [mailto:loic@xxxxxxxxxxx]
> > Sent: Saturday, June 27, 2015 1:08 AM
> > To: Bruce McFarland; ceph-users@xxxxxxxxxxxxxx
> > Subject: Re:  RHEL 7.1 ceph-disk failures creating OSD
> >
> > Hi Bruce,
> >
> > I think the problem comes from using /dev/disk/by-id/wwn-
> > 0x500003959bd02f56 instead of /dev/sdw for the data disk, because
> > ceph- disk has a device name parsing logic that works with /dev/XXX.
> > Could you run the ceph-disk prepare command again with --verbose to
> > confirm ? If that's the case there should be an error instead of what
> > appears to be something that only does part of the work.
> >
> > Cheers
> >
> > On 26/06/2015 18:56, Bruce McFarland wrote:
> > > Loic,
> > > Thank you very much for the partprobe workaround. I rebuilt the
> > > cluster
> > using 94.2.
> > >
> > > I've created partitions on the journal SSDs with parted and then use
> > > ceph-
> > disk prepare as below. I'm not seeing all of the disks with the tmp
> > mounts when I check 'mount' but I also don't see any of the mount
> > directory mount points at /var/lib/ceph/osd. I'm see the following output
> from prepare.
> > When I attempt to 'activate' it errors out saying the devices don't exist.
> > >
> > > ceph-disk prepare --cluster ceph --cluster-uuid
> > > b2c2e866-ab61-4f80-b116-20fa2ea2ca94 --fs-type xfs --zap-disk
> > > /dev/disk/by-id/wwn-0x500003959bd02f56
> > > /dev/disk/by-id/wwn-0x500080d91010024b-part1
> > > Caution: invalid backup GPT header, but valid main header;
> > > regenerating backup header from main header.
> > >
> > >
> >
> **************************************************************
> > ********
> > > ******
> > > Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT,
> > > but disk verification and recovery are STRONGLY recommended.
> > >
> >
> **************************************************************
> > ********
> > > ****** GPT data structures destroyed! You may now partition the disk
> > > using fdisk or other utilities.
> > > Creating new GPT entries.
> > > The operation has completed successfully.
> > > partx: specified range <1:0> does not make sense WARNING:ceph-
> > disk:OSD
> > > will not be hot-swappable if journal is not the same device as the
> > > osd data WARNING:ceph-disk:Journal /dev/disk/by-id/wwn-
> > 0x500080d91010024b-part1 was not prepared with ceph-disk. Symlinking
> > directly.
> > > The operation has completed successfully.
> > > partx: /dev/disk/by-id/wwn-0x500003959bd02f56: error adding partition
> 1
> > > meta-data=/dev/sdw1              isize=2048   agcount=4,
> agsize=244188597
> > blks
> > >          =                       sectsz=512   attr=2, projid32bit=1
> > >          =                       crc=0        finobt=0
> > > data     =                       bsize=4096   blocks=976754385, imaxpct=5
> > >          =                       sunit=0      swidth=0 blks
> > > naming   =version 2              bsize=4096   ascii-ci=0 ftype=0
> > > log      =internal log           bsize=4096   blocks=476930, version=2
> > >          =                       sectsz=512   sunit=0 blks, lazy-count=1
> > > realtime =none                   extsz=4096   blocks=0, rtextents=0
> > > The operation has completed successfully.
> > > partx: /dev/disk/by-id/wwn-0x500003959bd02f56: error adding
> > > partition
> > > 1
> > >
> > >
> > > [root@ceph0 ceph]# ceph -v
> > > ceph version 0.94.2 (5fb85614ca8f354284c713a2f9c610860720bbf3)
> > > [root@ceph0 ceph]# rpm -qa | grep ceph
> > > ceph-radosgw-0.94.2-0.el7.x86_64
> > > libcephfs1-0.94.2-0.el7.x86_64
> > > ceph-common-0.94.2-0.el7.x86_64
> > > python-cephfs-0.94.2-0.el7.x86_64
> > > ceph-0.94.2-0.el7.x86_64
> > > [root@ceph0 ceph]#
> > >
> > >
> > >
> > >> -----Original Message-----
> > >> From: Loic Dachary [mailto:loic@xxxxxxxxxxx]
> > >> Sent: Friday, June 26, 2015 3:29 PM
> > >> To: Bruce McFarland; ceph-users@xxxxxxxxxxxxxx
> > >> Subject: Re:  RHEL 7.1 ceph-disk failures creating OSD
> > >>
> > >> Hi,
> > >>
> > >> Prior to firefly v0.80.8 ceph-disk zap did not call partprobe and
> > >> that was causing the kind of problems you're experiencing. It was
> > >> fixed by
> > >>
> >
> https://github.com/ceph/ceph/commit/e70a81464b906b9a304c29f474e672
> > >> 6762b63a7c and is described in more details at
> > >> http://tracker.ceph.com/issues/9665. Rebooting the machine ensures
> > >> the partition table is up to date and that's what you probably want
> > >> to do after that kind of failure. You can however avoid the failure
> > >> by
> > running:
> > >>
> > >>  * ceph-disk zap
> > >>  * partproble
> > >>  * ceph-disk prepare
> > >>
> > >> Cheers
> > >>
> > >> P.S. The "partx: /dev/disk/by-id/wwn-0x500003959ba80a4e: error
> > >> adding partition 1" can be ignored, it does not actually matter. A
> > >> message was added later to avoid confusion with a real error.
> > >> .
> > >> On 26/06/2015 17:09, Bruce McFarland wrote:
> > >>> I have moved storage nodes to RHEL 7.1 and used the basic server
> > >>> install. I
> > >> installed ceph-deploy and used the ceph.repo/epel.repo for
> > >> installation of ceph 80.7. I have tried ceph-disk with issuing "zap"
> > >> on the same command line as "prepare" and on a separate command
> > line
> > >> immediately before the ceph-disk prepare. I consistently run into
> > >> the partition errors and am unable to create OSD's on RHEL 7.1.
> > >>>
> > >>>
> > >>>
> > >>> ceph-disk prepare --cluster ceph --cluster-uuid
> > >>> 373a09f7-2070-4d20-8504-
> > >> c8653fb6db80 --fs-type xfs --zap-disk /dev/disk/by-id/wwn-
> > >> 0x500003959ba80a4e /dev/disk/by-id/wwn-0x500080d9101001d6-
> part1
> > >>>
> > >>> Caution: invalid backup GPT header, but valid main header;
> > >>> regenerating
> > >>>
> > >>> backup header from main header.
> > >>>
> > >>>
> > >>>
> > >>>
> > >>
> >
> **************************************************************
> > >> **************
> > >>>
> > >>> Caution: Found protective or hybrid MBR and corrupt GPT. Using
> > >>> GPT, but
> > >> disk
> > >>>
> > >>> verification and recovery are STRONGLY recommended.
> > >>>
> > >>>
> > >>
> >
> **************************************************************
> > >> **************
> > >>>
> > >>> GPT data structures destroyed! You may now partition the disk
> > >>> using fdisk
> > >> or
> > >>>
> > >>> other utilities.
> > >>>
> > >>> The operation has completed successfully.
> > >>>
> > >>> WARNING:ceph-disk:OSD will not be hot-swappable if journal is not
> > >>> the
> > >> same device as the osd data
> > >>>
> > >>> The operation has completed successfully.
> > >>>
> > >>> meta-data=/dev/sdc1              isize=2048   agcount=4,
> > agsize=244188597
> > >> blks
> > >>>
> > >>>          =                       sectsz=512   attr=2, projid32bit=1
> > >>>
> > >>>          =                       crc=0        finobt=0
> > >>>
> > >>> data     =                       bsize=4096   blocks=976754385, imaxpct=5
> > >>>
> > >>>          =                       sunit=0      swidth=0 blks
> > >>>
> > >>> naming   =version 2              bsize=4096   ascii-ci=0 ftype=0
> > >>>
> > >>> log      =internal log           bsize=4096   blocks=476930, version=2
> > >>>
> > >>>          =                       sectsz=512   sunit=0 blks, lazy-count=1
> > >>>
> > >>> realtime =none                   extsz=4096   blocks=0, rtextents=0
> > >>>
> > >>> The operation has completed successfully.
> > >>>
> > >>> partx: /dev/disk/by-id/wwn-0x500003959ba80a4e: error adding
> > >>> partition 1
> > >>>
> > >>>
> > >>>
> > >>> _______________________________________________
> > >>> ceph-users mailing list
> > >>> ceph-users@xxxxxxxxxxxxxx
> > >>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
> > >>>
> > >>
> > >> --
> > >> Loïc Dachary, Artisan Logiciel Libre
> > >
> >
> > --
> > Loïc Dachary, Artisan Logiciel Libre

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux