Re: problem readding an osd

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



New post bellow…

 

From: Greg [mailto:itooo@xxxxxxxxx]
Sent: Monday, May 06, 2013 2:31 PM
To: Glen Aidukas
Subject: Re: problem readding an osd

 

Le 06/05/2013 20:05, Glen Aidukas a écrit :

Greg,

 

Not sure where to use the –d switch.  I tried the following:

 

                Service ceph start –d

                Service ceph –d start

 

Both do not work.

 

I did see an error in my log though…

 

2013-05-06 13:03:38.432479 7f0007ef2780 -1 filestore(/srv/ceph/osd/osd.2) limited size xattrs -- filestore_xattr_use_omap enabled

2013-05-06 13:03:38.438563 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount FIEMAP ioctl is supported and appears to work

2013-05-06 13:03:38.438591 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount FIEMAP ioctl is disabled via 'filestore fiemap' config option

2013-05-06 13:03:38.438804 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount did NOT detect btrfs

2013-05-06 13:03:38.484841 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount syncfs(2) syscall fully supported (by glibc and kernel)

2013-05-06 13:03:38.485010 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount found snaps <>

2013-05-06 13:03:38.488631 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount: enabling WRITEAHEAD journal mode: btrfs not detected

2013-05-06 13:03:38.488936 7f0007ef2780  1 journal _open /srv/ceph/osd/osd.2/journal fd 19: 1048576000 bytes, block size 4096 bytes, directio = 1, aio = 0

2013-05-06 13:03:38.489095 7f0007ef2780  1 journal _open /srv/ceph/osd/osd.2/journal fd 19: 1048576000 bytes, block size 4096 bytes, directio = 1, aio = 0

2013-05-06 13:03:38.490116 7f0007ef2780  1 journal close /srv/ceph/osd/osd.2/journal

2013-05-06 13:03:38.538302 7f0007ef2780 -1 filestore(/srv/ceph/osd/osd.2) limited size xattrs -- filestore_xattr_use_omap enabled

2013-05-06 13:03:38.559813 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount FIEMAP ioctl is supported and appears to work

2013-05-06 13:03:38.559848 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount FIEMAP ioctl is disabled via 'filestore fiemap' config option

2013-05-06 13:03:38.560082 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount did NOT detect btrfs

2013-05-06 13:03:38.566015 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount syncfs(2) syscall fully supported (by glibc and kernel)

2013-05-06 13:03:38.566106 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount found snaps <>

2013-05-06 13:03:38.569047 7f0007ef2780  0 filestore(/srv/ceph/osd/osd.2) mount: enabling WRITEAHEAD journal mode: btrfs not detected

2013-05-06 13:03:38.569237 7f0007ef2780  1 journal _open /srv/ceph/osd/osd.2/journal fd 27: 1048576000 bytes, block size 4096 bytes, directio = 1, aio = 0

2013-05-06 13:03:38.569316 7f0007ef2780  1 journal _open /srv/ceph/osd/osd.2/journal fd 27: 1048576000 bytes, block size 4096 bytes, directio = 1, aio = 0

2013-05-06 13:03:38.574317 7f0007ef2780  1 journal close /srv/ceph/osd/osd.2/journal

2013-05-06 13:03:38.574801 7f0007ef2780 -1  ** ERROR: osd init failed: (1) Operation not permitted

 

 

Glen Aidukas  [Manager IT Infrasctructure]


From: ceph-users-bounces@xxxxxxxxxxxxxx [mailto:ceph-users-bounces@xxxxxxxxxxxxxx] On Behalf Of Greg
Sent: Monday, May 06, 2013 1:47 PM
To: ceph-users@xxxxxxxxxxxxxx
Subject: Re: problem readding an osd

 

Le 06/05/2013 19:23, Glen Aidukas a écrit :

Hello,

 

I think this is a newbe question but I tested everything and, yes I FTFM as best I could.

 

I’m evaluating ceph and so I setup a cluster of 4 nodes.  The nodes are KVM virtual machines named ceph01 to ceph04 all running Ubuntu 12.04.2 LTS each with a single osd named osd.1 though osd.4 respective to the host they were running on.  Each host also has a 1TB disk for ceph to use ‘/dev/vdb1’.

 

After some work I was able to get the cluster up and running and even mounted it on a test client host (named ceph00).  I ran into issues when I was testing a failure.  I shut off ceph02 and watched via (ceph –w) it recover and move the data around.  At this point all is fine.

 

When I turned the host back on, it did not auto reconnect.  I expected this.  I then send through many attempts to re add it but all failed.

 

Here is an output from:  ceph osd tree

 

# id    weight  type name       up/down reweight

-1      4       root default

-3      4               rack unknownrack

-2      1                       host ceph01

1       1                               osd.1   up      1

-4      1                       host ceph02

2       1                               osd.2   down    0

-5      1                       host ceph03

3       1                               osd.3   up      1

-6      1                       host ceph04

4       1                               osd.4   up      1

-7      0               rack unkownrack

 

ceph -s

   health HEALTH_WARN 208 pgs peering; 208 pgs stuck inactive; 208 pgs stuck unclean; 1/4 in osds are down

   monmap e1: 1 mons at {a=10.30.20.81:6789/0}, election epoch 1, quorum 0 a

   osdmap e172: 4 osds: 3 up, 4 in

    pgmap v1970: 960 pgs: 752 active+clean, 208 peering; 5917 MB data, 61702 MB used, 2854 GB / 3068 GB avail

   mdsmap e39: 1/1/1 up {0=a=up:active}

 

While I’m able to get it to be in the ‘in’ state, I cant seem to bring it up.

 

Any ideas on how to fix this?

 

Glen,

try to bring up your OSD daemon with -d switch, this will probably give you some information. (alternatively look in the logs)

Cheers,

Glen,

1/ please reply to the list so everyone can benefit
2/ please quote downwards as it is easier to read
3/ did "testing everything" not include reading the logs ?
4/ now know you have an error, you can dig into your problem

Regards,

 

Greg,

 

1)      I hit reply and didn’t realize it was going directly to you until after I hit send. I will make sure I send back to the mail list for future posts.

 

2)      Got it!

 

3)      I did see the logs at one point but forgot to mention it in the original post.  My bad… L

 

4)      I did dig into this error but was not able to determine the cause.  Like I said, I did a lot of research before posting. 

 

I’m sure I must be stuck and looking in the wrong direction.  This is why I came here to have my head pointed in the correct location.  J

 

Regards,

 

-Glen

 

 

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux