The fundamental problem seems to be the same in each case, related to a
missing master_zone in the zonegroup. Like yours, our cluster has been
running for several years with few config changes, though in our case,
the 10.2.3 radosgw simply doesn't start at all, logging the following error:
2016-10-05 16:39:53.814677 7f3a1d085900 0 zonegroup default missing
zone for master_zone=
2016-10-05 16:39:53.819964 7f3a1d085900 -1 Couldn't init storage
provider (RADOS)
There seem to be several approaches to fixing it - I did find that link
you refer to, and also the "fix-zone" script from Yehuda referred to in:
http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-April/009189.html
then later this looks like a simpler solution to the same issue:
http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-July/011157.html
I am just moving slowly, as there is ~300TB in the object store which we
naturally don't want anything to happen to...
There was a good question in that first thread which I never saw an
answer to,
http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-April/009178.html
- namely can the hammer and jewel rados gateways co-exist for a short
time, or if correcting the master_zone will bring down the
still-functional hammer gateway. Or indeed whether all the gateways
should be stopped before updating any of them.
Graham
On 10/06/2016 06:47 PM, Andrei Mikhailovsky wrote:
Hi Graham,
Yeah, I am not sure why no one else is having the same issues. Anyway, had a chat on irc and got a link that helped me: https://www.mail-archive.com/ceph-users@xxxxxxxxxxxxxx/msg31764.html
I've followed what it said, even though the errors i got were different, but it helped me to start the service. I am yet to test if the rgw is functional and user clients can connect.
Hope that helps
andrei
----- Original Message -----
From: "Graham Allan" <gta@xxxxxxx>
To: "ceph-users" <ceph-users@xxxxxxxxxxxxxx>
Sent: Thursday, 6 October, 2016 20:04:38
Subject: Re: unable to start radosgw after upgrade from 10.2.2 to 10.2.3
That's interesting, as I am getting the exact same errors after
upgrading from Hammer 0.94.9 to Jewel 10.2.3 (on ubuntu 14.04).
I wondered if it was the issue referred to a few months ago here, but
I'm not so sure, since the error returned from radosgw-admin commands is
different:
http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-April/009171.html
I do have one radosgw which is still on 0.94.9 and still functions
normally - is it possible that this is preventing the config migration
alluded to in that thread? I'm reluctant to do anything to the
still-working 0.94.9 gateway until I can get the 10.2.3 gateways working!
Graham
On 10/05/2016 04:23 PM, Andrei Mikhailovsky wrote:
Hello everyone,
I've just updated my ceph to version 10.2.3 from 10.2.2 and I am no
longer able to start the radosgw service. When executing I get the
following error:
2016-10-05 22:14:10.735883 7f1852d26a00 0 ceph version 10.2.3
(ecc23778eb545d8dd55e2e4735b53cc93f92e65b), process radosgw, pid 2711
2016-10-05 22:14:10.765648 7f1852d26a00 0 pidfile_write: ignore empty
--pid-file
2016-10-05 22:14:11.287772 7f1852d26a00 0 zonegroup default missing
zone for master_zone=
2016-10-05 22:14:11.294141 7f1852d26a00 -1 Couldn't init storage
provider (RADOS)
I had no issues starting rados on 10.2.2 and all versions prior to that.
I am running ceph 10.2.3 on Ubuntu 16.04 LTS servers.
Could someone please help me with fixing the problem?
Thanks
Andrei
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
--
Graham Allan
Minnesota Supercomputing Institute - gta@xxxxxxx
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
--
Graham Allan
Minnesota Supercomputing Institute - gta@xxxxxxx
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com