Re: Broken bucket on upgrade

Gregory Farnum <greg@xxxxxxxxxxx> · Wed, 19 Mar 2014 11:55:32 -0700



Exactly what errors did you see, from which log? In general the OSD
does suicide on filesystem errors.
-Greg
Software Engineer #42 @ http://inktank.com | http://ceph.com


On Wed, Mar 19, 2014 at 4:06 AM, Mike Bryant <mike@xxxxxxxxxxxxxxxx> wrote:
> So I've done some more digging, and running the radosgw in debug mode I
> found some messages from osd.3 saying IOError, when it was trying to get
> .rgw:productimages.
> I took that OSD down, and everything started working.
>
> My question now is, why didn't that OSD suicide when it hit an IOError,
> instead of causing the cluster to stop working?
>
>
> On 19 March 2014 10:07, Mike Bryant <mike@xxxxxxxxxxxxxxxx> wrote:
>>
>> Hi,
>> I've just upgraded a test cluster to Emporer, and one of my S3 buckets
>> seems to have broken.
>>
>> s3 access is returning a 500 code (UnknownError).
>>
>> Running bucket stats, it's missing from the list.
>> Trying to do it explicitly:
>>
>> radosgw-admin bucket stats --bucket=productimages
>> 2014-03-19 10:06:17.829397 7ff0b81c7780  0 could not get bucket info for
>> bucket=productimages
>>
>> I can see the header object in the .rgw pool:
>> rados --cluster=cit-external ls --pool .rgw
>> .pools.avail
>> productimages
>> test
>> tests3fs
>>
>> Does anyone have any idea on what might have happened, or how I can get
>> this bucket back?
>>
>> Cheers
>> Mike
>
>
>
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com