Re: Ceph Bucket strange issues rgw.none + id and marker diferent.

Burkhard Linke <Burkhard.Linke@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> · Wed, 8 May 2019 09:39:51 +0200

Hi,

just a comment (and please correct me if I'm wrong)...

There are no "folders" in S3. A bucket is a flat list of objects. What 
you recognize as a folder is an artificial construct: S3 access tools 
treat a path delimiter in the object key (usually '/') as a folder separator.

As a result, listing a bucket with 6 million objects in 875 "folders" 
does require listing all 6 million objects. You can validate this by 
looking at the requests sent to the RGW (for example using 's3cmd -d la'):

...

DEBUG: Sending request method_string='GET', uri='/?delimiter=/', 
headers={'x-amz-content-sha256': 'XXXXXXX', 'Authorization': 
'AWS4-HMAC-SHA256 
Credential=XXXXXXX/US/s3/aws4_request,SignedHeaders=host;x-amz-content-sha256;x-amz-date,Signature=XXXXXX', 
'x-amz-date': '20190508T073339Z'}, body=(0 bytes)

....

And compare the request URL to the S3 API spec:

https://docs.aws.amazon.com/AmazonS3/latest/API/v2-RESTBucketGET.html

'delimiter=/' is just a convenience parameter for grouping the results. 
The implementation still has to enumerate all objects.
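
To illustrate the point, here is a small Python sketch of what a 
delimiter listing amounts to in the naive case. This is a toy model 
only, not RGW code: the function name list_with_delimiter and the sample 
keys are made up for the example, and it says nothing about how RGW 
actually walks its bucket index. It just shows that the "folders" are 
derived from the flat key list by looking at every key:

```python
def list_with_delimiter(keys, prefix="", delimiter="/"):
    """Toy model of a delimiter listing over a flat key list.

    Returns (objects, common_prefixes, keys_examined).
    Hypothetical helper for illustration; not an RGW or S3 API.
    """
    objects = []           # keys with no delimiter after the prefix
    common_prefixes = []   # the "folders" reported to the client
    seen = set()
    examined = 0

    for key in sorted(keys):          # a bucket listing is ordered by key
        examined += 1
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        idx = rest.find(delimiter) if delimiter else -1
        if idx == -1:
            objects.append(key)
        else:
            # Everything up to and including the first delimiter is
            # rolled up into a single common prefix.
            common = prefix + rest[:idx + len(delimiter)]
            if common not in seen:
                seen.add(common)
                common_prefixes.append(common)

    return objects, common_prefixes, examined


# Made-up sample data: 9 objects in 3 "folders" plus one top-level object.
keys = [f"folder{i % 3}/file{i}.dat" for i in range(9)] + ["README"]
objects, prefixes, examined = list_with_delimiter(keys)
print(objects)    # ['README']
print(prefixes)   # ['folder0/', 'folder1/', 'folder2/']
print(examined)   # 10
```

The client only sees one top-level object and three prefixes, but all 
10 keys were looked at to produce that answer.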

Regards,

Burkhard

--
Dr. rer. nat. Burkhard Linke
Bioinformatics and Systems Biology
Justus-Liebig-University Giessen
35392 Giessen, Germany
Phone: (+49) (0)641 9935810

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com