Did you make progress on this? We have a ton of < 64K objects as well and are struggling to get good performance out of our RGW. Sometimes we have RGW instances that are just gobbling up CPU even when there are no requests to them, so it seems like things are getting hung up somewhere. There is nothing in the logs and I haven't had time to do more troubleshooting.
There's a bug in the current stable Nautilus release that causes a loop and/or crash in get_obj_data::flush (you should be able to see it gobbling up CPU in perf top). This is the related issue: https://tracker.ceph.com/issues/39660 -- it should be fixed as soon as 14.2.5 is released (any day now, supposedly).
Hope this helps, Ed |
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com