Assuming you deployed within the last 48 hours, I’m going to bet you are using v10.2.4 which has an issue that causes high cpu utilization.
Should see large ramp up in loadav after 15 minutes exactly.
Reed
Hello,
I am testing out a new node setup for us and I have configured a node in a single node cluster. It has 24 OSDs. Everything looked okay during the initial build and I was able to run the 'rados bench' on it just fine. However, if I just let the cluster sit and run for a few minutes without anything happening, the load starts to go up quickly. Each OSD device ends up using 130% CPU, with the load on the box hitting 550.00. No operations are going on, nothing shows up in the logs as happening or wrong. If I restart the OSD processes, the load stays down for a few minutes(almost at nothing) and then just jumps back up again.
Any idea what could cause this or a direction I can look to check it?
Have a good day,
Lewis George
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxxhttp://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
|
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com