Hello, I’m working with two different Ceph clusters, and in both clusters, I’m seeing very high latency values. Here’s part of a sample perf dump: "recoverystate_perf": { "initial_latency": { "avgcount": 338, "sum": 0.069851000}, "started_latency": { "avgcount": 1647, "sum": 322317122.940019000}, "reset_latency": { "avgcount": 1985, "sum": 195.935076000}, "start_latency": { "avgcount": 1985, "sum": 0.234355000}, "primary_latency": { "avgcount": 266, "sum": 10819570.688122000}, You can see both started latency and primary latency have extremely high values.
Some info about the cluster: All nodes are on the same subnet - 2 VMs, 1 physical node VM1 is just a Monitor, VM2 is Monitor and OSD, Physical node is just an OSD. One additional question, are these latency values in milliseconds? Is there any documentation on the units for perf dump command? I’ve looked around but haven’t seen anything. Thanks, Dan
|
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com