Hi,

With --debug-objecter=20, I found that the rados ls command hangs, looping on laggy messages:
2019-07-03 13:33:24.913 7efc402f5700 10 client.21363886.objecter _op_submit op 0x7efc3800dc10
2019-07-03 13:33:24.913 7efc402f5700 20 client.21363886.objecter _calc_target epoch 13146 base @3 precalc_pgid 1 pgid 3.100 is_read
2019-07-03 13:33:24.913 7efc402f5700 20 client.21363886.objecter _calc_target target @3 -> pgid 3.100
2019-07-03 13:33:24.913 7efc402f5700 10 client.21363886.objecter _calc_target raw pgid 3.100 -> actual 3.100 acting [29,12,55] primary 29
2019-07-03 13:33:24.913 7efc402f5700 20 client.21363886.objecter _get_session s=0x7efc380024c0 osd=29 3
2019-07-03 13:33:24.913 7efc402f5700 10 client.21363886.objecter _op_submit oid '@3' '@3' [pgnls start_epoch 13146] tid 11 osd.29
2019-07-03 13:33:24.913 7efc402f5700 20 client.21363886.objecter get_session s=0x7efc380024c0 osd=29 3
2019-07-03 13:33:24.913 7efc402f5700 15 client.21363886.objecter _session_op_assign 29 11
2019-07-03 13:33:24.913 7efc402f5700 15 client.21363886.objecter _send_op 11 to 3.100 on osd.29
2019-07-03 13:33:24.913 7efc402f5700 20 client.21363886.objecter put_session s=0x7efc380024c0 osd=29 4
2019-07-03 13:33:24.913 7efc402f5700 5 client.21363886.objecter 1 in flight
2019-07-03 13:33:29.678 7efc3e2f1700 10 client.21363886.objecter tick
2019-07-03 13:33:34.678 7efc3e2f1700 10 client.21363886.objecter tick
2019-07-03 13:33:39.678 7efc3e2f1700 10 client.21363886.objecter tick
2019-07-03 13:33:39.678 7efc3e2f1700 2 client.21363886.objecter tid 11 on osd.29 is laggy
2019-07-03 13:33:39.678 7efc3e2f1700 10 client.21363886.objecter _maybe_request_map subscribing (onetime) to next osd map
2019-07-03 13:33:44.678 7efc3e2f1700 10 client.21363886.objecter tick
2019-07-03 13:33:44.678 7efc3e2f1700 2 client.21363886.objecter tid 11 on osd.29 is laggy
2019-07-03 13:33:44.678 7efc3e2f1700 10 client.21363886.objecter _maybe_request_map subscribing (onetime) to next osd map
2019-07-03 13:33:49.679 7efc3e2f1700 10 client.21363886.objecter tick

I tried disabling this OSD, but the problem just moves to another OSD, and so on.

The Ceph client packages are up to date. All RBD commands still work from a monitor, but not from the OpenStack controllers. The other Ceph pool, on the same OSD host but on different disks, works perfectly with OpenStack...

The issue looks like these old ones, but they seem to have been fixed for a few years:
https://tracker.ceph.com/issues/2454 and https://tracker.ceph.com/issues/8515

Is there anything more I can check?

Adrien

On 02/07/2019 at 14:10, Adrien Georget wrote:
Hi Eugen,
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
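
To check how often the objecter reports an op as laggy in debug output like the log above, a small script can count the reports per tid and OSD. This is only a sketch: the function name `summarize_laggy` and the regular expression are assumptions based on the log lines shown in this message, not a Ceph tool. The pattern tolerates the line wrapping seen here.

```python
import re
from collections import defaultdict
from datetime import datetime

# Assumed line shape, taken from the objecter output in this message:
#   <date> <time> <thread> <level> client.<id>.objecter tid <N> on osd.<M> is laggy
# \s+ is used between fields so that mail-wrapped lines still match.
LAGGY_RE = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\s+"  # timestamp
    r"\S+\s+\d+\s+"                                    # thread id, debug level
    r"client\.\d+\.objecter\s+"
    r"tid\s+(\d+)\s+on\s+osd\.(\d+)\s+is\s+laggy"
)


def summarize_laggy(log_text):
    """Return {(tid, osd): [datetime, ...]} for every 'is laggy' report."""
    reports = defaultdict(list)
    for stamp, tid, osd in LAGGY_RE.findall(log_text):
        when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S.%f")
        reports[(int(tid), int(osd))].append(when)
    return dict(reports)


# Two laggy reports copied from the log in this message.
SAMPLE = """\
2019-07-03 13:33:39.678 7efc3e2f1700 2
client.21363886.objecter tid 11 on osd.29 is laggy
2019-07-03 13:33:44.678 7efc3e2f1700 2
client.21363886.objecter tid 11 on osd.29 is laggy
"""

if __name__ == "__main__":
    for (tid, osd), times in summarize_laggy(SAMPLE).items():
        span = (times[-1] - times[0]).total_seconds()
        print(f"tid {tid} on osd.{osd}: {len(times)} laggy reports over {span:.0f}s")
```

If the same tid keeps being reported against one OSD, and then against the next OSD once the first is disabled, that matches the behaviour described in this message. The script only summarizes the log and does not explain the cause.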