On Sun, Oct 12, 2014 at 7:46 AM, Loic Dachary <loic@xxxxxxxxxxx> wrote:
> Hi,
>
> On a 0.80.6 cluster the command
>
> ceph tell osd.6 version
>
> hangs forever. I checked that it establishes a TCP connection to the OSD, raised the OSD debug level to 20 and I do not see
>
> https://github.com/ceph/ceph/blob/firefly/src/osd/OSD.cc#L4991
>
> in the logs. All other OSDs answer the same "version" command as they should, and "ceph daemon osd.6 version" on the machine running OSD 6 responds as it should. There is also an ever-growing number of slow requests on this OSD, but no errors in the logs. In other words, except for taking forever to answer any kind of request, the OSD looks fine.
>
> Another OSD running on the same machine is behaving well.
>
> Any idea what that behaviour relates to?

What commands have you run? The admin socket commands don't require nearly as many locks, nor do they go through the same event loops that messages do. You might have found a deadlock or something. (In which case just restarting the OSD would probably fix it, but you should grab a core dump first.)
-Greg
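For anyone wanting to check the same symptom on their own cluster, here is a minimal sketch that probes both paths discussed above with a timeout, so a hung "ceph tell" does not block the shell forever. The helper names (probe, classify), the status strings, and the 10-second timeout are hypothetical choices for illustration, not anything from Ceph itself. It only wraps the two commands quoted in the thread, and "ceph daemon" has to be run on the host where the OSD lives.

```python
import subprocess
import sys


def probe(cmd, timeout_s):
    """Run cmd and return "ok", "failed", or "hung" (no exit within timeout_s)."""
    try:
        done = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising, so nothing is left behind.
        return "hung"
    except FileNotFoundError:
        # The binary (e.g. ceph) is not installed on this machine.
        return "failed"
    return "ok" if done.returncode == 0 else "failed"


def classify(tell_status, daemon_status):
    """Turn the two probe results into a rough, non-authoritative hint."""
    if tell_status == "hung" and daemon_status == "ok":
        return "messenger path stuck, admin socket alive: possible deadlock"
    if tell_status == "ok" and daemon_status == "ok":
        return "both paths respond"
    return "inconclusive"


if __name__ == "__main__":
    # OSD id defaults to 6 as in the thread; the timeout is an arbitrary assumption.
    osd = sys.argv[1] if len(sys.argv) > 1 else "6"
    tell = probe(["ceph", "tell", f"osd.{osd}", "version"], 10.0)
    daemon = probe(["ceph", "daemon", f"osd.{osd}", "version"], 10.0)
    print(f"tell={tell} daemon={daemon}: {classify(tell, daemon)}")
```

If this reports the "possible deadlock" case, that matches the situation described above, and it would be the point to capture a core dump of the OSD process before restarting it.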