Re: ceph tell osd.6 version : hang

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Oct 12, 2014 at 7:46 AM, Loic Dachary <loic@xxxxxxxxxxx> wrote:
> Hi,
>
> On a 0.80.6 cluster the command
>
> ceph tell osd.6 version
>
> hangs forever. I checked that it establishes a TCP connection to the OSD, raised the OSD debug level to 20 and I do not see
>
> https://github.com/ceph/ceph/blob/firefly/src/osd/OSD.cc#L4991
>
> in the logs. All other OSDs answer to the same "version" command as they should. And ceph daemon osd.6 version on the machine running OSD 6 responds as it should. There also are an ever growing number of slow requests on this OSD. But not error in the logs. In other words, except for taking forever to answer any kind of request the OSD looks fine.
>
> Another OSD running on the same machine is behaving well.
>
> Any idea what that behaviour relates to ?

What commands have you run? The admin socket commands don't require
nearly as many locks, nor do they go through the same event loops that
messages do. You might have found a deadlock or something. (In which
case just restarting the OSD would probably fix it, but you should
grab a core dump first.)
-Greg
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux