Re: OSD:s failing out after upgrade to 9.2.0 on Ubuntu 14.04

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Did some more logging and for some reason it seems like I do have some problem communicating with my OSDs:

 

“ceph tell osd.* version” gives two different errors that might shed some light on what is going on…

 

osd.0: Error ENXIO: problem getting command descriptions from osd.0

osd.0: problem getting command descriptions from osd.0

osd.3: Error ENXIO: problem getting command descriptions from osd.3

osd.3: problem getting command descriptions from osd.3

2015-11-16 21:37:17.737671 7fafa05f6700  0 -- 172.16.0.202:0/1780708646 >> 172.16.0.202:7001/8193 pipe(0x7fafa4068e40 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7fafa4062920).fault

osd.4: Error EINTR: problem getting command descriptions from osd.4

osd.4: problem getting command descriptions from osd.4

 

I have some quite large logs that I was going through and noticed that I got this in the osd-log:

2015-11-16 20:27:43.432210 7f6b356a0700  1 osd.0 39502 osdmap indicates one or more pre-v0.94.4 hammer OSDs is running

 

That made me check what version the OSDs were, but I get that log entry is because it cannot check the version of the other OSDs at all.

 

From: ceph-users [mailto:ceph-users-bounces@xxxxxxxxxxxxxx] On Behalf Of Claes Sahlström
Sent: den 16 november 2015 17:51
To: 'ceph-users' <ceph-users@xxxxxxxxxxxxxx>
Subject: Re: [ceph-users] OSD:s failing out after upgrade to 9.2.0 on Ubuntu 14.04

 

After some time 4 more OSD:s from one server dropped out and it now seems that only 3 OSD:s from 1 server (I have 3 servers each with 4 OSD:s) are marked as up the other 9 are down. I have shut the servers down for now since I will not have any time to work with this until the weekend.

 

Any suggestion of how to get the system online again are most welcome. The OSD disks have not crashed and I hope to be able to get them to join the cluster again and get the data back.

 

I am not sure what I did wrong when doing the upgrade from Hammer to Infernalis, at first I thought that it was that I didn´t remove the ceph user and group when upgrading, but now I have no clue, I do not think I actually had a ceph-user before Infernalis.

 

Any help or suggestions what I can try to get the system online is most welcome.

 

Thanks,

Claes

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux