On Wed, Apr 12, 2017 at 3:00 AM, Laszlo Budai <laszlo@xxxxxxxxxxxxxxxx> wrote: > Hello, > > yesterday one of our compute nodes has recorded the following message for > one of the ceph connections: > > submit_message osd_op(client.28817736.0:690186 > rbd_data.15c046b11ab57b7.00000000000000c4 [read 2097152~380928] 3.6f81364a > ack+read+known_if_redirected e3617) v5 remote, 10.12.68.71:6818/6623, failed > lossy con, dropping message A read message, sent to the OSD at IP 10.12.68.71:6818/6623, is being dropped because the connection has somehow failed; since it's a "lossy" connection we don't need to remember the message and resend it. That failure could be an actual TCP/IP stack error; it could be because a different thread killed the connection and it's now closed. If you've just got one of these and didn't see other problems, it's innocuous — I expect the most common cause for this is an OSD getting marked down while IO is pending to it. :) -Greg > > Can someone "decode" the above message, or direct me to some document where > I could read more about it? > > We have ceph 0.94.10. > > Thank you, > Laszlo > _______________________________________________ > ceph-users mailing list > ceph-users@xxxxxxxxxxxxxx > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com