Thanks. So I'm going to continue looking for the cause of these IO errors.

-- dan

On Fri, Jun 22, 2018 at 3:22 PM Sergey Malinin <hell@xxxxxxxxxxx> wrote:
>
> From http://docs.ceph.com/docs/mimic/rados/troubleshooting/troubleshooting-pg/ :
>
> "Now 1 knows that these object exist, but there is no live ceph-osd who has a copy. In this case, IO to those objects will block, and the cluster will hope that the failed node comes back soon; this is assumed to be preferable to returning an IO error to the user."
>
> On 22.06.2018, at 16:16, Dan van der Ster <dan@xxxxxxxxxxxxxx> wrote:
>
> Hi all,
>
> Quick question: does an IO with an unfound object result in an IO
> error or should the IO block?
>
> During a jewel to luminous upgrade some PGs passed through a state
> with unfound objects for a few seconds. And this seems to match the
> times when we had a few IO errors on RBD attached volumes.
>
> Wondering what is the correct behaviour here...
>
> Cheers, Dan

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com