Thanks, I'm working on fixing the peering objects and will attempt a recovery of the bad PGs tomorrow.
The corrupt OSD they were on was marked 'lost', so I expected Ceph wouldn't try to peer with it anymore. Anyway, I do have the data, at least.
-Ben
On Fri, Mar 4, 2016 at 1:04 AM, Luis Periquito <periquito@xxxxxxxxx> wrote:
You should really fix the peering objects.
So far, what I've seen in Ceph is that it prefers data integrity over
availability. If it thinks it can't keep everything working properly,
it tends to stop (i.e. blocked requests), so I don't believe there's
a way to do this.
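
As a starting point for finding the PGs that need fixing, here is a minimal Python sketch that picks the stuck, peering PGs out of `ceph health detail` text. The line format it expects is an assumption based on what releases from around this time print and may differ on your version; the function name `stuck_peering_pgs` and the sample text are made up for illustration.

```python
import re

# Assumed line shape (may vary between Ceph releases), e.g.:
#   pg 3.1a is stuck inactive for 5021.33, current state peering, last acting [4,7]
STUCK_RE = re.compile(
    r"^pg (?P<pgid>\S+) is stuck (?P<kind>\w+) "
    r"(?:for [\d.]+|since forever), "
    r"current state (?P<state>[^,\s]+), "
    r"last acting \[(?P<acting>[\d,]*)\]"
)


def stuck_peering_pgs(health_detail):
    """Return {pgid: [acting OSD ids]} for stuck PGs whose state includes 'peering'."""
    result = {}
    for line in health_detail.splitlines():
        m = STUCK_RE.match(line.strip())
        if not m:
            continue  # summary lines and anything in an unexpected format
        # States are '+'-joined flags, e.g. 'down+peering'.
        if "peering" not in m.group("state").split("+"):
            continue
        acting = [int(x) for x in m.group("acting").split(",") if x]
        result[m.group("pgid")] = acting
    return result


# Hypothetical sample text, not taken from a real cluster.
SAMPLE = """\
HEALTH_WARN 2 pgs peering; 2 pgs stuck inactive
pg 3.1a is stuck inactive for 5021.33, current state peering, last acting [4,7]
pg 3.2f is stuck unclean for 5021.33, current state active+degraded, last acting [2,5]
pg 5.c is stuck inactive since forever, current state down+peering, last acting [7]
"""

if __name__ == "__main__":
    for pgid, osds in stuck_peering_pgs(SAMPLE).items():
        print(pgid, osds)
```

Each PG id it returns can then be inspected with `ceph pg <pgid> query` to see which OSDs peering is still waiting on.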
On Fri, Mar 4, 2016 at 1:04 AM, Ben Hines <bhines@xxxxxxxxx> wrote:
> I have a few bad objects in Ceph which are 'stuck on peering'. Client
> requests that hit them build up and eventually stop all traffic to the OSD.
> I can temporarily open up traffic again by resetting the OSD (which aborts
> those requests).
>
> Is there a way to tell Ceph to cancel/abort these 'slow requests' once they
> reach a certain age, rather than letting them build up and block
> everything?
>
> -Ben
>
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>