On Thu, 6 Mar 2014, Daniel Poelzleithner wrote: > On 02/11/2014 07:48 AM, Daniel Poelzleithner wrote: > > > I wrote a small patch that ignores object_trim requests when he does not > > find the context of this request. > > We have a node that fails to start permanently and there is no way to > > get all nodes back up. > [...] > > This is regarding bug http://tracker.ceph.com/issues/6101 > > The patch now ran for 2 weeks and the 4th node is working again. > I think this patch is safe to apply, but not fixing the underlying problem. > Some state in ceph causes the delete event to be triggered every some > seconds and causes a log entry to be generated. > > Do you need more informations to find the cause ? This definitely is > some wired internal state and is no race condition. Can you try, instead of the discard_event, to do post_event(SnapTrim()); return transit< WaitingOnReplicas >(); and see if that lets it move past the bad entry? Thanks- sage -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html