Re: [PATCH] Fix for corrupted ceph cluster

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 6 Mar 2014, Daniel Poelzleithner wrote:
> On 02/11/2014 07:48 AM, Daniel Poelzleithner wrote:
> 
> > I wrote a small patch that ignores object_trim requests when he does not
> > find the context of this request.
> > We have a node that fails to start permanently and there is no way to
> > get all nodes back up.
> [...]
> > This is regarding bug http://tracker.ceph.com/issues/6101
> 
> The patch now ran for 2 weeks and the 4th node is working again.
> I think this patch is safe to apply, but not fixing the underlying problem.
> Some state in ceph causes the delete event to be triggered every some
> seconds and causes a log entry to be generated.
> 
> Do you need more informations to find the cause ? This definitely is
> some wired internal state and is no race condition.

Can you try, instead of the discard_event, to do

    post_event(SnapTrim());
    return transit< WaitingOnReplicas >();

and see if that lets it move past the bad entry?

Thanks-
sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux