Re: [ceph-users] stalls caused by scrub on jewel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Sage,

Could you please clarify: do we need to set nodeep-scrub also, or does
this somehow only affect the (shallow) scrub?

(Note that deep scrubs will start when the deep_scrub_interval has
passed, even with noscrub set).

Cheers, Dan


On Tue, Nov 15, 2016 at 11:35 PM, Sage Weil <sweil@xxxxxxxxxx> wrote:
> Hi everyone,
>
> There was a regression in jewel that can trigger long OSD stalls during
> scrub.  How long the stalls are depends on how many objects are in your
> PGs, how fast your storage device is, and what is cached, but in at least
> one case they were long enough that the OSD internal heartbeat check
> failed and it committed suicide (120 seconds).
>
> The workaround for now is to simply
>
>  ceph osd set noscrub
>
> as the bug is only triggered by scrub.  A fix is being tested and will be
> available shortly.
>
> If you've seen any kind of weird latencies or slow requests on jewel, I
> suggest setting noscrub and seeing if they go away!
>
> The tracker bug is
>
>  http://tracker.ceph.com/issues/17859
>
> Big thanks to Yoann Moulin for helping track this down!
>
> sage
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux