Re: Ceph backfill problem

Matthew H <matthew.heler@xxxxxxxxxxx> · Fri, 21 Sep 2018 03:04:36 +0000

Without knowing more about the underlying hardware, my guess is that you are hitting some kind of I/O resource constraint. Are your journals colocated with the OSD data or on separate devices? How fast are your backend OSD storage devices?

You may also want to look at setting the norebalance flag.
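
For reference, here is a minimal sketch of how the flags could be set and cleared from an admin node. The `run` wrapper and the two function names are hypothetical helpers of my own, not part of Ceph. The script only prints the commands unless you set DRY_RUN=0, so treat it as a starting point rather than a tested procedure for your cluster.

```shell
#!/bin/sh
# Dry-run by default: commands are printed, not executed.
# Set DRY_RUN=0 to run them against a live cluster (assumes the ceph
# CLI and an admin keyring are available on this host).
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: echo the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Ask the cluster to hold off on rebalancing and backfill.
pause_data_movement() {
    run ceph osd set norebalance
    run ceph osd set nobackfill
}

# Clear the flags again once the cluster has settled.
resume_data_movement() {
    run ceph osd unset nobackfill
    run ceph osd unset norebalance
}

pause_data_movement
```

After setting them, `ceph osd dump | grep flags` should show which flags are actually in effect. That is worth checking if backfill appears to resume on its own.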

Good luck!

> On Sep 20, 2018, at 19:52, Chen Allen <uilcxr@xxxxxxxxx> wrote:
> 
> Hi there,
> 
> Has anyone experienced the following?
> Two OSD servers went down. After bringing both servers back up, I brought 52 OSDs in with a weight of just 0.05, but this caused a huge backfilling load: I saw many blocked requests and a number of PGs stuck inactive, and some servers were impacted. I stopped the backfilling by setting the nobackfill flag, and everything went back to normal.
> The strangest thing happened after 2 hours: backfilling suddenly started again even though the nobackfill flag was set, causing so many blocked requests that we had to reweight the 52 OSDs to 0 to stabilize the storage.
> 
> I am not sure why backfill started again. If anyone has any idea, please comment.
> 
> Thanks so much.
> Allen
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com