Re: [PATCH 4/4] bcache: fix I/O significant decline while backend devices registering

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2018/4/12 2:38 PM, tang.junhui@xxxxxxxxxx wrote:
> From: Tang Junhui <tang.junhui@xxxxxxxxxx>
> 
> I attached several backend devices in the same cache set, and produced lots
> of dirty data by running small rand I/O writes in a long time, then I
> continue run I/O in the others cached devices, and stopped a cached device,
> after a mean while, I register the stopped device again, I see the running
> I/O in the others cached devices dropped significantly, sometimes even
> jumps to zero.
> 
> In currently code, bcache would traverse each keys and btree node to count
> the dirty data under read locker, and the writes threads can not get the
> btree write locker, and when there is a lot of keys and btree node in the
> registering device, it would last several seconds, so the write I/Os in
> others cached device are blocked and declined significantly.
> 
> In this patch, when a device registering to a ache set, which exist others
> cached devices with running I/Os, we get the amount of dirty data of the
> device in an incremental way, and do not block other cached devices all the
> time.
> 
> Signed-off-by: Tang Junhui <tang.junhui@xxxxxxxxxx>
> ---
>  drivers/md/bcache/writeback.c | 26 +++++++++++++++++++++++++-
>  1 file changed, 25 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/md/bcache/writeback.c b/drivers/md/bcache/writeback.c
> index 56a3788..2467d3a 100644
> --- a/drivers/md/bcache/writeback.c
> +++ b/drivers/md/bcache/writeback.c
> @@ -488,10 +488,14 @@ static int bch_writeback_thread(void *arg)
>  }
>  

Hi Junhui,

>  /* Init */
> +#define INIT_KEYS_MIN			500000
> +#define INIT_KEYS_SLEEP_TIME		100
>  

How about renaming the above macros to something like
INIT_KEYS_WAIT_COUNTER and INIT_KEYS_SLEEP_MS ?

Also I suggested to rename io_inflight in previous patch review comments.

For rested part of this patch looks good to me.

Reviewed-by: Coly Li <colyli@xxxxxxx>

Thanks.

Coly Li

>  struct sectors_dirty_init {
>  	struct btree_op	op;
>  	unsigned	inode;
> +	size_t		count;
> +	struct bkey	start;
>  };
>  
>  static int sectors_dirty_init_fn(struct btree_op *_op, struct btree *b,
> @@ -506,18 +510,38 @@ static int sectors_dirty_init_fn(struct btree_op *_op, struct btree *b,
>  		bcache_dev_sectors_dirty_add(b->c, KEY_INODE(k),
>  					     KEY_START(k), KEY_SIZE(k));
>  
> +	op->count++;
> +	if (atomic_read(&b->c->io_inflight) &&
> +	   !(op->count % INIT_KEYS_MIN)) {
> +		bkey_copy_key(&op->start, k);
> +		return -EAGAIN;
> +	}
> +
>  	return MAP_CONTINUE;
>  }
>  
>  void bch_sectors_dirty_init(struct bcache_device *d)
>  {
>  	struct sectors_dirty_init op;
> +	int ret;
>  
>  	bch_btree_op_init(&op.op, -1);
>  	op.inode = d->id;
> +	op.count = 0;
> +	op.start = KEY(op.inode, 0, 0);
>  
> -	bch_btree_map_keys(&op.op, d->c, &KEY(op.inode, 0, 0),
> +	do {
> +		ret = bch_btree_map_keys(&op.op, d->c, &op.start,
>  			   sectors_dirty_init_fn, 0);
> +		if (ret == -EAGAIN)
> +			schedule_timeout_interruptible(
> +				msecs_to_jiffies(INIT_KEYS_SLEEP_TIME));
> +		else if (ret < 0) {
> +			pr_warn("sectors dirty init failed, ret=%d!", ret);
> +			break;
> +		}
> +	} while (ret == -EAGAIN);
> +
>  }
>  
>  void bch_cached_dev_writeback_init(struct cached_dev *dc)
> 




[Index of Archives]     [Linux RAID]     [Linux SCSI]     [Linux ATA RAID]     [IDE]     [Linux Wireless]     [Linux Kernel]     [ATH6KL]     [Linux Bluetooth]     [Linux Netdev]     [Kernel Newbies]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Device Mapper]

  Powered by Linux