On Wed, Aug 11, 2010 at 05:45:13PM +0200, Heinz Mauelshagen wrote: [..] > > [..] > > > +/* Decide about throttling (ie. deferring bios). */ > > > +static int throttle(struct throttle_c *tc, struct bio *bio) > > > +{ > > > + int rw = (bio_data_dir(bio) == WRITE); > > > + unsigned bps; /* Bytes per second. */ > > > + > > > + smp_rmb(); > > > + bps = tc->params.bs[rw]; > > > + if (bps) { > > > + unsigned size; > > > + struct account *ac = &tc->account; > > > + struct ac_rw *ac_rw = ac->rw + rw; > > > + > > > + if (time_after(jiffies, ac_rw->end_jiffies)) > > > + /* Measure time exceeded. */ > > > + account_reset(rw, tc); > > > + else if (test_bit(rw, &ac->flags)) > > > + /* In case we're throttled already. */ > > > + return 1; > > > + > > > + /* Account I/O size. */ > > > + size = ac_rw->size + bio->bi_size; > > > + if (size > bps) { > > > + /* Hit kilobytes per second threshold. */ > > > + set_bit(rw, &ac->flags); > > > + return 1; > > > > If bio->bi_size is greate than bps, will I always keep on throttling > > and hang? > > bps needs to be set larger than the bio maximum size expected with the > current implementatio, right. The algorithm needs changing to cope with > bi_size larger than bps (see below). > > > > > [..] > > > +/* Map a throttle io. */ > > > +static int throttle_map(struct dm_target *ti, struct bio *bio, > > > + union map_info *map_context) > > > +{ > > > + int r, rw = (bio_data_dir(bio) == WRITE); > > > + struct throttle_c *tc = ti->private; > > > + struct ac_rw *ac_rw = tc->account.rw + rw; > > > + > > > + mutex_lock(&ac_rw->mutex); > > > + do { > > > + r = throttle(tc, bio); > > > + if (r) { > > > + long end = ac_rw->end_jiffies, j = jiffies; > > > + > > > + /* Wait till next second when KB/s reached. */ > > > + if (j < end) > > > + schedule_timeout_uninterruptible(end - j); > > > + } > > > > So a thread is blocked if it crossed the IO rate. There is no such > mechanism > > to take the bio, statsh away somewhere and dispatch it to disk later. > The > > way request queue descriptors work. > > Right, the aim for this testing target was to keep it as simple as > possible to solve the purpose of simulating low bandwidth transports or > varying device throughput properties. > > Cheap approaches to tackle this issue include to set ti->split_io based > on bps < BIO_MAX_SIZE (units ignored) in the ctr/message interface, > to prohibit bps smaller than BIO_MAX_SIZE altogether or to change the > throttle() algorithm to allow for > 1s measurement periods based on > bi_size maximum vs. bps ratios. > > The 1st one obviously causing more bio splits, the 2nd one prohibiting > small bandwidth simulation and the last one causing io peeks, which is > actually not what I wanted. Can't we just wait for enough number of seconds to allow bigger bio to pass. So if bio size is 4MB and rate limit is 1MB/s then wait for 4 seconds. That way there are no splits, and no io peeks? Thanks Vivek -- dm-devel mailing list dm-devel@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/dm-devel