This is ressurection of my old RFC patch for dirty-set accounting cgroup [1] Now it's merged into memory cgroup and got bandwidth controller as a bonus. That shows alternative solution: less accurate but much less monstrous than accurate page-based dirty-set controller from Tejun Heo. Memory overhead: +1 pointer into struct address_space. Perfomance overhead is almost zero, no new locks added. Idea is stright forward: link each inode to some cgroup when first dirty page appers and account all dirty pages to it. Writeback is implemented as single per-bdi writeback work which writes only inodes which belong to memory cgroups where amount of dirty memory is beyond thresholds. Third patch adds trick for handling shared inodes which have dirty pages from several cgroups: it marks whole inode as shared and alters writeback filter for it. The rest is an example of bandwith and iops controller build on top of that. Design is completely original, I bet nobody ever used task-works for that =) [1] [PATCH RFC] fsio: filesystem io accounting cgroup http://marc.info/?l=linux-kernel&m=137331569501655&w=2 Patches also available here: https://github.com/koct9i/linux.git branch memcg_dirty_control --- Konstantin Khebnikov (6): memcg: inode-based dirty and writeback pages accounting memcg: dirty-set limiting and filtered writeback memcg: track shared inodes with dirty pages percpu_ratelimit: high-performance ratelimiting counter delay-injection: resource management via procrastination memcg: filesystem bandwidth controller block/blk-core.c | 2 fs/direct-io.c | 2 fs/fs-writeback.c | 22 ++ fs/inode.c | 1 include/linux/backing-dev.h | 1 include/linux/fs.h | 14 + include/linux/memcontrol.h | 27 +++ include/linux/percpu_ratelimit.h | 45 ++++ include/linux/sched.h | 7 + include/linux/writeback.h | 1 include/trace/events/sched.h | 7 + include/trace/events/writeback.h | 1 kernel/sched/core.c | 66 +++++++ kernel/sched/fair.c | 12 + lib/Makefile | 1 lib/percpu_ratelimit.c | 168 +++++++++++++++++ mm/memcontrol.c | 381 ++++++++++++++++++++++++++++++++++++++ mm/page-writeback.c | 32 +++ mm/readahead.c | 2 mm/truncate.c | 1 mm/vmscan.c | 4 21 files changed, 787 insertions(+), 10 deletions(-) create mode 100644 include/linux/percpu_ratelimit.h create mode 100644 lib/percpu_ratelimit.c -- Signature -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>