I send this dummy patch to describe a bit of work, maybe someone may have additional ideas, concepts and tips. In any case I'm glad I mapped huge EXT4 and data was synced to disk. Some concepts about compounds: - first_page moved to lru union to free place for buffers - refcounting changed - compound pages are "auto managed", page recovering is for backward compatibilit with 2.6 kernels, actully those kernels allowed getting tail page of count 0, but at eye glance moving few times around 0 could cause dangling pointer bug Compound view. In distinction to huge pages and THP, file system compound pages are really loosely treated, as a main difference there is no implication huge page => huge pmd, huge page may exist and may have no huge mappings at all. Each page is managed almost like stand alone, have own count, mapcount, dirty bit etc. It can't be added to any LRU nor list, because list_head is shared with compound metadata. Read / write locking of compound. Splitting may be dequeued this is to prevent deadlocks, "legacy" code will probably start with normal page locked, and then try to lock compound, for splitting purposes this may cause deadlocks (actually this flag was not included in faulting and enywhere else, but should be). Still there is no defragmentation daemon nor anything simillar, this behaviour is forced by MAP_HUGETLB. Things not made: * kswapd & co. not tested. * mlock not fixed, fix will cover get_user_pages & follow_user_pages. * fork, page_mkclean, mlock, not fixed. * dropping caches = bug. * migration not checked * shmfs - writeback for reclaim should split, simple to make, but ext4 experiments should go first (syncing) * no huge COW mapping allowed. * code cleaning from all printk... Signed-off-by: Radosław Smogura <mail@xxxxxxxxxx> --- mm/filemap.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/mm/filemap.c b/mm/filemap.c index f050209..7174fff 100644 --- a/mm/filemap.c +++ b/mm/filemap.c @@ -1783,7 +1783,7 @@ int filemap_fault_huge(struct vm_area_struct *vma, struct vm_fault *vmf) int ret = VM_FAULT_LOCKED; error = vma->vm_ops->fault(vma, vmf); - /* XXX Repeatable flags in __do fault etc. */ + /* XXX Repeatable flags in __do fault etc. */ if (error & (VM_FAULT_ERROR | VM_FAULT_NOPAGE | VM_FAULT_RETRY | VM_FAULT_NOHUGE)) { return error; -- 1.7.3.4 -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/ Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>