This patch adds the new user-api argument structure intended for set_mempolicy2 and mbind2. struct mpol_args { /* Basic mempolicy settings */ unsigned short mode; unsigned short mode_flags; unsigned long *pol_nodes; unsigned long pol_maxnodes; /* get_mempolicy2: policy information (e.g. next interleave node) */ int policy_node; /* get_mempolicy2: memory range policy */ unsigned long addr; int addr_node; /* all operations: policy home node */ unsigned long home_node; /* mbind2: address ranges to apply the policy */ const struct iovec __user *vec; size_t vlen; }; This structure is intended to be extensible as new mempolicy extensions are added. For example, set_mempolicy_home_node was added to allow vma mempolicies to have a preferred/home node assigned. This structure allows the addition of that setting at the time the mempolicy is set, rather than requiring additional calls to modify the policy. Another suggested extension is to allow mbind2 to operate on multiple memory ranges with a single call. mbind presently operates on a single (address, length) tuple. It was suggested that mbind2 should operate on an iovec, which allows many memory ranges to have the same mempolicy applied to it with a single system call. Full breakdown of arguments as of this patch: mode: Mempolicy mode (MPOL_DEFAULT, MPOL_INTERLEAVE) mode_flags: Flags previously or'd into mode in set_mempolicy (e.g.: MPOL_F_STATIC_NODES, MPOL_F_RELATIVE_NODES) pol_nodes: Policy nodemask pol_maxnodes: Max number of nodes in the policy nodemask policy_node: for get_mempolicy2. Returns extended information about a policy that was previously reported by passing MPOL_F_NODE to get_mempolicy. Instead of overriding the mode value, simply add a field. addr: for get_mempolicy2. Used with MPOL_F_ADDR to run get_mempolicy against the vma the address belongs to instead of the task. addr_node: for get_mempolicy2. Returns the node the address belongs to. Previously get_mempolicy() would override the output value of (mode) if MPOL_F_ADDR and MPOL_F_NODE were set. Instead, we extend mpol_args to do this by default if MPOL_F_ADDR is set and do away with MPOL_F_NODE. vec/vlen: Used by mbind2 to apply the mempolicy to all address ranges described by the iovec. Suggested-by: Frank van der Linden <fvdl@xxxxxxxxxx> Suggested-by: Vinicius Tavares Petrucci <vtavarespetr@xxxxxxxxxx> Suggested-by: Hasan Al Maruf <Hasan.Maruf@xxxxxxx> Signed-off-by: Gregory Price <gregory.price@xxxxxxxxxxxx> Co-developed-by: Vinicius Tavares Petrucci <vtavarespetr@xxxxxxxxxx> Signed-off-by: Vinicius Tavares Petrucci <vtavarespetr@xxxxxxxxxx> --- .../admin-guide/mm/numa_memory_policy.rst | 31 +++++++++++++++++++ include/uapi/linux/mempolicy.h | 18 +++++++++++ 2 files changed, 49 insertions(+) diff --git a/Documentation/admin-guide/mm/numa_memory_policy.rst b/Documentation/admin-guide/mm/numa_memory_policy.rst index b7b8d3dd420f..6d645519c2c1 100644 --- a/Documentation/admin-guide/mm/numa_memory_policy.rst +++ b/Documentation/admin-guide/mm/numa_memory_policy.rst @@ -488,6 +488,37 @@ closest to which page allocation will come from. Specifying the home node overri the default allocation policy to allocate memory close to the local node for an executing CPU. +Extended Mempolicy Arguments:: + + struct mpol_args { + /* Basic mempolicy settings */ + unsigned short mode; + unsigned short mode_flags; + unsigned long *pol_nodes; + unsigned long pol_maxnodes; + + /* get_mempolicy2: policy node information */ + int policy_node; + + /* get_mempolicy2: memory range policy */ + unsigned long addr; + int addr_node; + + /* mbind2: policy home node */ + unsigned long home_node; + + /* mbind2: address ranges to apply the policy */ + struct iovec *vec; + size_t vlen; + }; + +The extended mempolicy argument structure is defined to allow the mempolicy +interfaces future extensibility without the need for additional system calls. + +The core arguments (mode, mode_flags, pol_nodes, and pol_maxnodes) apply to +all interfaces relative to their non-extended counterparts. Each additional +field may only apply to specific extended interfaces. See the respective +extended interface man page for more details. Memory Policy Command Line Interface ==================================== diff --git a/include/uapi/linux/mempolicy.h b/include/uapi/linux/mempolicy.h index 1f9bb10d1a47..e6b50903047c 100644 --- a/include/uapi/linux/mempolicy.h +++ b/include/uapi/linux/mempolicy.h @@ -27,6 +27,24 @@ enum { MPOL_MAX, /* always last member of enum */ }; +struct mpol_args { + /* Basic mempolicy settings */ + unsigned short mode; + unsigned short mode_flags; + unsigned long *pol_nodes; + unsigned long pol_maxnodes; + /* get_mempolicy: policy node information */ + int policy_node; + /* get_mempolicy: memory range policy */ + unsigned long addr; + int addr_node; + /* mbind2: policy home node */ + int home_node; + /* mbind2: address ranges to apply the policy */ + struct iovec *vec; + size_t vlen; +}; + /* Flags for set_mempolicy */ #define MPOL_F_STATIC_NODES (1 << 15) #define MPOL_F_RELATIVE_NODES (1 << 14) -- 2.39.1