Re: [PATCH] mm: introduce sysctl file to flush per-cpu vmstat statistics

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 3 Dec 2020, Thomas Gleixner wrote:

> >> The current CPU isolation is a best effort approach and I agree that for
> >> more strict isolation modes we need to be able to enforce that and hunt
> >> down offenders and think about them one by one.
> >
> > There are two apprahces actually to make the OS quiet. One is the best
> > effort approach which is more like the current NOHZ one with additional
> > actions to flush things. The other is the strict approach were one wants a
> > guarantee that the OS does not do anything at all.
>
> And here the consensus stops again :)
>
> The point is that between the relaxed best effort / heuristics based
> scenario and the 'user space task asks for absolute silence' scenario is
> a huge difference:

The two approaches are:

1. Enforce silence and abort if the application tries anything that
jeopardizes that silence (f.e. a syscall that cannot directly complete,
a major page fault etc).

This mode needs to work like an on/off switch so that the application can
exit the  mode and do regular system calls. Only specially designed
software will be able to use this mode since it is so restrictive and care
needs to be taken to enable/disable this mode.

2. Silence now (dump all caches, correlate counters, all pending work
finishes). This is a one shot approach. Anything later that causes counter
increments, cache population etc etc may occur but will then cause
additional latency required to re-enable the caches, statistics threads
and so on and so on.

The silence now function will be used when the app waits for an
event that may occur shortly and the reaction time to that event needs to
be as low latency as possible. F.e. The app may be in a complex polling
loop that should not be interrupted. Once an event is detected syscalls
etc could potentially occur (depending on the event). When the app goes
back to the polling loop it will do another "silence now" call.

>   Is this really a black and white decision?

These are two different modes of usage.


>   And as we know that there are quite some shades of grey, there is lots
>   of choice and we need to come up with solutions for delegating the
>   policy decision to the user/admin and not just provide a off/on knob.

One of these choices is not an on-off knob.

> Again: I fundamentaly disagree with the proposed task isolation patches
> approach as they leave no choice at all.

There are no degres of gray here. I dont understand why you have these
concerns.

> pattern of the application, e.g.
>
>  1     read_data_set() <- involving syscalls/OS obviously

??? You cannot use syscalls for high speed or low latency I/O!!!

>  2     compute_set()   <- let me alone
>  3     save_data_set() <- involving syscalls/OS obviously

Again saving data may not be possible through the kernel since syscalls
may have too much overhead and latency.

>        repeat the above...

There is a fundamental misunderstanding here. This is not primarily about
compute but about I/O. In particular I/O that does not involve the kernel.
RDMA or things like DPDK, SPDK or other low hardware level things.

Typically a user space poll loop checks numerous memory locations related
to this I/O or shared memory areas where other cpus interact with the
thread that wants to be OS noise free.

> Summary: The problem to be solved cannot be restricted to
>
>     self_defined_important_task(OWN_WORLD);
>
> Policy is not a binary on/off problem. It's manifold across all levels
> of the stack and only a kernel problem when it comes down to the last
> line of defence.

This a clearly defined set of functions and I am not sure how policy fits
into that.





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux