On Thu, Aug 01, 2019 at 01:23:59AM +0300, Arseny Maslennikov wrote: > On Tue, Jul 30, 2019 at 06:19:40PM +0200, Greg Kroah-Hartman wrote: > > On Tue, Jun 25, 2019 at 07:11:53PM +0300, Arseny Maslennikov wrote: > > > If the three termios local flags isig, icanon, iexten are enabled > > > and the local flag nokerninfo is disabled for a tty governed > > > by the n_tty line discipline, then on receiving the keyboard status > > > character n_tty will generate a status message and write it out to > > > the tty before sending SIGINFO to the tty's foreground process group. > > > > > > This kerninfo line contains information about the current system load > > > as well as some properties of "the most interesting" process in the > > > tty's current foreground process group, namely: > > > - its PID as seen inside its deepest PID namespace; > > > * the whole process group ought to be in a single PID namespace, > > > so this is actually deterministic > > > - its saved command name truncated to 16 bytes (task_struct::comm); > > > * at the time of writing TASK_COMM_LEN == 16 > > > - its state and some related bits, procps-style; > > > - for S and D: its symbolic wait channel, if available; or a short > > > description for other process states instead; > > > - its user, system and real rusage time values; > > > - its resident set size (as well as the high watermark) in kilobytes. > > > > Why is this really all needed as we have the SysRq handlers that report > > all of this today? > > Different use-cases have different needs; SysRq is targeted at a different > audience; see below. > > > > The "most interesting" process is chosen as follows: > > > - runnables over everything > > > - uninterruptibles over everything else > > > - among 2 runnables pick the biggest utime + stime > > > - any unresolved ties are decided in favour of greatest PID. > > > > This does not feel like something that the tty core code should be doing > > at all. > > Yes, this selection part is quite clumsy. In defense of it, one could > argue that we already have the whole n_tty implemented in kernel-space. One could argue that :) > One way we could get rid of this is to display a summarized statistic > for the whole pgrp: pgid, oldest real time, cumulative utime and stime, > cumulative memory usage. Would this be more acceptable? Are there any > other ideas? Given that I really think you are just making something up here that no one really is needing for their workflow, but would just be "cool to have", I say do whatever you think is right. And there is nothing wrong with "cool to have" things, I'm not trying to dismiss this, it's just that all new features come with the "you must support this until the end of time" requirement that we must balance it with. > > > While the kerninfo line is not very useful for debugging the kernel > > > itself, since we have much more powerful debugging tools, it still gives > > > the user behind the terminal some meaningful feedback to a VSTATUS that > > > works even if no processes respond. > > > > That's what SysRq is for. If there's a specific set of values that we > > don't currently report in that facility, why not just add the > > information there? It's much simpler and "safer" that way. > > SysRq is intended for the person either administrating the system to be used in > emergency (e.g. f for the oom kill, the famous s-u-b combo also comes to > mind) or debugging the kernel, and it indeed does a much better job for > those purposes. In both use-cases mentioned the person has access to > the system console, where the sysrq button handlers produce all their > output, if any, and to either a physical keyboard / serial console or to > /proc/sysrq-trigger, whose mode is 0200 (writable by uid 0 only). > > The use-case for this is different: the ^T-line as proposed by this > patch is for the user that interacts with a system through a terminal, who > wants to be informed not about the whole system (sort of what SysRq-t > tells you), but about what they run on that particular tty. Ok, fair enough, although if you just add a new sysrq option for "what is running on this tty", would that help resolve this? > This is much less about "why does my system/kernel seem to hang?" or > exposing low-level internals (registers, hrtimers, locks, ...), and more > about "is my SSH terminal session unresponsive?" and "I ran a command, > it doesn't finish, how's it doing?". > e.g. A user might want to know if their SSH connection is alive without > interrupting anything, while having no access both to SysRq and console, > and no one in fg pgrp actually handles SIGINFO. If you have access to a tty, you should have access to sysrq, right? > SysRq is system-wide, whereas this is per-terminal and only cares about > one tty which the status char is pressed at and its foreground pgrp > (most likely it's the foreground shell job). > > I hope this is clear enough. It is, yes. My big objection is the crazy code I point out above, as well as the "create a totally new interface when we might be able to use an existing one" that you need to convince me is really required :) thanks, greg k-h