David Laight <David.Laight@xxxxxxxxxx> writes: > From: Oleg Nesterov [mailto:oleg@xxxxxxxxxx] >> Sent: 11 June 2019 19:56 >> On 06/10, Eric W. Biederman wrote: >> > >> > Personally I don't think anyone sane would intentionally depend on this >> > and I don't think there is a sufficiently reliable way to depend on this >> > by accident that people would actually be depending on it. >> >> Agreed. >> >> As I said I like these changes and I see nothing wrong. To me they fix the >> current behaviour, or at least make it more consistent. >> >> But perhaps this should be documented in the changelog? To make it clear >> that this change was intentional. > > What happens if you run the test program I posted yesterday after the changes? > > It looks like pselect() and epoll_pwait() operated completely differently. > pselect() would always calls the signal handlers. > epoll_pwait() only calls them when EINTR is returned. > So changing epoll_pwait() and pselect() to work the same way > is bound to break some applications. That is not the change we are discussing. We are looking at making pselect and epoll_pwait act a little more like sigtimedwait. In particular we are discussiong signals whose handler is SIG_DFL and whose default disposition is to kill the process, such as SIGINT. When those signals are delivered and they are not blocked, we take an optimized path in complete_signal and start the process tear down. That early start of process tear down does not happen if the signal is blocked or it happens to be in real_blocked (from sigtimedwait). This matters is the small time window where the signal is received while the process has temporarily unblocked the signal, and the signal is not detected by the code and blocked and oridinarily would not be delivered until next time because of restore_sigmask_unless. If the tear down has started early. Even if we would not have returned the process normally the signal can kill the process. AKA epoll_pwait can today result in it's caller being killed without -EINTR being returned. My change fixes that behavior and disables the early process tear down for those signals, by having real_blocked match the set of signals that are normally blocked by the process. The result should be the signal will have to wait for the next call that temporarily unblocked the process. The real benefit is that that the code becomes more comprehensible. It is my patch that titled: "signal: Always keep real_blocked in sync with blocked" that causes that to happen. Eric