On Mon, Jun 13, 2022 at 08:30:29PM +0800, juanfengpy@xxxxxxxxx wrote: > From: caelli <caelli@xxxxxxxxxxx> This name/address does not match what you are sending it from, and I do not think this is how you sign legal documents right? For that reason alone, I can't take this :( > > We have met a hang on pty device, the reader was blocking > at epoll on master side, the writer was sleeping at wait_woken > inside n_tty_write on slave side, and the write buffer on > tty_port was full, we found that the reader and writer would > never be woken again and blocked forever. > > The problem was caused by a race between reader and kworker: > n_tty_read(reader): n_tty_receive_buf_common(kworker): > |room = N_TTY_BUF_SIZE - (ldata->read_head - tail) > |room <= 0 > copy_from_read_buf()| > n_tty_kick_worker() | > |ldata->no_room = true > > After writing to slave device, writer wakes up kworker to flush > data on tty_port to reader, and the kworker finds that reader > has no room to store data so room <= 0 is met. At this moment, > reader consumes all the data on reader buffer and call > n_tty_kick_worker to check ldata->no_room which is false and > reader quits reading. Then kworker sets ldata->no_room=true > and quits too. > > If write buffer is not full, writer will wake kworker to flush data > again after following writes, but if write buffer is full and writer > goes to sleep, kworker will never be woken again and tty device is > blocked. > > This problem can be solved with a check for read buffer size inside > n_tty_receive_buf_common, if read buffer is empty and ldata->no_room > is true, a call to n_tty_kick_worker is necessary to keep flushing > data to reader. > > Signed-off-by: caelli <caelli@xxxxxxxxxxx> > --- > Previous threads: > https://lore.kernel.org/all/CAPmgiULo4h8bOrzL+XJ5Pndw0kz80fBPfH_KNLx3c5j-Yj04SA@xxxxxxxxxxxxxx/t/ > > I corrected some format problems as recommended and switched client to git send-email, > which may be ok. And subject is changed from 'tty: fix a possible hang on tty device' to > 'tty: fix hang on tty device with no_room set' to make subject more obvious. Please properly version your patches like the documentation explains how to, so we know what has changed from previous versions. Otherwise they all look identical to us. thanks, greg k-h