On Fri, Jul 10, 2020 at 11:19 AM Doug Anderson <dianders@xxxxxxxxxxxx> wrote: > > Hi, > > On Fri, Jul 10, 2020 at 10:39 AM Evan Green <evgreen@xxxxxxxxxxxx> wrote: > > > > On Fri, Jun 26, 2020 at 1:01 PM Douglas Anderson <dianders@xxxxxxxxxxxx> wrote: > > > > > > The geni serial driver had the rather sketchy hack in it where it > > > would adjust the number of bytes per RX FIFO word from 4 down to 1 if > > > it detected that CONFIG_CONSOLE_POLL was enabled (for kgdb) and this > > > was a console port (defined by the kernel directing output to this > > > port via the "console=" command line argument). > > > > > > The problem with that sketchy hack is that it's possible to run kgdb > > > over a serial port even if it isn't used for console. > > > > > > Let's avoid the hack by simply handling the 4-bytes-per-FIFO word case > > > for kdb. We'll have to have a (very small) cache but that should be > > > fine. > > > > > > A nice side effect of this patch is that an agetty (or similar) > > > running on this port is less likely to drop characters. We'll > > > have roughly 4 times the RX FIFO depth than we used to now. > > > > > > NOTE: the character cache here isn't shared between the polling API > > > and the non-polling API. That means that, technically, the polling > > > API could eat a few extra bytes. This doesn't seem to pose a huge > > > problem in reality because we'll only get several characters per FIFO > > > word if those characters are all received at nearly the same time and > > > we don't really expect non-kgdb characters to be sent to the same port > > > as kgdb at the exact same time we're exiting kgdb. > > > > > > ALSO NOTE: we still have the sketchy hack for setting the number of > > > bytes per TX FIFO word in place, but that one is less bad. kgdb > > > doesn't have any problem with this because it always just sends 1 byte > > > at a time and waits for it to finish. The TX FIFO hack is only really > > > needed for console output. In any case, a future patch will remove > > > that hack, too. > > > > > > Signed-off-by: Douglas Anderson <dianders@xxxxxxxxxxxx> > > > --- > > > > > > drivers/tty/serial/qcom_geni_serial.c | 80 ++++++++++++++++++--------- > > > 1 file changed, 55 insertions(+), 25 deletions(-) > > > > > > diff --git a/drivers/tty/serial/qcom_geni_serial.c b/drivers/tty/serial/qcom_geni_serial.c > > > index 0300867eab7a..4610e391e886 100644 > > > --- a/drivers/tty/serial/qcom_geni_serial.c > > > +++ b/drivers/tty/serial/qcom_geni_serial.c > > > @@ -103,11 +103,13 @@ > > > #define DEFAULT_IO_MACRO_IO2_IO3_MASK GENMASK(15, 4) > > > #define IO_MACRO_IO2_IO3_SWAP 0x4640 > > > > > > -#ifdef CONFIG_CONSOLE_POLL > > > -#define CONSOLE_RX_BYTES_PW 1 > > > -#else > > > -#define CONSOLE_RX_BYTES_PW 4 > > > -#endif > > > +struct qcom_geni_private_data { > > > + /* NOTE: earlycon port will have NULL here */ > > > + struct uart_driver *drv; > > > + > > > + u32 poll_cached_bytes; > > > + unsigned int poll_cached_bytes_cnt; > > > +}; > > > > > > struct qcom_geni_serial_port { > > > struct uart_port uport; > > > @@ -129,6 +131,8 @@ struct qcom_geni_serial_port { > > > int wakeup_irq; > > > bool rx_tx_swap; > > > bool cts_rts_swap; > > > + > > > + struct qcom_geni_private_data private_data; > > > }; > > > > > > static const struct uart_ops qcom_geni_console_pops; > > > @@ -264,8 +268,9 @@ static bool qcom_geni_serial_poll_bit(struct uart_port *uport, > > > unsigned int baud; > > > unsigned int fifo_bits; > > > unsigned long timeout_us = 20000; > > > + struct qcom_geni_private_data *private_data = uport->private_data; > > > > > > - if (uport->private_data) { > > > + if (private_data->drv) { > > > port = to_dev_port(uport, uport); > > > baud = port->baud; > > > if (!baud) > > > @@ -331,23 +336,42 @@ static void qcom_geni_serial_abort_rx(struct uart_port *uport) > > > } > > > > > > #ifdef CONFIG_CONSOLE_POLL > > > + > > > static int qcom_geni_serial_get_char(struct uart_port *uport) > > > { > > > - u32 rx_fifo; > > > + struct qcom_geni_private_data *private_data = uport->private_data; > > > u32 status; > > > + u32 word_cnt; > > > + int ret; > > > + > > > + if (!private_data->poll_cached_bytes_cnt) { > > > + status = readl(uport->membase + SE_GENI_M_IRQ_STATUS); > > > + writel(status, uport->membase + SE_GENI_M_IRQ_CLEAR); > > > > > > - status = readl(uport->membase + SE_GENI_M_IRQ_STATUS); > > > - writel(status, uport->membase + SE_GENI_M_IRQ_CLEAR); > > > + status = readl(uport->membase + SE_GENI_S_IRQ_STATUS); > > > + writel(status, uport->membase + SE_GENI_S_IRQ_CLEAR); > > > > > > - status = readl(uport->membase + SE_GENI_S_IRQ_STATUS); > > > - writel(status, uport->membase + SE_GENI_S_IRQ_CLEAR); > > > + status = readl(uport->membase + SE_GENI_RX_FIFO_STATUS); > > > + word_cnt = status & RX_FIFO_WC_MSK; > > > + if (!word_cnt) > > > + return NO_POLL_CHAR; > > > > > > - status = readl(uport->membase + SE_GENI_RX_FIFO_STATUS); > > > - if (!(status & RX_FIFO_WC_MSK)) > > > - return NO_POLL_CHAR; > > > + if (word_cnt == 1 && (status & RX_LAST)) > > > > I forget how the partial word snapping works. Are you sure you want > > word_cnt == 1? I see qcom_geni_serial_handle_rx() looks at RX_LAST > > independently as long as word_cnt != 0. I'm worried the hardware > > allows one FIFO entry with say 2 bytes in it and RX_LAST set, but then > > also piles new stuff in the FIFO behind it, so that word_cnt can be > > >1. > > So I guess one point of evidence that the logic I have there is OK is > that it works. :-P > > ...but also looking closer. Maybe first it's important to understand > the REALLY WEIRD protocol that geni serial uses. This was discovered > through a bunch of trial and error long ago when poking at how the > driver worked. > > When you're reading from geni it essentially breaks things into > packets. If you're midway through reading a packet of data and more > bytes come in then geni will hide them from you until you read the > whole packet. I'm not totally sure the exact conditions for when it > decides to make a packet out of the data, but it's not important for > this discussion. > > So when you read "RX_FIFO_WC" it tells you the total number of FIFO > words in the current packet. That number will only ever go down. A > packet is made up of some number of whole words plus one word that > could be a whole word or could be a partial word. So if "RX_FIFO_WC" > says 4 it means you've got 3 whole words (3 * 4 = 12 bytes) and one > word that might be partial. You can find out about that one partial > word (always the last word in the FIFO) by reading "RX_LAST" and > "RX_LAST_BYTE_VALID". > > Once you finally read the last word in the FIFO then geni can tell you > about the next packet of data. > > OK, so hopefully that made sense? I see, yes. In your description, the status register is a single storage element with a decrementing counter that stores the state for this current packet. Then it's a mystery how/where additional packets are buffered. In my mental model, RX_FIFO_WC tells you how many elements are occupied in the FIFO, regardless of packet boundaries. RX_LAST and byte count are actually little tags in each FIFO element. So a FIFO element internally consists of 4 data bytes + 3 status bits. Then that portion of the status register shows you the status/tag bits for the element at the top of the FIFO. I can't remember why I think it works this way, but it does explain how the hardware can queue/store multiple RX packets, and how it would handle overflows. > > So qcom_geni_serial_handle_rx() is trying to read ALL bytes. It first > figures out the total count of bytes and then reads them all. That's > why it needs to look at RX_LAST all the time. Also of note: RX_LAST > only ever applies to the last word in the FIFO. If it was possible > for the word count to grow _before_ fully clearing out the FIFO then > it would be a race and software would never be able to tell which byte > RX_LAST applied to. In my world, I was imagining word count could grow, since it showed the current fullness of the FIFO, but RX_LAST and BYTE_COUNT_MSK showed the tag bits for the current element. It looked like qcom_geni_serial_handle_rx() was coded for that since it seemed to only check that FIFO_WC was non-zero. If you only check RX_LAST, and remove the word_cnt == 1, then the code is correct in both of our mental models of the hardware, right? But TBH I've basically fabricated my mental model out of the air. So I'll defer to you if you're sure it works the way you've described. -Evan > > In the case of qcom_geni_serial_get_char() we don't want to read all > bytes because we don't have a place to put them. We only want 1 byte > but sometimes we're forced to read 4 because of the whole > 4-bytes-per-fifo-entry thing. So we read one FIFO entry. As long as > we're not reading the _last_ FIFO entry then geni won't start a new > packet. So if RX_FIFO_WC > 1 and we read a byte then it'll go down by > 1. If RX_FIFO_WC == 1 then we're reading the last entry and geni is > allowed to start its next packet.