HI Marc, Gentle reminder! Are you happy with the open comment's disposition? I can send a next version of patch if we have a closure on current set of comments. Thanks, Ramesh > -----Original Message----- > From: Ramesh Shanmugasundaram > Sent: 01 April 2016 13:49 > To: Ramesh Shanmugasundaram <ramesh.shanmugasundaram@xxxxxxxxxxxxxx>; > wg@xxxxxxxxxxxxxx; robh+dt@xxxxxxxxxx; pawel.moll@xxxxxxx; > mark.rutland@xxxxxxx; ijc+devicetree@xxxxxxxxxxxxxx; galak@xxxxxxxxxxxxxx; > corbet@xxxxxxx > Cc: linux-renesas-soc@xxxxxxxxxxxxxxx; devicetree@xxxxxxxxxxxxxxx; linux- > can@xxxxxxxxxxxxxxx; netdev@xxxxxxxxxxxxxxx; linux-doc@xxxxxxxxxxxxxxx; > geert+renesas@xxxxxxxxx; Chris Paterson <Chris.Paterson2@xxxxxxxxxxx> > Subject: RE: [PATCH v2] can: rcar_canfd: Add Renesas R-Car CAN FD driver > > Hi Marc, > > Thanks for your time & review comments. > > > -----Original Message----- > (...) > > > +#define RCANFD_NAPI_WEIGHT 8 /* Rx poll quota */ > > > + > > > +#define RCANFD_NUM_CHANNELS 2 > > > +#define RCANFD_CHANNELS_MASK 0x3 /* Two channels max */ > > > > (BIT(RCANFD_NUM_CHANNELS) - 1 > > OK > > > > > > + > > > +/* Rx FIFO is a global resource of the controller. There are 8 such > > FIFOs > > > + * available. Each channel gets a dedicated Rx FIFO (i.e.) the > > > + channel > (...) > > > +#define RCANFD_CMFIFO_CFDLC(x) (((x) & 0xf) << 28) > > > +#define RCANFD_CMFIFO_CFPTR(x) (((x) & 0xfff) << 16) > > > +#define RCANFD_CMFIFO_CFTS(x) (((x) & 0xff) << 0) > > > + > > > +/* Global Test Config register */ > > > +#define RCANFD_GTSTCFG_C0CBCE BIT(0) > > > +#define RCANFD_GTSTCFG_C1CBCE BIT(1) > > > + > > > +#define RCANFD_GTSTCTR_ICBCTME BIT(0) > > > + > > > +/* AFL Rx rules registers */ > > > +#define RCANFD_AFLCFG_SETRNC0(x) (((x) & 0xff) << 24) > > > +#define RCANFD_AFLCFG_SETRNC1(x) (((x) & 0xff) << 16) > > > > What about something like: > > > > #define RCANFD_AFLCFG_SETRNC(n, x) (((x) & 0xff) << (24 - n * 8)) > > > > This will save some if()s in the code > > Nice :-). Will update. > > > > > > +#define RCANFD_AFLCFG_GETRNC0(x) (((x) >> 24) & 0xff) > > > +#define RCANFD_AFLCFG_GETRNC1(x) (((x) >> 16) & 0xff) > > > + > > > +#define RCANFD_AFL_PAGENUM(entry) ((entry) / 16) > (...) > > > +#define rcar_canfd_read(priv, offset) \ > > > + readl(priv->base + (offset)) > > > +#define rcar_canfd_write(priv, offset, val) \ > > > + writel(val, priv->base + (offset)) > > > +#define rcar_canfd_set_bit(priv, reg, val) \ > > > + rcar_canfd_update(val, val, priv->base + (reg)) > > > +#define rcar_canfd_clear_bit(priv, reg, val) \ > > > + rcar_canfd_update(val, 0, priv->base + (reg)) > > > +#define rcar_canfd_update_bit(priv, reg, mask, val) \ > > > + rcar_canfd_update(mask, val, priv->base + (reg)) > > > > please use static inline functions instead of defines. > > OK. > > > > > > + > > > +static void rcar_canfd_get_data(struct canfd_frame *cf, > > > + struct rcar_canfd_channel *priv, u32 off) > > > > Please use "struct rcar_canfd_channel *priv" as first argument, struct > > canfd_frame *cf as second. Remove off, as the offset is already > > defined by the channel. > > I'll re-order priv, cf as you mentioned. I'll leave "off" arg as it is > because it is based on FIFO number of channel + mode (CAN only or CANFD > only). Otherwise, I will have to add another check inside this function > for mode. > > > > +{ > > > + u32 i, lwords; > > > + > > > + lwords = cf->len / sizeof(u32); > > > + if (cf->len % sizeof(u32)) > > > + lwords++; > > > > Use DIV_ROUND_UP() instead of open coding it. > > Agreed. Thanks. > > > > > > + for (i = 0; i < lwords; i++) > > > + *((u32 *)cf->data + i) = > > > + rcar_canfd_read(priv, off + (i * sizeof(u32))); } > > > + > > > +static void rcar_canfd_put_data(struct canfd_frame *cf, > > > + struct rcar_canfd_channel *priv, u32 off) > > > > same here > > Yes (same as _get_data) > > > > > > +{ > > > + u32 i, j, lwords, leftover; > > > + u32 data = 0; > > > + > > > + lwords = cf->len / sizeof(u32); > > > + leftover = cf->len % sizeof(u32); > > > + for (i = 0; i < lwords; i++) > > > + rcar_canfd_write(priv, off + (i * sizeof(u32)), > > > + *((u32 *)cf->data + i)); > > > > Here you don't convert the endianess... > > Yes > > > > + > > > + if (leftover) { > > > + u8 *p = (u8 *)((u32 *)cf->data + i); > > > + > > > + for (j = 0; j < leftover; j++) > > > + data |= p[j] << (j * 8); > > > > ...here you do an implicit endianess conversion. "data" is little > > endian, while p[j] is big endian. > > Not sure I got the question correctly. > Controller expectation of data bytes in 32bit register is bits[7:0] = > byte0, bits[15:8] = byte1 and so on - little endian. > For e.g. if cf->data points to byte stream H'112233445566 (cf->data[0] = > 0x11), first rcar_canfd_write will write 0x44332211 value to register. Yes > the host cpu is assumed little endian. In leftover case, data will be > 0x00006655 - again little endian. > I think I should remove this leftover logic and just mask the unused bits > to zero as cf->data is pre-allocated for max length anyway. > > p[j] is a byte?? Am I missing something? > > > > > > + rcar_canfd_write(priv, off + (i * sizeof(u32)), data); > > > + } > > > > Have you tested to send CAN frames with len != n*4 against a different > > controller? > > Yes, with Vector VN1630A. > > > > +} > > > + > > > +static void rcar_canfd_tx_failure_cleanup(struct net_device *ndev) > > > +{ > > > + u32 i; > > > + > > > + for (i = 0; i < RCANFD_FIFO_DEPTH; i++) > > > + can_free_echo_skb(ndev, i); > > > +} > > > + > (...) > > > +static void rcar_canfd_tx_done(struct net_device *ndev) { > > > + struct rcar_canfd_channel *priv = netdev_priv(ndev); > > > + struct net_device_stats *stats = &ndev->stats; > > > + u32 sts; > > > + unsigned long flags; > > > + u32 ch = priv->channel; > > > + > > > + do { > > > > You should iterare over all pending CAN frames: > > > > > for (/* nix */; (priv->tx_head - priv->tx_tail) > 0; priv- > > >tx_tail++) { > > > > Yes, current code does iterate over pending tx'ed frames. If we use this > for loop semantics, we may have to protect the whole loop with no real > benefit. > > > > > > + u8 unsent, sent; > > > + > > > + sent = priv->tx_tail % RCANFD_FIFO_DEPTH; > > > > and check here, if that packet has really been tramsitted. Exit the > > loop otherweise. > > We are here because of tx_done and hence no need to check tx done again > for the first iteration. Hence the do-while loop. Checks further down > takes care of the need for more iterations/pending tx'ed frames. > > > > > > + stats->tx_packets++; > > > + stats->tx_bytes += priv->tx_len[sent]; > > > + priv->tx_len[sent] = 0; > > > + can_get_echo_skb(ndev, sent); > > > + > > > + spin_lock_irqsave(&priv->tx_lock, flags); > > > > What does the tx_lock protect? The tx path is per channel, isn't it? > > You are right - tx path is per channel. In tx path, head & tail are also > used to determine when to stop/wakeup netif queue. They are incremented & > compared in different contexts to make this decision. Per channel tx_lock > protects this critical section. > > > > > > + priv->tx_tail++; > > > + sts = rcar_canfd_read(priv, > > > + RCANFD_CFSTS(ch, RCANFD_CFFIFO_IDX)); > > > + unsent = RCANFD_CMFIFO_CFMC(sts); > > > + > > > + /* Wake producer only when there is room */ > > > + if (unsent != RCANFD_FIFO_DEPTH) > > > + netif_wake_queue(ndev); > > > > Move the netif_wake_queue() out of the loop. > > With the tx logic mentioned above, I think keeping it in the loop seems > better. For e.g. say cpu1 pumps 8 frames from an app loop and cpu0 handles > tx_done interrupt handling: cpu1 would have stopped the queue because FIFO > is full -> cpu0 gets tx_done interrupt and by the time it checks "unsent" > there could be one or more frames transmitted by device (i.e.) there would > be more space in fifo. It is better to wakeup the netif queue then and > there so that app running on cpu1 can pump more. If we move it out of the > loop we wake up the queue only in the end. Have I missed anything? > > > > > > + > > > + if (priv->tx_head - priv->tx_tail <= unsent) { > > > + spin_unlock_irqrestore(&priv->tx_lock, flags); > > > + break; > > > + } > > > + spin_unlock_irqrestore(&priv->tx_lock, flags); > > > + > > > + } while (1); > > > + > > > + /* Clear interrupt */ > > > + rcar_canfd_write(priv, RCANFD_CFSTS(ch, RCANFD_CFFIFO_IDX), > > > + sts & ~RCANFD_CMFIFO_CFTXIF); > > > + can_led_event(ndev, CAN_LED_EVENT_TX); } > > > + > > > + if (cf->can_id & CAN_RTR_FLAG) > > > + id |= RCANFD_CMFIFO_CFRTR; > > > + > > > + rcar_canfd_write(priv, RCANFD_F_CFID(ch, RCANFD_CFFIFO_IDX), > > > + id); > > > + ptr = RCANFD_CMFIFO_CFDLC(can_len2dlc(cf->len)); > > > > ptr usually means pointer, better call it dlc. > > I used the register name "ptr". OK, will change it do dlc. > > > > > > + rcar_canfd_write(priv, RCANFD_F_CFPTR(ch, RCANFD_CFFIFO_IDX), > > > + ptr); > > > + > (...) > > > + can_put_echo_skb(skb, ndev, priv->tx_head % RCANFD_FIFO_DEPTH); > > > + > > > + spin_lock_irqsave(&priv->tx_lock, flags); > > > + priv->tx_head++; > > > + > > > + /* Start Tx: Write 0xff to CFPC to increment the CPU-side > > > + * pointer for the Common FIFO > > > + */ > > > + rcar_canfd_write(priv, RCANFD_CFPCTR(ch, RCANFD_CFFIFO_IDX), > > > +0xff); > > > + > > > + /* Stop the queue if we've filled all FIFO entries */ > > > + if (priv->tx_head - priv->tx_tail >= RCANFD_FIFO_DEPTH) > > > + netif_stop_queue(ndev); > > > > Please move the check of stop_queue, before starting the send. > > OK. > > > > > > + > > > + spin_unlock_irqrestore(&priv->tx_lock, flags); > > > + return NETDEV_TX_OK; > > > +} > > > + > (...) > > > +{ > > > + struct rcar_canfd_channel *priv = > > > + container_of(napi, struct rcar_canfd_channel, napi); > > > + int num_pkts; > > > + u32 sts; > > > + u32 ch = priv->channel; > > > + u32 ridx = ch + RCANFD_RFFIFO_IDX; > > > + > > > + for (num_pkts = 0; num_pkts < quota; num_pkts++) { > > > + sts = rcar_canfd_read(priv, RCANFD_RFSTS(ridx)); > > > + /* Clear interrupt bit */ > > > + if (sts & RCANFD_RFFIFO_RFIF) > > > + rcar_canfd_write(priv, RCANFD_RFSTS(ridx), > > > + sts & ~RCANFD_RFFIFO_RFIF); > > > + > > > + /* Check FIFO empty condition */ > > > + if (sts & RCANFD_RFFIFO_RFEMP) > > > + break; > > > + > > > + rcar_canfd_rx_pkt(priv); > > > > This sequence looks strange. You first conditionally ack the interrupt > > then you check for empty fifo, then read the CAN frame. I would assume > > that you first check if there's a CAN frame, read it and then clear > > the interrupt. > > Yes, I shall re-arrange the sequence as you mentioned. > > > > > > + } > > > + > > > + /* All packets processed */ > > > + if (num_pkts < quota) { > > > + napi_complete(napi); > > > + /* Enable Rx FIFO interrupts */ > > > + rcar_canfd_set_bit(priv, RCANFD_RFCC(ridx), > > RCANFD_RFFIFO_RFIE); > (...) > > > + for_each_set_bit(ch, &gpriv->channels_mask, RCANFD_NUM_CHANNELS) { > > > + err = rcar_canfd_channel_probe(gpriv, ch); > > > + if (err) > > > + goto fail_channel; > > > + } > > > > Should the CAN IP core be clocked the whole time? What about shuting > > down the clock and enabling it on ifup? > > The fCAN clock is enabled only on ifup of one of the channels. However, > the peripheral clock is enabled during probe to bring the controller to > Global Operational mode. This clock cannot be turned off with the register > values & mode retained. > > > > + > > > + platform_set_drvdata(pdev, gpriv); > > > + dev_info(&pdev->dev, "global operational state (clk %d)\n", > > > + gpriv->clock_select); > (...) > > > + rcar_canfd_reset_controller(gpriv); > > > + rcar_canfd_disable_global_interrupts(gpriv); > > > + > > > + for_each_set_bit(ch, &gpriv->channels_mask, RCANFD_NUM_CHANNELS) { > > > + priv = gpriv->ch[ch]; > > > + if (priv) { > > > > This should always be true. > > I agree. I thought I cleaned this up but it's in my local commits :-( > > > > > > + rcar_canfd_disable_channel_interrupts(priv); > > > + unregister_candev(priv->ndev); > > > + netif_napi_del(&priv->napi); > > > + free_candev(priv->ndev); > > > > Please make use of rcar_canfd_channel_remove(), as you already have > > the function. > > Yes. > > Thanks, > Ramesh