Hi Oliver, Thanks :-) Actually it will be v5 patch set & I have sent it now. Marc comments were on old v2 patch. Thanks, Ramesh > -----Original Message----- > From: Oliver Hartkopp [mailto:socketcan@xxxxxxxxxxxx] > Sent: 28 April 2016 07:28 > To: Ramesh Shanmugasundaram <ramesh.shanmugasundaram@xxxxxxxxxxxxxx>; > wg@xxxxxxxxxxxxxx; robh+dt@xxxxxxxxxx; pawel.moll@xxxxxxx; > mark.rutland@xxxxxxx; ijc+devicetree@xxxxxxxxxxxxxx; galak@xxxxxxxxxxxxxx; > corbet@xxxxxxx; mkl@xxxxxxxxxxxxxx > Cc: linux-renesas-soc@xxxxxxxxxxxxxxx; devicetree@xxxxxxxxxxxxxxx; linux- > can@xxxxxxxxxxxxxxx; netdev@xxxxxxxxxxxxxxx; linux-doc@xxxxxxxxxxxxxxx; > geert+renesas@xxxxxxxxx; Chris Paterson <Chris.Paterson2@xxxxxxxxxxx> > Subject: Re: [PATCH v2] can: rcar_canfd: Add Renesas R-Car CAN FD driver > > Hello Ramesh, > > please send out a new v3 patchset to trigger the process again :-) > > Best regards, > Oliver > > On 04/13/2016 08:25 AM, Ramesh Shanmugasundaram wrote: > > HI Marc, > > > > Gentle reminder! > > Are you happy with the open comment's disposition? I can send a next > version of patch if we have a closure on current set of comments. > > > > Thanks, > > Ramesh > > > >> -----Original Message----- > >> From: Ramesh Shanmugasundaram > >> Sent: 01 April 2016 13:49 > >> To: Ramesh Shanmugasundaram <ramesh.shanmugasundaram@xxxxxxxxxxxxxx>; > >> wg@xxxxxxxxxxxxxx; robh+dt@xxxxxxxxxx; pawel.moll@xxxxxxx; > >> mark.rutland@xxxxxxx; ijc+devicetree@xxxxxxxxxxxxxx; > >> galak@xxxxxxxxxxxxxx; corbet@xxxxxxx > >> Cc: linux-renesas-soc@xxxxxxxxxxxxxxx; devicetree@xxxxxxxxxxxxxxx; > >> linux- can@xxxxxxxxxxxxxxx; netdev@xxxxxxxxxxxxxxx; > >> linux-doc@xxxxxxxxxxxxxxx; > >> geert+renesas@xxxxxxxxx; Chris Paterson <Chris.Paterson2@xxxxxxxxxxx> > >> Subject: RE: [PATCH v2] can: rcar_canfd: Add Renesas R-Car CAN FD > >> driver > >> > >> Hi Marc, > >> > >> Thanks for your time & review comments. > >> > >>> -----Original Message----- > >> (...) > >>>> +#define RCANFD_NAPI_WEIGHT 8 /* Rx poll quota */ > >>>> + > >>>> +#define RCANFD_NUM_CHANNELS 2 > >>>> +#define RCANFD_CHANNELS_MASK 0x3 /* Two channels max */ > >>> > >>> (BIT(RCANFD_NUM_CHANNELS) - 1 > >> > >> OK > >> > >>> > >>>> + > >>>> +/* Rx FIFO is a global resource of the controller. There are 8 > >>>> +such > >>> FIFOs > >>>> + * available. Each channel gets a dedicated Rx FIFO (i.e.) the > >>>> + channel > >> (...) > >>>> +#define RCANFD_CMFIFO_CFDLC(x) (((x) & 0xf) << 28) > >>>> +#define RCANFD_CMFIFO_CFPTR(x) (((x) & 0xfff) << 16) > >>>> +#define RCANFD_CMFIFO_CFTS(x) (((x) & 0xff) << 0) > >>>> + > >>>> +/* Global Test Config register */ > >>>> +#define RCANFD_GTSTCFG_C0CBCE BIT(0) > >>>> +#define RCANFD_GTSTCFG_C1CBCE BIT(1) > >>>> + > >>>> +#define RCANFD_GTSTCTR_ICBCTME BIT(0) > >>>> + > >>>> +/* AFL Rx rules registers */ > >>>> +#define RCANFD_AFLCFG_SETRNC0(x) (((x) & 0xff) << 24) > >>>> +#define RCANFD_AFLCFG_SETRNC1(x) (((x) & 0xff) << 16) > >>> > >>> What about something like: > >>> > >>> #define RCANFD_AFLCFG_SETRNC(n, x) (((x) & 0xff) << (24 - n * 8)) > >>> > >>> This will save some if()s in the code > >> > >> Nice :-). Will update. > >> > >>> > >>>> +#define RCANFD_AFLCFG_GETRNC0(x) (((x) >> 24) & 0xff) > >>>> +#define RCANFD_AFLCFG_GETRNC1(x) (((x) >> 16) & 0xff) > >>>> + > >>>> +#define RCANFD_AFL_PAGENUM(entry) ((entry) / 16) > >> (...) > >>>> +#define rcar_canfd_read(priv, offset) \ > >>>> + readl(priv->base + (offset)) > >>>> +#define rcar_canfd_write(priv, offset, val) \ > >>>> + writel(val, priv->base + (offset)) > >>>> +#define rcar_canfd_set_bit(priv, reg, val) \ > >>>> + rcar_canfd_update(val, val, priv->base + (reg)) > >>>> +#define rcar_canfd_clear_bit(priv, reg, val) \ > >>>> + rcar_canfd_update(val, 0, priv->base + (reg)) > >>>> +#define rcar_canfd_update_bit(priv, reg, mask, val) \ > >>>> + rcar_canfd_update(mask, val, priv->base + (reg)) > >>> > >>> please use static inline functions instead of defines. > >> > >> OK. > >> > >>> > >>>> + > >>>> +static void rcar_canfd_get_data(struct canfd_frame *cf, > >>>> + struct rcar_canfd_channel *priv, u32 off) > >>> > >>> Please use "struct rcar_canfd_channel *priv" as first argument, > >>> struct canfd_frame *cf as second. Remove off, as the offset is > >>> already defined by the channel. > >> > >> I'll re-order priv, cf as you mentioned. I'll leave "off" arg as it > >> is because it is based on FIFO number of channel + mode (CAN only or > >> CANFD only). Otherwise, I will have to add another check inside this > >> function for mode. > >> > >>>> +{ > >>>> + u32 i, lwords; > >>>> + > >>>> + lwords = cf->len / sizeof(u32); > >>>> + if (cf->len % sizeof(u32)) > >>>> + lwords++; > >>> > >>> Use DIV_ROUND_UP() instead of open coding it. > >> > >> Agreed. Thanks. > >> > >>> > >>>> + for (i = 0; i < lwords; i++) > >>>> + *((u32 *)cf->data + i) = > >>>> + rcar_canfd_read(priv, off + (i * sizeof(u32))); } > >>>> + > >>>> +static void rcar_canfd_put_data(struct canfd_frame *cf, > >>>> + struct rcar_canfd_channel *priv, u32 off) > >>> > >>> same here > >> > >> Yes (same as _get_data) > >> > >>> > >>>> +{ > >>>> + u32 i, j, lwords, leftover; > >>>> + u32 data = 0; > >>>> + > >>>> + lwords = cf->len / sizeof(u32); > >>>> + leftover = cf->len % sizeof(u32); > >>>> + for (i = 0; i < lwords; i++) > >>>> + rcar_canfd_write(priv, off + (i * sizeof(u32)), > >>>> + *((u32 *)cf->data + i)); > >>> > >>> Here you don't convert the endianess... > >> > >> Yes > >> > >>>> + > >>>> + if (leftover) { > >>>> + u8 *p = (u8 *)((u32 *)cf->data + i); > >>>> + > >>>> + for (j = 0; j < leftover; j++) > >>>> + data |= p[j] << (j * 8); > >>> > >>> ...here you do an implicit endianess conversion. "data" is little > >>> endian, while p[j] is big endian. > >> > >> Not sure I got the question correctly. > >> Controller expectation of data bytes in 32bit register is bits[7:0] = > >> byte0, bits[15:8] = byte1 and so on - little endian. > >> For e.g. if cf->data points to byte stream H'112233445566 > >> (cf->data[0] = 0x11), first rcar_canfd_write will write 0x44332211 > >> value to register. Yes the host cpu is assumed little endian. In > >> leftover case, data will be > >> 0x00006655 - again little endian. > >> I think I should remove this leftover logic and just mask the unused > >> bits to zero as cf->data is pre-allocated for max length anyway. > >> > >> p[j] is a byte?? Am I missing something? > >> > >>> > >>>> + rcar_canfd_write(priv, off + (i * sizeof(u32)), data); > >>>> + } > >>> > >>> Have you tested to send CAN frames with len != n*4 against a > >>> different controller? > >> > >> Yes, with Vector VN1630A. > >> > >>>> +} > >>>> + > >>>> +static void rcar_canfd_tx_failure_cleanup(struct net_device *ndev) > >>>> +{ > >>>> + u32 i; > >>>> + > >>>> + for (i = 0; i < RCANFD_FIFO_DEPTH; i++) > >>>> + can_free_echo_skb(ndev, i); > >>>> +} > >>>> + > >> (...) > >>>> +static void rcar_canfd_tx_done(struct net_device *ndev) { > >>>> + struct rcar_canfd_channel *priv = netdev_priv(ndev); > >>>> + struct net_device_stats *stats = &ndev->stats; > >>>> + u32 sts; > >>>> + unsigned long flags; > >>>> + u32 ch = priv->channel; > >>>> + > >>>> + do { > >>> > >>> You should iterare over all pending CAN frames: > >>> > >>>> for (/* nix */; (priv->tx_head - priv->tx_tail) > 0; priv- > >>>> tx_tail++) { > >>> > >> > >> Yes, current code does iterate over pending tx'ed frames. If we use > >> this for loop semantics, we may have to protect the whole loop with > >> no real benefit. > >> > >>> > >>>> + u8 unsent, sent; > >>>> + > >>>> + sent = priv->tx_tail % RCANFD_FIFO_DEPTH; > >>> > >>> and check here, if that packet has really been tramsitted. Exit the > >>> loop otherweise. > >> > >> We are here because of tx_done and hence no need to check tx done > >> again for the first iteration. Hence the do-while loop. Checks > >> further down takes care of the need for more iterations/pending tx'ed > frames. > >> > >>> > >>>> + stats->tx_packets++; > >>>> + stats->tx_bytes += priv->tx_len[sent]; > >>>> + priv->tx_len[sent] = 0; > >>>> + can_get_echo_skb(ndev, sent); > >>>> + > >>>> + spin_lock_irqsave(&priv->tx_lock, flags); > >>> > >>> What does the tx_lock protect? The tx path is per channel, isn't it? > >> > >> You are right - tx path is per channel. In tx path, head & tail are > >> also used to determine when to stop/wakeup netif queue. They are > >> incremented & compared in different contexts to make this decision. > >> Per channel tx_lock protects this critical section. > >> > >>> > >>>> + priv->tx_tail++; > >>>> + sts = rcar_canfd_read(priv, > >>>> + RCANFD_CFSTS(ch, RCANFD_CFFIFO_IDX)); > >>>> + unsent = RCANFD_CMFIFO_CFMC(sts); > >>>> + > >>>> + /* Wake producer only when there is room */ > >>>> + if (unsent != RCANFD_FIFO_DEPTH) > >>>> + netif_wake_queue(ndev); > >>> > >>> Move the netif_wake_queue() out of the loop. > >> > >> With the tx logic mentioned above, I think keeping it in the loop > >> seems better. For e.g. say cpu1 pumps 8 frames from an app loop and > >> cpu0 handles tx_done interrupt handling: cpu1 would have stopped the > >> queue because FIFO is full -> cpu0 gets tx_done interrupt and by the > time it checks "unsent" > >> there could be one or more frames transmitted by device (i.e.) there > >> would be more space in fifo. It is better to wakeup the netif queue > >> then and there so that app running on cpu1 can pump more. If we move > >> it out of the loop we wake up the queue only in the end. Have I missed > anything? > >> > >>> > >>>> + > >>>> + if (priv->tx_head - priv->tx_tail <= unsent) { > >>>> + spin_unlock_irqrestore(&priv->tx_lock, flags); > >>>> + break; > >>>> + } > >>>> + spin_unlock_irqrestore(&priv->tx_lock, flags); > >>>> + > >>>> + } while (1); > >>>> + > >>>> + /* Clear interrupt */ > >>>> + rcar_canfd_write(priv, RCANFD_CFSTS(ch, RCANFD_CFFIFO_IDX), > >>>> + sts & ~RCANFD_CMFIFO_CFTXIF); > >>>> + can_led_event(ndev, CAN_LED_EVENT_TX); } > >>>> + > >>>> + if (cf->can_id & CAN_RTR_FLAG) > >>>> + id |= RCANFD_CMFIFO_CFRTR; > >>>> + > >>>> + rcar_canfd_write(priv, RCANFD_F_CFID(ch, RCANFD_CFFIFO_IDX), > >>>> + id); > >>>> + ptr = RCANFD_CMFIFO_CFDLC(can_len2dlc(cf->len)); > >>> > >>> ptr usually means pointer, better call it dlc. > >> > >> I used the register name "ptr". OK, will change it do dlc. > >> > >>> > >>>> + rcar_canfd_write(priv, RCANFD_F_CFPTR(ch, RCANFD_CFFIFO_IDX), > >>>> + ptr); > >>>> + > >> (...) > >>>> + can_put_echo_skb(skb, ndev, priv->tx_head % > RCANFD_FIFO_DEPTH); > >>>> + > >>>> + spin_lock_irqsave(&priv->tx_lock, flags); > >>>> + priv->tx_head++; > >>>> + > >>>> + /* Start Tx: Write 0xff to CFPC to increment the CPU-side > >>>> + * pointer for the Common FIFO > >>>> + */ > >>>> + rcar_canfd_write(priv, RCANFD_CFPCTR(ch, RCANFD_CFFIFO_IDX), > >>>> +0xff); > >>>> + > >>>> + /* Stop the queue if we've filled all FIFO entries */ > >>>> + if (priv->tx_head - priv->tx_tail >= RCANFD_FIFO_DEPTH) > >>>> + netif_stop_queue(ndev); > >>> > >>> Please move the check of stop_queue, before starting the send. > >> > >> OK. > >> > >>> > >>>> + > >>>> + spin_unlock_irqrestore(&priv->tx_lock, flags); > >>>> + return NETDEV_TX_OK; > >>>> +} > >>>> + > >> (...) > >>>> +{ > >>>> + struct rcar_canfd_channel *priv = > >>>> + container_of(napi, struct rcar_canfd_channel, napi); > >>>> + int num_pkts; > >>>> + u32 sts; > >>>> + u32 ch = priv->channel; > >>>> + u32 ridx = ch + RCANFD_RFFIFO_IDX; > >>>> + > >>>> + for (num_pkts = 0; num_pkts < quota; num_pkts++) { > >>>> + sts = rcar_canfd_read(priv, RCANFD_RFSTS(ridx)); > >>>> + /* Clear interrupt bit */ > >>>> + if (sts & RCANFD_RFFIFO_RFIF) > >>>> + rcar_canfd_write(priv, RCANFD_RFSTS(ridx), > >>>> + sts & ~RCANFD_RFFIFO_RFIF); > >>>> + > >>>> + /* Check FIFO empty condition */ > >>>> + if (sts & RCANFD_RFFIFO_RFEMP) > >>>> + break; > >>>> + > >>>> + rcar_canfd_rx_pkt(priv); > >>> > >>> This sequence looks strange. You first conditionally ack the > >>> interrupt then you check for empty fifo, then read the CAN frame. I > >>> would assume that you first check if there's a CAN frame, read it > >>> and then clear the interrupt. > >> > >> Yes, I shall re-arrange the sequence as you mentioned. > >> > >>> > >>>> + } > >>>> + > >>>> + /* All packets processed */ > >>>> + if (num_pkts < quota) { > >>>> + napi_complete(napi); > >>>> + /* Enable Rx FIFO interrupts */ > >>>> + rcar_canfd_set_bit(priv, RCANFD_RFCC(ridx), > >>> RCANFD_RFFIFO_RFIE); > >> (...) > >>>> + for_each_set_bit(ch, &gpriv->channels_mask, > RCANFD_NUM_CHANNELS) { > >>>> + err = rcar_canfd_channel_probe(gpriv, ch); > >>>> + if (err) > >>>> + goto fail_channel; > >>>> + } > >>> > >>> Should the CAN IP core be clocked the whole time? What about shuting > >>> down the clock and enabling it on ifup? > >> > >> The fCAN clock is enabled only on ifup of one of the channels. > >> However, the peripheral clock is enabled during probe to bring the > >> controller to Global Operational mode. This clock cannot be turned > >> off with the register values & mode retained. > >> > >>>> + > >>>> + platform_set_drvdata(pdev, gpriv); > >>>> + dev_info(&pdev->dev, "global operational state (clk %d)\n", > >>>> + gpriv->clock_select); > >> (...) > >>>> + rcar_canfd_reset_controller(gpriv); > >>>> + rcar_canfd_disable_global_interrupts(gpriv); > >>>> + > >>>> + for_each_set_bit(ch, &gpriv->channels_mask, > RCANFD_NUM_CHANNELS) { > >>>> + priv = gpriv->ch[ch]; > >>>> + if (priv) { > >>> > >>> This should always be true. > >> > >> I agree. I thought I cleaned this up but it's in my local commits :-( > >> > >>> > >>>> + rcar_canfd_disable_channel_interrupts(priv); > >>>> + unregister_candev(priv->ndev); > >>>> + netif_napi_del(&priv->napi); > >>>> + free_candev(priv->ndev); > >>> > >>> Please make use of rcar_canfd_channel_remove(), as you already have > >>> the function. > >> > >> Yes. > >> > >> Thanks, > >> Ramesh