On Thu, May 28, 2020 at 9:19 PM Dafna Hirschfeld <dafna.hirschfeld@xxxxxxxxxxxxx> wrote: > > Hi, Tomasz, Laurent, and everyone > > On 21.05.20 12:38, Tomasz Figa wrote: > > Hi Dafna, > > > > On Thu, May 21, 2020 at 2:09 AM Laurent Pinchart > > <laurent.pinchart@xxxxxxxxxxxxxxxx> wrote: > >> > >> Hi Dafna, > >> > >> Thank you for the patch. > >> > >> On Tue, May 12, 2020 at 02:05:22PM +0200, Dafna Hirschfeld wrote: > >>> Reading the statistics registers might take too long > >>> to run inside the irq handler. Currently it is deferred > >>> to bottom half using workqueues. This patch replaces the > >>> workqueue with threaded interrupts for reading the > >>> statistics registers. > >> > >> Could you please explain why this is needed/desired ? Removal of the > >> kzalloc(GFP_ATOMIC) is very nice, but I'm sure there would have been > >> ways to avoid it while keeping a workqueue. I'm not claiming the patch > >> is bad, but I'd like to understand the reason. > >> > > > > Thanks a lot for working on this driver! > > > > I'll second what Laurent said. In general, a description of the patch > > should explain why a change is needed, e.g. what issues it fixes or > > what improvements it brings. > > So from what I understand, the correct way to handle bottom half is > using threaded interrupts since they run in higher priority. > In case of this statistics reading for example, we want to read the stats > as fast as possible once we recieve the interrupt. If we read the stats too > long after the interrupt was recieved, then the values in the stats registerd > might have changed by then. Does that make sense? > > I use the rockpi4 board to test the code and actually I did an experiment > of moving the code that reads the stats into the hard irq and it ran > fine. Maybe you know why the code is currently in a work queue and not > inside the hard irq? Also, the params struct is of size 5337 bytes and > the params subdevice can potentionaly run a lot of registers writing but > for some reasone the code of writing to the params registers still runs > inside the hard irq. > Actually I now recall that we already discussed this in the past with Ezequiel and Helen and I even measured the time needed to readout the statistics registers and it was consistently taking between 23 to 40 microseconds. We concluded that it could be just done in the hardirq handler. I've added you to that email thread. > > > > Also, would you mind adding me on CC for any patches for this driver? > > Since we use this driver in Chrome OS, I'd like to stay on top of any > > updates. Thanks in advance! > sure! > Thanks! [snip] > >>> -static void > >>> -rkisp1_stats_send_measurement(struct rkisp1_stats *stats, > >>> - struct rkisp1_isp_readout_work *meas_work) > >>> +irqreturn_t rkisp1_read_stats_threaded_irq(int irq, void *ctx) > >>> { > >>> + struct device *dev = ctx; > >>> + struct rkisp1_device *rkisp1 = dev_get_drvdata(dev); > >>> + struct rkisp1_stats *stats = &rkisp1->stats; > >>> + struct rkisp1_kstats_buffer *kstats_buf = NULL; > >>> struct rkisp1_stat_buffer *cur_stat_buf; > >>> - struct rkisp1_buffer *cur_buf = NULL; > >>> - unsigned int frame_sequence = > >>> - atomic_read(&stats->rkisp1->isp.frame_sequence); > >>> - u64 timestamp = ktime_get_ns(); > >>> unsigned long flags; > >>> - > >>> - if (frame_sequence != meas_work->frame_id) { > >>> - dev_warn(stats->rkisp1->dev, > >>> - "Measurement late(%d, %d)\n", > >>> - frame_sequence, meas_work->frame_id); > >>> - frame_sequence = meas_work->frame_id; > >>> - } > >>> + u64 timestamp = ktime_get_ns(); > >>> > >>> spin_lock_irqsave(&stats->stats_lock, flags); > >>> - /* get one empty buffer */ > >>> - if (!list_empty(&stats->stat)) { > >>> - cur_buf = list_first_entry(&stats->stat, > >>> - struct rkisp1_buffer, queue); > >>> - list_del(&cur_buf->queue); > >>> + if (!stats->is_streaming) { > >>> + spin_unlock_irqrestore(&stats->stats_lock, flags); > >>> + return IRQ_HANDLED; > >>> + } > >>> + if (list_empty(&stats->stat)) { > >>> + spin_unlock_irqrestore(&stats->stats_lock, flags); > >>> + WARN("%s: threaded irq waked but there are no buffers", > >>> + __func__); > >>> + return IRQ_HANDLED; > >>> + } > >>> + kstats_buf = list_first_entry(&stats->stat, > >>> + struct rkisp1_kstats_buffer, buff.queue); > >>> + > >>> + /* > >>> + * each waked irq thread reads exactly one ready statistics > >>> + * so it is a bug if no statistics are ready > >>> + */ > >> > >> What happens if this function takes too long to run ? With the > >> workqueue, if the work gets delayed once in a while, the work items will > >> accumulate, and all of them will be handled eventually. Here it seems > >> that an IRQ could get lost. > > If the irs runs and there are no available buffers then indeed we lost the interrupt. > You think it might be a problem if userspace expect an associated stats buffer > for every recieved frame? > Also with workqueue we will lose the stats if we don't have an available buffer > when the workqueue runs. I believe this hardware doesn't do any buffering internally, so if one doesn't read the statistics in time, the values are lost, because they are overridden with the ones coming from the next frame. So actually hardirq might be a better context to do the read-out indeed. Best regards, Tomasz _______________________________________________ Linux-rockchip mailing list Linux-rockchip@xxxxxxxxxxxxxxxxxxx http://lists.infradead.org/mailman/listinfo/linux-rockchip