On Tue, Sep 26, 2023 at 03:43:18PM +0300, Ivaylo Dimitrov wrote: > On 26.09.23 г. 10:16 ч., Sean Young wrote: > > On Mon, Sep 25, 2023 at 07:06:44PM +0300, Ivaylo Dimitrov wrote: > > > On 1.09.23 г. 17:18 ч., Sean Young wrote: > > > > The ir-rx51 is a pwm-based TX driver specific to the N900. This can be > > > > handled entirely by the generic pwm-ir-tx driver, and in fact the > > > > pwm-ir-tx driver has been compatible with ir-rx51 from the start. > > > > > > > > > > Unfortunately, pwm-ir-tx does not work on n900. My investigation shows that > > > for some reason usleep_range() sleeps for at least 300-400 us more than what > > > interval it is requested to sleep. I played with cyclictest from rt-tests > > > package and it gives similar results - increasing the priority helps, but I > > > was not able to make it sleep for less that 300 us in average. I tried > > > cpu_latency_qos_add_request() in pwm-ir-tx, but it made no difference. > > > > > > I get similar results on motorola droid4 (OMAP4), albeit there average sleep > > > is in 200-300 us range, which makes me believe that either OMAPs have issues > > > with hrtimers or the config we use has some issue which leads to scheduler > > > latency. Or, something else... > > > > The pwm-ir-tx driver does suffer from this problem, but I was under the > > impression that the ir-rx51 has the same problem. > > > > Could you elaborate on the "pwm-ir-tx driver does suffer from this problem"? > Where do you see that? So on a raspberry pi (model 3b), if I use the pwm-ir-tx driver, I get random delays of up to 100us. It's a bit random and certainly depends on the load. I'm measuring using a logic analyzer. There have been reports by others on different machines with random delays and/or transmit failures (as in the receiver occassionally fails to decode the IR). I usually suggest they use the gpio-ir-tx driver, which does work as far as I know (the signal looks perfect with a logic analyzer). So far I've taken the view that the driver works ok for most situations, since IR is usually fine with upto 100us missing here or there. The gpio-ir-tx driver works much better because it does the entire send under spinlock - obviously that has its own problems, because an IR transmit can be 10s or even 100s of milliseconds. I've never known of a solution to the pwm-ir-tx driver. If using hrtimers directly improves the situation even a bit, then that would be great. > ir-rx51 does not suffer from the same problem (albeit it has its own one, > see bellow) > > > > In either case help is appreciated to dig further trying to find the reason > > > for such a big delay. > > > > pwm-ir-tx uses usleep_range() and ir-rx51 uses hrtimers. I thought that > > usleep_range() uses hrtimers; however if you're not seeing the same delay > > on ir-rx51 then maybe it's time to switch pwm-ir-tx to hrtimers. > > > > usleep_range() is backed by hrtimers already, however the difference comes > from how hrtimer is used in ir-rx51: it uses timer callback function that > gets called in softirq context, while usleep_range() puts the task in > TASK_UNINTERRUPTIBLE state and then calls schedule_hrtimeout_range(). For > some reason it takes at least 200-400 us (on average) even on OMAP4 to > switch back to TASK_RUNNING state. > > The issue with ir-rx51 and the way it uses hrtimers is that it calls > pwm_apply_state() from hrtimer function, which is not ok, per the comment > here > https://elixir.bootlin.com/linux/v6.6-rc3/source/drivers/pwm/core.c#L502 > > I can make pwm-ir-tx switch to hrtimers, that's not an issue, but I am > afraid that there is some general scheduler or timers (or something else) > issue that manifests itself with usleep_range() misbehaving. If we can switch pwm-ir-tx to hrtimers, that would be great. The ir-rx51 removal patches have already been queued to media_staging; we may have to remove them from there if we can't solve this problem. > > I don't have a n900 to test on, unfortunately. > > > > I have and once I have an idea what's going on will port pwm-ir-tx to > hrtimers, if needed. Don't want to do it now as I am afraid the completion I > will have to use will have the same latency problems as usleep_range() That would be fantastic. Please do keep us up to date with how you are getting on. Like I said, it would be nice to this resolved before the next merge window. Thanks, Sean