Hi, sorry for the delay. Am Freitag, 10. März 2023, 14:08:15 CEST schrieb A. Sverdlin: > From: Alexander Sverdlin <alexander.sverdlin@xxxxxxxxxxx> > > One of the reasons to do it is to save some CPU cycles on cpu_freq_get() > under mutex. The second reason if the (false-positive) lockdep splat caused > by the recursive feature of the "prepare_lock" (one clock instance is I2C > peripheral clock and another is pcf85063 RTC as clock provider): > > ====================================================== > WARNING: possible circular locking dependency detected > 5.15.71+... #1 Tainted: G O > ------------------------------------------------------ > fs-value/2332 is trying to acquire lock: > ffff8000096cae08 (prepare_lock){+.+.}-{3:3}, at: clk_prepare_lock > > but task is already holding lock: > ffff000011021100 (i2c_register_adapter){+.+.}-{3:3}, at: > i2c_adapter_lock_bus > > which lock already depends on the new lock. > > the existing dependency chain (in reverse order) is: > > -> #2 (i2c_register_adapter){+.+.}-{3:3}: > lock_acquire > rt_mutex_lock_nested > i2c_adapter_lock_bus > i2c_transfer > regmap_i2c_read > _regmap_raw_read > _regmap_bus_read > _regmap_read > regmap_read > pcf85063_probe > i2c_device_probe > really_probe > __driver_probe_device > driver_probe_device > __device_attach_driver > bus_for_each_drv > __device_attach > device_initial_probe > bus_probe_device > device_add > device_register > i2c_new_client_device > of_i2c_register_devices > i2c_register_adapter > __i2c_add_numbered_adapter > i2c_add_adapter > lpi2c_imx_probe > platform_probe > really_probe > __driver_probe_device > driver_probe_device > __device_attach_driver > bus_for_each_drv > __device_attach > device_initial_probe > bus_probe_device > deferred_probe_work_func > process_one_work > worker_thread > kthread > ret_from_fork > > -> #1 (rtc_pcf85063:560:(&config->regmap)->lock){+.+.}-{3:3}: > lock_acquire > __mutex_lock > mutex_lock_nested > regmap_lock_mutex > regmap_read > pcf85063_clkout_recalc_rate > __clk_register > devm_clk_register > pcf85063_probe > i2c_device_probe > really_probe > __driver_probe_device > driver_probe_device > __device_attach_driver > bus_for_each_drv > __device_attach > device_initial_probe > bus_probe_device > device_add > device_register > i2c_new_client_device > of_i2c_register_devices > i2c_register_adapter > __i2c_add_numbered_adapter > i2c_add_adapter > lpi2c_imx_probe > platform_probe > really_probe > __driver_probe_device > driver_probe_device > __device_attach_driver > bus_for_each_drv > __device_attach > device_initial_probe > bus_probe_device > deferred_probe_work_func > process_one_work > worker_thread > kthread > ret_from_fork > > -> #0 (prepare_lock){+.+.}-{3:3}: > __lock_acquire > lock_acquire.part.0 > lock_acquire > __mutex_lock > mutex_lock_nested > clk_prepare_lock > clk_get_rate > lpi2c_imx_xfer > __i2c_transfer > i2c_transfer > regmap_i2c_read > _regmap_raw_read > regmap_raw_read > regmap_bulk_read > at24_read > nvmem_reg_read > bin_attr_nvmem_read > sysfs_kf_bin_read > kernfs_fop_read_iter > new_sync_read > vfs_read > ksys_read > __arm64_sys_read > invoke_syscall > ... > > other info that might help us debug this: > > Chain exists of: > prepare_lock --> rtc_pcf85063:560:(&config->regmap)->lock --> > i2c_register_adapter > > Possible unsafe locking scenario: > > CPU0 CPU1 > ---- ---- > lock(i2c_register_adapter); > > lock(rtc_pcf85063:560:(&config->regmap)->lock); lock(i2c_register_adapter); > lock(prepare_lock); > > *** DEADLOCK *** > > 4 locks held by .../2332: > #0: ffff0000146eb288 (&of->mutex){+.+.}-{3:3}, at: kernfs_fop_read_iter > #1: ffff000010fe4400 (kn->active#72){.+.+}-{0:0}, at: kernfs_fop_read_iter > #2: ffff0000110168e8 (&at24->lock){+.+.}-{3:3}, at: at24_read > #3: ffff000011021100 (i2c_register_adapter){+.+.}-{3:3}, at: > i2c_adapter_lock_bus > > stack backtrace: > CPU: 1 PID: 2332 Comm: ... Tainted: G O 5.15.71+... #1 > Hardware name: ... (DT) > Call trace: > dump_backtrace > show_stack > dump_stack_lvl > dump_stack > print_circular_bug > check_noncircular > __lock_acquire > lock_acquire.part.0 > lock_acquire > __mutex_lock > mutex_lock_nested > clk_prepare_lock > clk_get_rate > lpi2c_imx_xfer > __i2c_transfer > i2c_transfer > regmap_i2c_read > _regmap_raw_read > regmap_raw_read > regmap_bulk_read > at24_read > nvmem_reg_read > bin_attr_nvmem_read > sysfs_kf_bin_read > kernfs_fop_read_iter > new_sync_read > vfs_read > ksys_read > __arm64_sys_read > invoke_syscall > ... > > Fixes: a55fa9d0e42e ("i2c: imx-lpi2c: add low power i2c bus driver") > Signed-off-by: Alexander Sverdlin <alexander.sverdlin@xxxxxxxxxxx> > --- > drivers/i2c/busses/i2c-imx-lpi2c.c | 33 +++++++++++++++++++++++++++--- > 1 file changed, 30 insertions(+), 3 deletions(-) > > Changelog: > v3: fixed build error reported by kernel test robot <lkp@xxxxxxxxx> > > https://lore.kernel.org/oe-kbuild-all/202303102010.pAv56wKs-lkp@xxxxxxxxx/ > v2: added clk_notifier as Alexander suggested > > diff --git a/drivers/i2c/busses/i2c-imx-lpi2c.c > b/drivers/i2c/busses/i2c-imx-lpi2c.c index 188f2a36d2fd6..5f1d1d4e018bd > 100644 > --- a/drivers/i2c/busses/i2c-imx-lpi2c.c > +++ b/drivers/i2c/busses/i2c-imx-lpi2c.c > @@ -100,6 +100,8 @@ struct lpi2c_imx_struct { > __u8 *rx_buf; > __u8 *tx_buf; > struct completion complete; > + struct notifier_block clk_change_nb; > + unsigned int rate_per; > unsigned int msglen; > unsigned int delivered; > unsigned int block_data; > @@ -198,24 +200,37 @@ static void lpi2c_imx_stop(struct lpi2c_imx_struct > *lpi2c_imx) } while (1); > } > > +static int lpi2c_imx_clk_change_cb(struct notifier_block *nb, > + unsigned long action, void *data) > +{ > + struct clk_notifier_data *ndata = data; > + struct lpi2c_imx_struct *lpi2c_imx = container_of(nb, > + struct lpi2c_imx_struct, > + clk_change_nb); > + > + if (action & POST_RATE_CHANGE) > + lpi2c_imx->rate_per = ndata->new_rate; > + > + return NOTIFY_OK; > +} > + > /* CLKLO = I2C_CLK_RATIO * CLKHI, SETHOLD = CLKHI, DATAVD = CLKHI/2 */ > static int lpi2c_imx_config(struct lpi2c_imx_struct *lpi2c_imx) > { > u8 prescale, filt, sethold, clkhi, clklo, datavd; > - unsigned int clk_rate, clk_cycle; > + unsigned int clk_cycle; > enum lpi2c_imx_pincfg pincfg; > unsigned int temp; > > lpi2c_imx_set_mode(lpi2c_imx); > > - clk_rate = clk_get_rate(lpi2c_imx->clks[0].clk); > if (lpi2c_imx->mode == HS || lpi2c_imx->mode == ULTRA_FAST) > filt = 0; > else > filt = 2; > > for (prescale = 0; prescale <= 7; prescale++) { > - clk_cycle = clk_rate / ((1 << prescale) * lpi2c_imx- >bitrate) > + clk_cycle = lpi2c_imx->rate_per / ((1 << prescale) * lpi2c_imx->bitrate) > - 3 - (filt >> 1); > clkhi = (clk_cycle + I2C_CLK_RATIO) / (I2C_CLK_RATIO + 1); > clklo = clk_cycle - clkhi; > @@ -588,6 +603,18 @@ static int lpi2c_imx_probe(struct platform_device > *pdev) if (ret) > return ret; > > + lpi2c_imx->clk_change_nb.notifier_call = lpi2c_imx_clk_change_cb; > + ret = devm_clk_notifier_register(&pdev->dev, lpi2c_imx->clks[0].clk, > + &lpi2c_imx->clk_change_nb); > + if (ret) > + return dev_err_probe(&pdev->dev, ret, > + "can't register peripheral clock notifier\n"); > + lpi2c_imx->rate_per = clk_get_rate(lpi2c_imx->clks[0].clk); > + if (!lpi2c_imx->rate_per) { > + dev_err(&pdev->dev, "can't get I2C peripheral clock rate\n"); > + return -EINVAL; > + } > + I would switch the order of the calls to devm_clk_notifier_register() and clk_get_rate(). AFAICS this looks like a possible lost update. The notifier might change rate_per before the (old) value from clk_get_rate is actually assigned. With this change you fix the following call chain (from a kernel log) [ 4.562396] clk_prepare_lock+0x48/0x9c [ 4.566740] clk_get_rate+0x1c/0x74 [ 4.570736] lpi2c_imx_config+0x4c/0x190 [ 4.575167] lpi2c_imx_master_enable+0x54/0x128 [ 4.580205] lpi2c_imx_xfer+0x2c/0x3c8 Unfortuantely this was just hiding another call path from lpi2c_imx_xfer to clk_bulk_prepare. Here is my kernel log: [ 22.264508] ====================================================== [ 22.270697] WARNING: possible circular locking dependency detected [ 22.276893] 6.3.0-rc7-next-20230420+ #6 Not tainted [ 22.281782] ------------------------------------------------------ [ 22.287968] kworker/2:3/93 is trying to acquire lock: [ 22.293034] ffff800009d38bf8 (prepare_lock){+.+.}-{3:3}, at: clk_prepare_lock+0x48/0x9c [ 22.301095] [ 22.301095] but task is already holding lock: [ 22.306942] ffff0000039ab100 (i2c_register_adapter){+.+.}-{3:3}, at: i2c_adapter_lock_bus+0x20/0x2c [ 22.316046] [ 22.316046] which lock already depends on the new lock. [ 22.316046] [ 22.324238] [ 22.324238] the existing dependency chain (in reverse order) is: [ 22.331727] [ 22.331727] -> #2 (i2c_register_adapter){+.+.}-{3:3}: [ 22.338295] __lock_acquire+0x370/0x6e8 [ 22.342674] lock_acquire.part.0+0xcc/0x208 [ 22.347399] lock_acquire+0x94/0x14c [ 22.351509] rt_mutex_lock_nested+0x5c/0x98 [ 22.356235] i2c_adapter_lock_bus+0x20/0x2c [ 22.360961] i2c_transfer+0x80/0x114 [ 22.365069] regmap_i2c_read+0x5c/0xa0 [ 22.369362] _regmap_raw_read+0x110/0x2dc [ 22.373905] _regmap_bus_read+0x40/0x74 [ 22.378274] _regmap_read+0x74/0x248 [ 22.382393] regmap_read+0x48/0x70 [ 22.386337] pcf85063_probe+0xf0/0x4f4 [ 22.390640] i2c_device_probe+0x100/0x2d4 [ 22.395206] call_driver_probe+0x28/0x15c [ 22.399750] really_probe+0x180/0x320 [ 22.403946] __driver_probe_device+0x84/0x144 [ 22.408837] driver_probe_device+0x38/0x150 [ 22.413554] __device_attach_driver+0xcc/0x194 [ 22.418532] bus_for_each_drv+0x80/0xdc [ 22.422910] __device_attach+0xa8/0x1f8 [ 22.426545] systemd-journald[128]: Oldest entry in /var/log/journal/ e4579cc7db6747028247b9b859e132d6/system.joural is older than the configured file retention duration (1month), suggesting rotation. [ 22.427278] device_initial_probe+0x10/0x18 [ 22.427290] bus_probe_device+0xa4/0xa8 [ 22.427299] device_add+0x3b0/0x508 [ 22.427311] device_register+0x1c/0x28 [ 22.427323] i2c_new_client_device+0x1c8/0x2bc [ 22.427333] of_i2c_register_device+0xb4/0xdc [ 22.427344] of_i2c_register_devices+0x80/0x154 [ 22.456329] systemd-journald[128]: /var/log/journal/ e4579cc7db6747028247b9b859e132d6/system.journal: Journal heaer limits reached or header out-of-date, rotating. [ 22.458176] i2c_register_adapter+0x184/0x4c8 [ 22.458202] __i2c_add_numbered_adapter+0x5c/0xa4 [ 22.502097] i2c_add_adapter+0xa0/0xcc [ 22.502122] lpi2c_imx_probe+0x23c/0x350 [ 22.502135] platform_probe+0x64/0xfc [ 22.502146] call_driver_probe+0x28/0x15c [ 22.502155] really_probe+0x180/0x320 [ 22.502164] __driver_probe_device+0x84/0x144 [ 22.502173] driver_probe_device+0x38/0x150 [ 22.502182] __device_attach_driver+0xcc/0x194 [ 22.502191] bus_for_each_drv+0x80/0xdc [ 22.502204] __device_attach+0xa8/0x1f8 [ 22.502212] device_initial_probe+0x10/0x18 [ 22.502222] bus_probe_device+0xa4/0xa8 [ 22.502230] deferred_probe_work_func+0xa0/0xf0 [ 22.502239] process_one_work+0x284/0x5b0 [ 22.502252] worker_thread+0x68/0x39c [ 22.502263] kthread+0x104/0x108 [ 22.502274] ret_from_fork+0x10/0x20 [ 22.502286] [ 22.502286] -> #1 (rtc_pcf85063:594:(&config->regmap)->lock){+.+.}-{3:3}: [ 22.502310] __lock_acquire+0x370/0x6e8 [ 22.502322] lock_acquire.part.0+0xcc/0x208 [ 22.502332] lock_acquire+0x94/0x14c [ 22.502341] __mutex_lock+0x9c/0x838 [ 22.502353] mutex_lock_nested+0x20/0x28 [ 22.502364] regmap_lock_mutex+0x10/0x18 [ 22.502377] regmap_read+0x38/0x70 [ 22.502389] pcf85063_clkout_recalc_rate+0x2c/0x78 [ 22.502403] __clk_core_init+0x46c/0x4e0 [ 22.502417] __clk_register+0x160/0x23c [ 22.502429] devm_clk_register+0x58/0xb4 [ 22.502441] pcf85063_probe+0x238/0x4f4 [ 22.502451] i2c_device_probe+0x100/0x2d4 [ 22.502465] call_driver_probe+0x28/0x15c [ 22.502473] really_probe+0x180/0x320 [ 22.502482] __driver_probe_device+0x84/0x144 [ 22.502491] driver_probe_device+0x38/0x150 [ 22.502500] __device_attach_driver+0xcc/0x194 [ 22.502509] bus_for_each_drv+0x80/0xdc [ 22.502521] __device_attach+0xa8/0x1f8 [ 22.502530] device_initial_probe+0x10/0x18 [ 22.502539] bus_probe_device+0xa4/0xa8 [ 22.502548] device_add+0x3b0/0x508 [ 22.502559] device_register+0x1c/0x28 [ 22.502570] i2c_new_client_device+0x1c8/0x2bc [ 22.502580] of_i2c_register_device+0xb4/0xdc [ OK ] Started Network Time Synchronization[ 22.502589] of_i2c_register_devices+0x80/0x154 .[ 22.502599] i2c_register_adapter+0x184/0x4c8 [ 22.502607] __i2c_add_numbered_adapter+0x5c/0xa4 [ 22.502616] i2c_add_adapter+0xa0/0xcc [ 22.502624] lpi2c_imx_probe+0x23c/0x350 [ 22.502636] platform_probe+0x64/0xfc [ 22.502646] call_driver_probe+0x28/0x15c [ 22.502655] really_probe+0x180/0x320 [ 22.502663] __driver_probe_device+0x84/0x144 [ 22.502672] driver_probe_device+0x38/0x150 [ 22.502681] __device_attach_driver+0xcc/0x194 [ 22.502690] bus_for_each_drv+0x80/0xdc [ 22.502702] __device_attach+0xa8/0x1f8 [ 22.502711] device_initial_probe+0x10/0x18 [ 22.502720] bus_probe_device+0xa4/0xa8 [ 22.502729] deferred_probe_work_func+0xa0/0xf0 [ 22.502738] process_one_work+0x284/0x5b0 [ 22.502749] worker_thread+0x68/0x39c [ 22.502760] kthread+0x104/0x108 [ 22.502770] ret_from_fork+0x10/0x20 [ 22.502779] [ 22.502779] -> #0 (prepare_lock){+.+.}-{3:3}: [ 22.502799] check_prev_add+0xb0/0xc80 [ 22.502809] validate_chain+0x444/0x510 [ 22.502818] __lock_acquire+0x370/0x6e8 [ 22.502827] lock_acquire.part.0+0xcc/0x208 [ 22.502837] lock_acquire+0x94/0x14c [ 22.502846] __mutex_lock+0x9c/0x838 [ 22.502857] mutex_lock_nested+0x20/0x28 [ 22.502867] clk_prepare_lock+0x48/0x9c [ 22.502878] clk_prepare+0x1c/0x3c [ 22.502890] clk_bulk_prepare+0x48/0xcc [ 22.502899] lpi2c_runtime_resume+0x30/0x84 [ 22.502910] pm_generic_runtime_resume+0x28/0x3c [ 22.502923] __genpd_runtime_resume+0x2c/0xa0 [ 22.502935] genpd_runtime_resume+0xbc/0x308 [ 22.502944] __rpm_callback+0x44/0x19c [ 22.502953] rpm_callback+0x64/0x70 [ 22.502962] rpm_resume+0x438/0x6d8 [ 22.502970] __pm_runtime_resume+0x54/0xb0 [ 22.502978] lpi2c_imx_master_enable+0x24/0x128 [ 22.502989] lpi2c_imx_xfer+0x2c/0x3c8 [ 22.502999] __i2c_transfer+0xe4/0x5a0 [ 22.503008] i2c_transfer+0x90/0x114 [ 22.503016] i2c_transfer_buffer_flags+0x58/0x84 [ 22.503025] regmap_i2c_write+0x1c/0x4c [ 22.503035] _regmap_raw_write_impl+0x7dc/0x944 [ 22.503044] _regmap_bus_raw_write+0x5c/0x74 [ 22.503052] _regmap_write+0x64/0x238 [ 22.503060] _regmap_update_bits+0x100/0x11c [ 22.503069] regmap_update_bits_base+0x60/0x90 [ 22.503077] pca953x_gpio_set_value+0x74/0x90 [ 22.503088] gpiod_set_raw_value_commit+0x6c/0x7c [ 22.503098] gpiod_set_value_nocheck+0x68/0x70 [ 22.503107] gpiod_set_value_cansleep+0x3c/0xa8 [ 22.503116] gpio_led_set_blocking+0x54/0x7c [ 22.503128] set_brightness_delayed+0x90/0xd8 [ 22.503138] process_one_work+0x284/0x5b0 [ 22.503149] worker_thread+0x68/0x39c [ 22.503160] kthread+0x104/0x108 [ 22.503169] ret_from_fork+0x10/0x20 [ 22.503179] [ 22.503179] other info that might help us debug this: [ 22.503179] [ 22.503183] Chain exists of: [ 22.503183] prepare_lock --> rtc_pcf85063:594:(&config->regmap)->lock --> i2c_register_adapter [ 22.503183] [ 22.503207] Possible unsafe locking scenario: [ 22.503207] [ 22.503210] CPU0 CPU1 [ 22.503213] ---- ---- [ 22.503216] lock(i2c_register_adapter); [ 22.503225] lock(rtc_pcf85063:594:(&config- >regmap)->lock); [ 22.503235] lock(i2c_register_adapter); [ 22.503244] lock(prepare_lock); [ 22.503253] [ 22.503253] *** DEADLOCK *** Now lpi2c_runtime_resume will call into clk_prepare() which also calls clk_prepare_lock() (identical to clk_get_rate). Best regards, Alexader > pm_runtime_set_autosuspend_delay(&pdev->dev, I2C_PM_TIMEOUT); > pm_runtime_use_autosuspend(&pdev->dev); > pm_runtime_get_noresume(&pdev->dev); -- TQ-Systems GmbH | Mühlstraße 2, Gut Delling | 82229 Seefeld, Germany Amtsgericht München, HRB 105018 Geschäftsführer: Detlef Schneider, Rüdiger Stahl, Stefan Schneider http://www.tq-group.com/