From: Tang Junhui <tang.junhui@xxxxxxxxxx> Hello Coly, >dc->writeback_rate_update is a special delayed worker, it re-arms itself >to run after several seconds by, >>> schedule_delayed_work(&dc->writeback_rate_update, >>> dc->writeback_rate_update_seconds * HZ); > >I check the workqueue code, it seems cancel_delayed_work_sync() does not >prevent a delayed work to re-arm itself inside worker routine. And in my >test, around 5 seconds after cancel_delayed_work_sync() called, a NULL >pointer difference oops happens. I think I got how this issue would be occured: 1)work writeback_rate_update is running; 2)cancel_delayed_work_sync() is called, and waiting for work writeback_rate_update to run over, and conntinue to release cche set and cached device resources. 3)In the end of step 1, work writeback_rate_update re-armed again, and 5s later, the work runs again. >Cache set memory is freed within 2 seconds >after __cache_set_unregister() called, so inside >__update_writeback_rate() struct cache_set is referenced and causes the >NULL pointer deference bug. > So, if it occured like I said above, it is not safty to judged by: > struct cache_set *c = dc->disk.c; > if (test_bit(CACHE_SET_STOPPING, &c->flags)) > return; because cache_set, even bcache device and cache device are all released. Is that right? Thanks, Tang Junhui