On Mon, Sep 16, 2019 at 4:39 PM syzbot <syzbot+ac54455281db908c581e@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote: > > Hello, > > syzbot found the following crash on: > > HEAD commit: 1609d760 Merge tag 'for-linus' of git://git.kernel.org/pub.. > git tree: upstream > console output: https://syzkaller.appspot.com/x/log.txt?x=10236abe600000 > kernel config: https://syzkaller.appspot.com/x/.config?x=ed2b148cd67382ec > dashboard link: https://syzkaller.appspot.com/bug?extid=ac54455281db908c581e > compiler: clang version 9.0.0 (/home/glider/llvm/clang > 80fee25776c2fb61e74c1ecb1a523375c2500b69) > syz repro: https://syzkaller.appspot.com/x/repro.syz?x=116c4b11600000 > C reproducer: https://syzkaller.appspot.com/x/repro.c?x=15ff270d600000 > > The bug was bisected to: > > commit c266f64dbfa2a970a13b0574246c0ddfec492365 > Author: Vlad Buslov <vladbu@xxxxxxxxxxxx> > Date: Mon Feb 11 08:55:32 2019 +0000 > > net: sched: protect block state with mutex > > bisection log: https://syzkaller.appspot.com/x/bisect.txt?x=16e7ca65600000 > final crash: https://syzkaller.appspot.com/x/report.txt?x=15e7ca65600000 > console output: https://syzkaller.appspot.com/x/log.txt?x=11e7ca65600000 > > IMPORTANT: if you fix the bug, please add the following tag to the commit: > Reported-by: syzbot+ac54455281db908c581e@xxxxxxxxxxxxxxxxxxxxxxxxx > Fixes: c266f64dbfa2 ("net: sched: protect block state with mutex") > > BUG: sleeping function called from invalid context at > kernel/locking/mutex.c:909 > in_atomic(): 1, irqs_disabled(): 0, pid: 9297, name: syz-executor942 > INFO: lockdep is turned off. > Preemption disabled at: > [<ffffffff8604de24>] spin_lock_bh include/linux/spinlock.h:343 [inline] > [<ffffffff8604de24>] sch_tree_lock include/net/sch_generic.h:570 [inline] > [<ffffffff8604de24>] sfb_change+0x284/0xd30 net/sched/sch_sfb.c:519 > CPU: 0 PID: 9297 Comm: syz-executor942 Not tainted 5.3.0-rc8+ #0 > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS > Google 01/01/2011 > Call Trace: > __dump_stack lib/dump_stack.c:77 [inline] > dump_stack+0x1d8/0x2f8 lib/dump_stack.c:113 > ___might_sleep+0x3ff/0x530 kernel/sched/core.c:6608 > __might_sleep+0x8f/0x100 kernel/sched/core.c:6561 > __mutex_lock_common+0x4e/0x2820 kernel/locking/mutex.c:909 > __mutex_lock kernel/locking/mutex.c:1077 [inline] > mutex_lock_nested+0x1b/0x30 kernel/locking/mutex.c:1092 > tcf_chain0_head_change_cb_del+0x30/0x390 net/sched/cls_api.c:932 > tcf_block_put_ext+0x3d/0x2a0 net/sched/cls_api.c:1502 > tcf_block_put+0x6e/0x90 net/sched/cls_api.c:1515 > sfb_destroy+0x47/0x70 net/sched/sch_sfb.c:467 > qdisc_destroy+0x147/0x4d0 net/sched/sch_generic.c:968 > qdisc_put+0x58/0x90 net/sched/sch_generic.c:992 > sfb_change+0x52d/0xd30 net/sched/sch_sfb.c:522 I don't think we have to hold the qdisc tree lock when destroying the old qdisc. Does the following change make sense? diff --git a/net/sched/sch_sfb.c b/net/sched/sch_sfb.c index 1dff8506a715..726d0fa956b1 100644 --- a/net/sched/sch_sfb.c +++ b/net/sched/sch_sfb.c @@ -488,7 +488,7 @@ static int sfb_change(struct Qdisc *sch, struct nlattr *opt, struct netlink_ext_ack *extack) { struct sfb_sched_data *q = qdisc_priv(sch); - struct Qdisc *child; + struct Qdisc *child, *tmp; struct nlattr *tb[TCA_SFB_MAX + 1]; const struct tc_sfb_qopt *ctl = &sfb_default_ops; u32 limit; @@ -519,7 +519,7 @@ static int sfb_change(struct Qdisc *sch, struct nlattr *opt, sch_tree_lock(sch); qdisc_tree_flush_backlog(q->qdisc); - qdisc_put(q->qdisc); + tmp = q->qdisc; q->qdisc = child; q->rehash_interval = msecs_to_jiffies(ctl->rehash_interval); @@ -543,6 +543,7 @@ static int sfb_change(struct Qdisc *sch, struct nlattr *opt, sch_tree_unlock(sch); + qdisc_put(tmp); return 0; } What do you think, Vlad?