Hi Devesh, Testing 4.4.0-rc5, the ocrdma driver is failing for me (100% reliably). If you have vlans off of the main device, this is what I get from the Fedora rawhide 4.4.0-rc5 kernel: -- Doug Ledford <dledford@xxxxxxxxxx> GPG KeyID: 0E572FDD
[ 26.692881] be2net 0000:85:00.0 ocrdma_roce: Link is Up [ 26.693339] ====================================================== [ 26.693340] [ INFO: possible circular locking dependency detected ] [ 26.693341] 4.4.0-0.rc5.git3.1.fc24.x86_64 #1 Tainted: G I [ 26.693341] ------------------------------------------------------- [ 26.693342] NetworkManager/2867 is trying to acquire lock: [ 26.693348] (be_adapter_list_lock){+.+.+.}, at: [<ffffffffa053d7f5>] be_roce_dev_open+0x35/0x70 [be2net] [ 26.693349] but task is already holding lock: [ 26.693354] (rtnl_mutex){+.+.+.}, at: [<ffffffff8174961b>] rtnetlink_rcv+0x1b/0x40 [ 26.693355] which lock already depends on the new lock. [ 26.693355] the existing dependency chain (in reverse order) is: [ 26.693356] -> #2 (rtnl_mutex){+.+.+.}: [ 26.693361] [<ffffffff8110b56e>] lock_acquire+0xce/0x1c0 [ 26.693366] [<ffffffff8187c086>] mutex_lock_nested+0x86/0x400 [ 26.693368] [<ffffffff81747f27>] rtnl_lock+0x17/0x20 [ 26.693375] [<ffffffffa00770b5>] enum_all_gids_of_dev_cb+0x25/0xd0 [ib_core] [ 26.693379] [<ffffffffa0072918>] ib_enum_roce_netdev+0x128/0x130 [ib_core] [ 26.693382] [<ffffffffa00774e1>] roce_rescan_device+0x21/0x30 [ib_core] [ 26.693385] [<ffffffffa007521c>] ib_cache_setup_one+0x2bc/0x3b0 [ib_core] [ 26.693388] [<ffffffffa00725d3>] ib_register_device+0x2e3/0x420 [ib_core] [ 26.693391] [<ffffffffa076c85a>] ocrdma_add+0x43a/0x710 [ocrdma] [ 26.693393] [<ffffffffa053d58d>] _be_roce_dev_add+0x17d/0x1e0 [be2net] [ 26.693396] [<ffffffffa053d65a>] be_roce_register_driver+0x6a/0xd0 [be2net] [ 26.693402] [<ffffffffa0781015>] target_dev_control_store+0x15/0x20 [target_core_mod] [ 26.693406] [<ffffffff81002123>] do_one_initcall+0xb3/0x200 [ 26.693408] [<ffffffff811e3298>] do_init_module+0x5f/0x1e7 [ 26.693410] [<ffffffff81153246>] load_module+0x2126/0x27d0 [ 26.693411] [<ffffffff81153a62>] SyS_init_module+0x172/0x1b0 [ 26.693412] [<ffffffff8187fe32>] entry_SYSCALL_64_fastpath+0x12/0x76 [ 26.693414] -> #1 (device_mutex){+.+.+.}: [ 26.693415] [<ffffffff8110b56e>] lock_acquire+0xce/0x1c0 [ 26.693417] [<ffffffff8187c086>] mutex_lock_nested+0x86/0x400 [ 26.693420] [<ffffffffa007232f>] ib_register_device+0x3f/0x420 [ib_core] [ 26.693422] [<ffffffffa076c85a>] ocrdma_add+0x43a/0x710 [ocrdma] [ 26.693423] [<ffffffffa053d58d>] _be_roce_dev_add+0x17d/0x1e0 [be2net] [ 26.693425] [<ffffffffa053d65a>] be_roce_register_driver+0x6a/0xd0 [be2net] [ 26.693428] [<ffffffffa0781015>] target_dev_control_store+0x15/0x20 [target_core_mod] [ 26.693430] [<ffffffff81002123>] do_one_initcall+0xb3/0x200 [ 26.693431] [<ffffffff811e3298>] do_init_module+0x5f/0x1e7 [ 26.693432] [<ffffffff81153246>] load_module+0x2126/0x27d0 [ 26.693433] [<ffffffff81153a62>] SyS_init_module+0x172/0x1b0 [ 26.693435] [<ffffffff8187fe32>] entry_SYSCALL_64_fastpath+0x12/0x76 [ 26.693436] -> #0 (be_adapter_list_lock){+.+.+.}: [ 26.693437] [<ffffffff8110a969>] __lock_acquire+0x18f9/0x1b70 [ 26.693439] [<ffffffff8110b56e>] lock_acquire+0xce/0x1c0 [ 26.693440] [<ffffffff8187c086>] mutex_lock_nested+0x86/0x400 [ 26.693442] [<ffffffffa053d7f5>] be_roce_dev_open+0x35/0x70 [be2net] [ 26.693444] [<ffffffffa0532500>] be_open+0x670/0x700 [be2net] [ 26.693446] [<ffffffff81739df8>] __dev_open+0xc8/0x140 [ 26.693448] [<ffffffff8173a10d>] __dev_change_flags+0x9d/0x160 [ 26.693449] [<ffffffff8173a1f9>] dev_change_flags+0x29/0x70 [ 26.693451] [<ffffffff8174a486>] do_setlink+0x636/0xb80 [ 26.693452] [<ffffffff8174b0bc>] rtnl_newlink+0x5ac/0x8a0 [ 26.693454] [<ffffffff81749726>] rtnetlink_rcv_msg+0xe6/0x240 [ 26.693456] [<ffffffff81773a44>] netlink_rcv_skb+0xa4/0xc0 [ 26.693457] [<ffffffff8174962a>] rtnetlink_rcv+0x2a/0x40 [ 26.693459] [<ffffffff8177315a>] netlink_unicast+0x19a/0x290 [ 26.693460] [<ffffffff81773713>] netlink_sendmsg+0x4c3/0x620 [ 26.693462] [<ffffffff81715488>] sock_sendmsg+0x38/0x50 [ 26.693463] [<ffffffff81715fa9>] ___sys_sendmsg+0x2c9/0x2e0 [ 26.693465] [<ffffffff81716cf1>] __sys_sendmsg+0x51/0x90 [ 26.693466] [<ffffffff81716d42>] SyS_sendmsg+0x12/0x20 [ 26.693467] [<ffffffff8187fe32>] entry_SYSCALL_64_fastpath+0x12/0x76 [ 26.693468] other info that might help us debug this: [ 26.693469] Chain exists of: be_adapter_list_lock --> device_mutex --> rtnl_mutex [ 26.693470] Possible unsafe locking scenario: [ 26.693470] CPU0 CPU1 [ 26.693470] ---- ---- [ 26.693471] lock(rtnl_mutex); [ 26.693472] lock(device_mutex); [ 26.693472] lock(rtnl_mutex); [ 26.693473] lock(be_adapter_list_lock); [ 26.693473] *** DEADLOCK *** [ 26.693474] 1 lock held by NetworkManager/2867: [ 26.693476] #0: (rtnl_mutex){+.+.+.}, at: [<ffffffff8174961b>] rtnetlink_rcv+0x1b/0x40 [ 26.693476] stack backtrace: [ 26.693478] CPU: 14 PID: 2867 Comm: NetworkManager Tainted: G I 4.4.0-0.rc5.git3.1.fc24.x86_64 #1 [ 26.693479] Hardware name: Dell Inc. PowerEdge R730xd/0599V5, BIOS 1.0.4 08/28/2014 [ 26.693481] 0000000000000000 0000000022867838 ffff8820175d74a0 ffffffff81427df9 [ 26.693482] ffffffff82bd4410 ffff8820175d74e0 ffffffff81107653 ffff8820175d7550 [ 26.693483] ffff882017590cc8 ffff882017590000 ffff882017590c90 0000000000000000 [ 26.693484] Call Trace: [ 26.693487] [<ffffffff81427df9>] dump_stack+0x4b/0x72 [ 26.693489] [<ffffffff81107653>] print_circular_bug+0x1e3/0x250 [ 26.693490] [<ffffffff8110a969>] __lock_acquire+0x18f9/0x1b70 [ 26.693492] [<ffffffff81880964>] ? retint_kernel+0x10/0x10 [ 26.693493] [<ffffffff8110b56e>] lock_acquire+0xce/0x1c0 [ 26.693495] [<ffffffffa053d7f5>] ? be_roce_dev_open+0x35/0x70 [be2net] [ 26.693497] [<ffffffff8187c086>] mutex_lock_nested+0x86/0x400 [ 26.693499] [<ffffffffa053d7f5>] ? be_roce_dev_open+0x35/0x70 [be2net] [ 26.693500] [<ffffffff81733c0c>] ? netdev_info+0x6c/0x90 [ 26.693502] [<ffffffffa053d7f5>] ? be_roce_dev_open+0x35/0x70 [be2net] [ 26.693504] [<ffffffff8174ed17>] ? linkwatch_fire_event+0x57/0xa0 [ 26.693506] [<ffffffffa053d7f5>] be_roce_dev_open+0x35/0x70 [be2net] [ 26.693507] [<ffffffffa0532500>] be_open+0x670/0x700 [be2net] [ 26.693509] [<ffffffff81739df8>] __dev_open+0xc8/0x140 [ 26.693511] [<ffffffff8173a10d>] __dev_change_flags+0x9d/0x160 [ 26.693512] [<ffffffff8173a1f9>] dev_change_flags+0x29/0x70 [ 26.693513] [<ffffffff8174a486>] do_setlink+0x636/0xb80 [ 26.693515] [<ffffffff8110952a>] ? __lock_acquire+0x4ba/0x1b70 [ 26.693518] [<ffffffffa01168ed>] ? mga_dirty_update+0x21d/0x350 [mgag200] [ 26.693520] [<ffffffff810268b9>] ? sched_clock+0x9/0x10 [ 26.693522] [<ffffffff81458622>] ? nla_parse+0x32/0x100 [ 26.693523] [<ffffffff8174b0bc>] rtnl_newlink+0x5ac/0x8a0 [ 26.693527] [<ffffffff810b8028>] ? ns_capable+0x38/0x70 [ 26.693528] [<ffffffff81749726>] rtnetlink_rcv_msg+0xe6/0x240 [ 26.693530] [<ffffffff8174961b>] ? rtnetlink_rcv+0x1b/0x40 [ 26.693533] [<ffffffff810e82dc>] ? local_clock+0x1c/0x20 [ 26.693534] [<ffffffff8174961b>] ? rtnetlink_rcv+0x1b/0x40 [ 26.693535] [<ffffffff81749640>] ? rtnetlink_rcv+0x40/0x40 [ 26.693537] [<ffffffff81773a44>] netlink_rcv_skb+0xa4/0xc0 [ 26.693538] [<ffffffff8174962a>] rtnetlink_rcv+0x2a/0x40 [ 26.693539] [<ffffffff8177315a>] netlink_unicast+0x19a/0x290 [ 26.693540] [<ffffffff817730d4>] ? netlink_unicast+0x114/0x290 [ 26.693541] [<ffffffff81773713>] netlink_sendmsg+0x4c3/0x620 [ 26.693543] [<ffffffff81715488>] sock_sendmsg+0x38/0x50 [ 26.693544] [<ffffffff81715fa9>] ___sys_sendmsg+0x2c9/0x2e0 [ 26.693546] [<ffffffff810268b9>] ? sched_clock+0x9/0x10 [ 26.693548] [<ffffffff810e82dc>] ? local_clock+0x1c/0x20 [ 26.693551] [<ffffffff812978f2>] ? __fget+0x122/0x200 [ 26.693553] [<ffffffff812977d5>] ? __fget+0x5/0x200 [ 26.693554] [<ffffffff81297a3a>] ? __fget_light+0x2a/0x90 [ 26.693556] [<ffffffff81716cf1>] __sys_sendmsg+0x51/0x90 [ 26.693558] [<ffffffff81716d42>] SyS_sendmsg+0x12/0x20 [ 26.693559] [<ffffffff8187fe32>] entry_SYSCALL_64_fastpath+0x12/0x76 [ 26.706745] be2net 0000:85:00.0 ocrdma_roce: Link is Up
Attachment:
signature.asc
Description: OpenPGP digital signature