[Bug 206687] New: If IB link comes up, oops in port_pkey_list_insert triggered with "NULL pointer derefence"

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



https://bugzilla.kernel.org/show_bug.cgi?id=206687

            Bug ID: 206687
           Summary: If IB link comes up, oops in port_pkey_list_insert
                    triggered with "NULL pointer derefence"
           Product: Drivers
           Version: 2.5
    Kernel Version: v5.4.21 ongoing
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: high
          Priority: P1
         Component: Infiniband/RDMA
          Assignee: drivers_infiniband-rdma@xxxxxxxxxxxxxxxxxxxx
          Reporter: hjl@xxxxxxxxxxxxxxxxxxxxx
        Regression: No

I am running a Fedora home server for testing purpose and using QLA7322
Infinband adapters to connect my workstation.

Since the upgrade to Fedora 31, that comes with a v5.5.5 kernel, the system
crashes as the network manager tries to configure the IPoIB interface.

I checked the last installed fedora 30 kernel and all went fine. 
Next I used the vanilla-kernels v5.4.19, v5.4.20 and v5.4.21 and tracked the
problem down to the v5.4.21

The problem still exists with the v5.5.6-rc3 kernel.

-------------------

Feb 27 09:58:27 odin.langes-netz.home kernel: IPv6: ADDRCONF(NETDEV_CHANGE):
ibp9s0: link becomes ready
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0677] device (ibp9s0): carrier: link connected
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0679] device (ibp9s0): state change: unavailable -> disconnected
(reason 'carrier-changed', sys-iface-state: 'managed')
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0693] policy: auto-activating connection 'fastlane'
(e55b03b4-79d4-4cf7-89af-ea866965c8ba)
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0700] device (ibp9s0): Activation: starting connection 'fastlane'
(e55b03b4-79d4-4cf7-89af-ea866965c8ba)
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0702] device (ibp9s0): state change: disconnected -> prepare
(reason 'none', sys-iface-state: 'managed')
Feb 27 09:58:27 odin.langes-netz.home NetworkManager[1046]: <info> 
[1582797507.0708] manager: NetworkManager state is now CONNECTING
Feb 27 09:58:27 odin.langes-netz.home kernel: BUG: kernel NULL pointer
dereference, address: 0000000000000010
Feb 27 09:58:27 odin.langes-netz.home kernel: #PF: supervisor read access in
kernel mode
Feb 27 09:58:27 odin.langes-netz.home kernel: #PF: error_code(0x0000) -
not-present page
Feb 27 09:58:27 odin.langes-netz.home kernel: PGD 0 P4D 0 
Feb 27 09:58:27 odin.langes-netz.home kernel: Oops: 0000 [#1] SMP NOPTI
Feb 27 09:58:27 odin.langes-netz.home kernel: CPU: 0 PID: 1046 Comm:
NetworkManager Not tainted 5.4.21 #8
Feb 27 09:58:27 odin.langes-netz.home kernel: Hardware name: System
manufacturer System Product Name/PRIME X470-PRO, BIOS 5220 09/11/2019
Feb 27 09:58:27 odin.langes-netz.home kernel: RIP:
0010:get_pkey_idx_qp_list+0x5a/0x80 [ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel: Code: 06 48 69 ff b8 00 00 00 48
03 bd 88 04 00 00 4c 8b 47 20 48 8d 47 20 49 39 c0 74 26 0f b7 53 04 eb 08 4d
8b 00 49 39 c0 74 18 <66> 41 39 50 10 75 f1 48 83 c7 18 c6 07 00 0f 1f 40 00 4c
>
Feb 27 09:58:27 odin.langes-netz.home kernel: RSP: 0018:ffffb430c0b9f318
EFLAGS: 00010203
Feb 27 09:58:27 odin.langes-netz.home kernel: RAX: ffff895f3b41e030 RBX:
ffff895f3667bd80 RCX: 0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: RDX: 0000000000000000 RSI:
0000000000000000 RDI: ffff895f3b41e010
Feb 27 09:58:27 odin.langes-netz.home kernel: RBP: ffff895f498d8000 R08:
0000000000000000 R09: ffff895f3667bd80
Feb 27 09:58:27 odin.langes-netz.home kernel: R10: ffffb430c0b9f548 R11:
0000000000000000 R12: 0000000000000071
Feb 27 09:58:27 odin.langes-netz.home kernel: R13: 0000000000000000 R14:
ffff895f498d8000 R15: ffffb430c0b9f430
Feb 27 09:58:27 odin.langes-netz.home kernel: FS:  00007f6a13521bc0(0000)
GS:ffff895f4e800000(0000) knlGS:0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Feb 27 09:58:27 odin.langes-netz.home kernel: CR2: 0000000000000010 CR3:
00000003fc056000 CR4: 00000000003406f0
Feb 27 09:58:27 odin.langes-netz.home kernel: Call Trace:
Feb 27 09:58:27 odin.langes-netz.home kernel:  port_pkey_list_insert+0x30/0x1a0
[ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
kmem_cache_alloc_trace+0x162/0x220
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
ib_security_modify_qp+0xae/0x3a0 [ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel: 
ib_security_modify_qp+0x23f/0x3a0 [ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel:  _ib_modify_qp+0x272/0x3e0
[ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? __dev_mc_del+0x53/0x70
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? rt6_age_exceptions+0x61/0x70
Feb 27 09:58:27 odin.langes-netz.home kernel:  ipoib_init_qp+0x78/0x1a0
[ib_ipoib]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? fib6_clean_tree+0x58/0x80
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? fib6_del+0x250/0x250
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? nf_conntrack_lock+0x17/0x50
[nf_conntrack]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? node_free_rcu+0x20/0x20
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? ib_find_pkey+0x98/0xe0
[ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel: 
ipoib_ib_dev_open_default+0x1a/0x180 [ib_ipoib]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ipoib_ib_dev_open+0x66/0xa0
[ib_ipoib]
Feb 27 09:58:27 odin.langes-netz.home kernel:  ipoib_open+0x44/0x110 [ib_ipoib]
Feb 27 09:58:27 odin.langes-netz.home kernel:  __dev_open+0xcf/0x160
Feb 27 09:58:27 odin.langes-netz.home kernel:  __dev_change_flags+0x1a7/0x200
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? __dev_notify_flags+0x96/0xf0
Feb 27 09:58:27 odin.langes-netz.home kernel:  dev_change_flags+0x21/0x60
Feb 27 09:58:27 odin.langes-netz.home kernel:  do_setlink+0x667/0xd70
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
__nla_validate_parse+0x51/0x830
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? cpumask_next+0x17/0x20
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
__nla_validate_parse+0x51/0x830
Feb 27 09:58:27 odin.langes-netz.home kernel:  __rtnl_newlink+0x553/0x8c0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? __nla_reserve+0x38/0x50
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? prep_new_page+0xc4/0xf0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? __nla_reserve+0x38/0x50
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? __nla_put+0xc/0x20
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? prep_new_page+0xc4/0xf0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
get_page_from_freelist+0x772/0x17a0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? nla_put+0x28/0x40
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
get_page_from_freelist+0x772/0x17a0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? new_slab+0x25e/0x4e0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? _cond_resched+0x15/0x30
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
kmem_cache_alloc_trace+0x162/0x220
Feb 27 09:58:27 odin.langes-netz.home kernel:  rtnl_newlink+0x44/0x70
Feb 27 09:58:27 odin.langes-netz.home kernel:  rtnetlink_rcv_msg+0x2b0/0x360
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? _cond_resched+0x15/0x30
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? kmem_cache_alloc+0x165/0x220
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? ktime_get_real_ts64+0x46/0xe0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? _cond_resched+0x15/0x30
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? rtnl_calcit.isra.0+0x110/0x110
Feb 27 09:58:27 odin.langes-netz.home kernel:  netlink_rcv_skb+0x49/0x110
Feb 27 09:58:27 odin.langes-netz.home kernel:  netlink_unicast+0x171/0x200
Feb 27 09:58:27 odin.langes-netz.home kernel:  netlink_sendmsg+0x21e/0x3e0
Feb 27 09:58:27 odin.langes-netz.home kernel:  sock_sendmsg+0x5e/0x60
Feb 27 09:58:27 odin.langes-netz.home kernel:  ____sys_sendmsg+0x1ef/0x260
Feb 27 09:58:27 odin.langes-netz.home kernel:  ?
copy_msghdr_from_user+0xd6/0x150
Feb 27 09:58:27 odin.langes-netz.home kernel:  ___sys_sendmsg+0x88/0xd0
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? do_filp_open+0xa5/0x100
Feb 27 09:58:27 odin.langes-netz.home kernel:  ? list_lru_add+0xb5/0x1d0
Feb 27 09:58:27 odin.langes-netz.home kernel:  __sys_sendmsg+0x59/0xa0
Feb 27 09:58:27 odin.langes-netz.home kernel:  do_syscall_64+0x5b/0x1a0
Feb 27 09:58:27 odin.langes-netz.home kernel: 
entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb 27 09:58:27 odin.langes-netz.home kernel: RIP: 0033:0x7f6a144ff80d
Feb 27 09:58:27 odin.langes-netz.home kernel: Code: 28 89 54 24 1c 48 89 74 24
10 89 7c 24 08 e8 ea ec ff ff 8b 54 24 1c 48 8b 74 24 10 41 89 c0 8b 7c 24 08
b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2f 44 89 c7 48 89 44 24 08 e8 1e ed
>
Feb 27 09:58:27 odin.langes-netz.home kernel: RSP: 002b:00007fff7308a310
EFLAGS: 00000293 ORIG_RAX: 000000000000002e
Feb 27 09:58:27 odin.langes-netz.home kernel: RAX: ffffffffffffffda RBX:
000055fa8bfdb540 RCX: 00007f6a144ff80d
Feb 27 09:58:27 odin.langes-netz.home kernel: RDX: 0000000000000000 RSI:
00007fff7308a360 RDI: 000000000000000c
Feb 27 09:58:27 odin.langes-netz.home kernel: RBP: 00007fff7308a360 R08:
0000000000000000 R09: 0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: R10: 0000000000000000 R11:
0000000000000293 R12: 000055fa8bfdb540
Feb 27 09:58:27 odin.langes-netz.home kernel: R13: 00007fff7308a518 R14:
00007fff7308a50c R15: 0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: Modules linked in: xt_CHECKSUM
xt_MASQUERADE nf_nat_tftp nf_conntrack_tftp tun bridge stp llc
nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_REJECT nf_reject_ipv6
ip6t_rpfilter ipt>
Feb 27 09:58:27 odin.langes-netz.home kernel:  crct10dif_pclmul eeepc_wmi
ib_core crc32_pclmul snd_timer asus_wmi sp5100_tco ghash_clmulni_intel k10temp
i2c_piix4 wmi_bmof snd sparse_keymap rfkill soundcore pcspkr ccp gpio_amdpt
gpio_gen>
Feb 27 09:58:27 odin.langes-netz.home kernel: CR2: 0000000000000010
Feb 27 09:58:27 odin.langes-netz.home kernel: ---[ end trace 8fdfe3a62f9689a5
]---
Feb 27 09:58:27 odin.langes-netz.home kernel: RIP:
0010:get_pkey_idx_qp_list+0x5a/0x80 [ib_core]
Feb 27 09:58:27 odin.langes-netz.home kernel: Code: 06 48 69 ff b8 00 00 00 48
03 bd 88 04 00 00 4c 8b 47 20 48 8d 47 20 49 39 c0 74 26 0f b7 53 04 eb 08 4d
8b 00 49 39 c0 74 18 <66> 41 39 50 10 75 f1 48 83 c7 18 c6 07 00 0f 1f 40 00 4c
>
Feb 27 09:58:27 odin.langes-netz.home kernel: RSP: 0018:ffffb430c0b9f318
EFLAGS: 00010203
Feb 27 09:58:27 odin.langes-netz.home kernel: RAX: ffff895f3b41e030 RBX:
ffff895f3667bd80 RCX: 0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: RDX: 0000000000000000 RSI:
0000000000000000 RDI: ffff895f3b41e010
Feb 27 09:58:27 odin.langes-netz.home kernel: RBP: ffff895f498d8000 R08:
0000000000000000 R09: ffff895f3667bd80
Feb 27 09:58:27 odin.langes-netz.home kernel: R10: ffffb430c0b9f548 R11:
0000000000000000 R12: 0000000000000071
Feb 27 09:58:27 odin.langes-netz.home kernel: R13: 0000000000000000 R14:
ffff895f498d8000 R15: ffffb430c0b9f430
Feb 27 09:58:27 odin.langes-netz.home kernel: FS:  00007f6a13521bc0(0000)
GS:ffff895f4e800000(0000) knlGS:0000000000000000
Feb 27 09:58:27 odin.langes-netz.home kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Feb 27 09:58:27 odin.langes-netz.home kernel: CR2: 0000000000000010 CR3:
00000003fc056000 CR4: 00000000003406f0

-- 
You are receiving this mail because:
You are watching the assignee of the bug.



[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]

  Powered by Linux