Hello! Thanks for your bug report. > On Feb 19, 2020, at 10:22 PM, Wang Yugui <wangyugui@xxxxxxxxxxxx> wrote: > > Hi, chuck.lever > > a bug(BUG: kernel NULL pointer dereference) of ib or mlx happened in 5.4.21 but not in 5.4.20. > > maybe some releationship to xprtrdma-fix-dma-scatter-gather-list-mapping-imbalance.patch I don't see an obvious connection to fix-dma-scatter-gather-list-mapping-imbalance. The backtrace below is through IPoIB code paths. Those have nothing to do with NFS/RDMA, which is the only ULP code that is changed by my commit. > maybe the info is useful. I'm copying linux-rdma for a bigger set of eyeballs. My knee-jerk recommendation is that if you have a reliable reproducer, try "git bisect" between .20 and .21 to nail down a specific commit where the BUG starts to occur. > Feb 20 10:05:58 T630 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000010 > ... > Feb 20 10:05:58 T630 kernel: port_pkey_list_insert+0x30/0x1a0 [ib_core] > Feb 20 10:05:58 T630 kernel: ? kmem_cache_alloc_trace+0x219/0x230 > Feb 20 10:05:58 T630 kernel: ib_security_modify_qp+0x244/0x3b0 [ib_core] > Feb 20 10:05:58 T630 kernel: _ib_modify_qp+0x1c0/0x3c0 [ib_core] > Feb 20 10:05:58 T630 kernel: ? dma_pool_free+0x24/0xc0 > Feb 20 10:05:58 T630 kernel: ipoib_init_qp+0x77/0x190 [ib_ipoib] > Feb 20 10:05:58 T630 kernel: ? __mlx4_ib_query_pkey+0xe7/0x110 [mlx4_ib] > Feb 20 10:05:58 T630 kernel: ? ib_find_pkey+0x98/0xe0 [ib_core] > Feb 20 10:05:58 T630 kernel: ipoib_ib_dev_open_default+0x1a/0x180 [ib_ipoib] > Feb 20 10:05:58 T630 kernel: ipoib_ib_dev_open+0x66/0xa0 [ib_ipoib] > Feb 20 10:05:58 T630 kernel: ipoib_open+0x44/0x110 [ib_ipoib] > Feb 20 10:05:58 T630 kernel: __dev_open+0xcd/0x160 > > > # ibstat > CA 'mlx4_0' > CA type: MT4099 > Number of ports: 2 > Firmware version: 2.42.5000 > Hardware version: 1 > Node GUID: 0xe41d2d03007b4080 > System image GUID: 0xe41d2d03007b4083 > Port 1: > State: Down > Physical state: Polling > Rate: 10 > Base lid: 0 > LMC: 0 > SM lid: 0 > Capability mask: 0x02594868 > Port GUID: 0xe41d2d03007b4081 > Link layer: InfiniBand > Port 2: > State: Down > Physical state: Disabled > Rate: 40 > Base lid: 0 > LMC: 0 > SM lid: 0 > Capability mask: 0x00010000 > Port GUID: 0xe61d2dfffe7b4082 > Link layer: Ethernet > > Best Regards > 王玉贵 > 2020/02/20 > > -------------------------------------- > 北京京垓科技有限公司 > 王玉贵 wangyugui@xxxxxxxxxxxx > 电话:+86-136-71123776 > <bug-of-ib-in-5.4.21.message> -- Chuck Lever