Is there any vmcore about this problem? On Wed, Dec 25, 2019 at 1:03 PM Frank Huang <tigerinxm@xxxxxxxxx> wrote: > > hi, there is a panic on rdma_rxe module when the restart > network.service or shutdown the switch. > > it looks like a use-after-free error. > > everytime it happens, there is the log "rdma_rxe: Unknown layer 3 protocol: 0" > > is it a known error? > > my kernel version is 4.14.97 > > [448840.314544] rdma_rxe: Unknown layer 3 protocol: 0 > [448840.314626] general protection fault: 0000 [#1] SMP PTI > [448840.314627] Modules linked in: binfmt_misc ib_isert > iscsi_target_mod ib_srpt target_core_mod rpcrdma ib_iser ib_srp > scsi_transport_srp rdma_rxe(OE) ib_ipoib ib_umad ip6_udp_tunnel > udp_tunnel rdma_ucm rdma_cm iw_cm ib_cm ib_uverbs ib_core > ebtable_filter ebtables devlink ip6table_filter ip6_tables > ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink iptable_nat > xt_addrtype xt_conntrack br_netfilter bridge stp llc overlay > ip_set_hash_ip ip_set nfnetlink iscsi_tcp libiscsi_tcp libiscsi > scsi_transport_iscsi sch_ingress openvswitch nf_conntrack_ipv6 > nf_nat_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 > nf_defrag_ipv6 nf_nat nf_conntrack libcrc32c sunrpc intel_rapl > x86_pkg_temp_thermal intel_powerclamp coretemp vfat fat kvm_intel kvm > irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel > intel_cstate > [448840.314677] intel_uncore intel_rapl_perf mxm_wmi iTCO_wdt > iTCO_vendor_support ipmi_ssif pcspkr i2c_i801 lpc_ich ipmi_si > ipmi_devintf ipmi_msghandler pcc_cpufreq shpchp wmi ast drm_kms_helper > ttm crc32c_intel drm ixgbe igb mdio ptp pps_core dca i2c_algo_bit > [448840.314700] CPU: 1 PID: 17 Comm: ksoftirqd/1 Tainted: G > OE 4.14.97-el7.centos.x86_64 #1 > [448840.314701] Hardware name: /80010211 , BIOS 3.12 11/27/2018 > [448840.314703] task: ffff9ce768af8000 task.stack: ffffbd7c4c6c4000 > [448840.314710] RIP: 0010:rxe_elem_release+0xf/0x60 [rdma_rxe] > [448840.314711] RSP: 0018:ffffbd7c4c6c7d28 EFLAGS: 00010246 > [448840.314713] RAX: 0000000000000000 RBX: 2917351aae258b92 RCX: > 0000000000000000 > [448840.314714] RDX: ffff9cfb3f64ba40 RSI: 000000000000026c RDI: > ffff9cfb3f678008 > [448840.314715] RBP: ffff9cfb3f678000 R08: 0000000000000201 R09: > ffffbd7c4df35000 > [448840.314716] R10: 0000000000000000 R11: 0000000000000001 R12: > 0000000000000000 > [448840.314717] R13: 000000000000001d R14: 0000000000000006 R15: > ffff9cfb3f678000 > [448840.314719] FS: 0000000000000000(0000) GS:ffff9ce76f840000(0000) > knlGS:0000000000000000 > [448840.314720] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [448840.314721] CR2: 00007f4fc400f000 CR3: 000000260420a005 CR4: > 00000000001626e0 > [448840.314723] Call Trace: > [448840.314730] rxe_responder+0xcf0/0x1fe0 [rdma_rxe] > [448840.314738] ? check_preempt_wakeup+0x125/0x240 > [448840.314742] ? check_preempt_curr+0x84/0x90 > [448840.314745] ? ttwu_do_wakeup+0x19/0x140 > [448840.314747] ? try_to_wake_up+0x54/0x450 > [448840.314751] rxe_do_task+0x8b/0x100 [rdma_rxe] > [448840.314754] tasklet_action+0xfe/0x110 > [448840.314758] __do_softirq+0xd9/0x2a2 > [448840.314761] run_ksoftirqd+0x1e/0x70 > [448840.314763] smpboot_thread_fn+0x10e/0x160 > [448840.314766] kthread+0xff/0x140 > [448840.314768] ? sort_range+0x20/0x20 > [448840.314770] ? __kthread_parkme+0x90/0x90 > [448840.314771] ret_from_fork+0x35/0x40 > [448840.314773] Code: 7a 00 00 74 04 31 c0 eb c3 4c 89 e7 e8 bb f9 ff > ff 31 c0 eb b7 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 8d 6f f8 53 > 48 8b 5f f8 <48> 8b 43 20 48 85 c0 74 08 48 89 ef e8 60 1c 53 fb 8b 43 > 30 48 > [448840.314817] RIP: rxe_elem_release+0xf/0x60 [rdma_rxe] RSP: ffffbd7c4c6c7d28