Hi Florian Update kernel to 5.19.10 but still same now have small more info in dmesg check log down this rull add for flow: table inet filter { flowtable f { hook ingress priority 0 devices = { bond0.100, vlan200, vlan234, vlan340, vlan2432, vlan3214, vlan3675 } } chain forward { type filter hook forward priority raw; policy accept; ip protocol { tcp , udp, icmp } flow offload @f; ct state established,related accept ip protocol { tcp, udp, icmp } accept } } bond0.100 - is uplink other vlanXXXX is to users LOG. ————————>>>> Sep 23 07:10:01 [ 179.651419][ C16] invalid opcode: 0000 [#1] SMP Sep 23 07:10:01 [ 179.652068][ C16] CPU: 16 PID: 1565 Comm: zebra Tainted: G O 5.19.10 #1 Sep 23 07:10:01 [ 179.653170][ C16] Hardware name: Supermicro SYS-5038MR-H8TRF/X10SRD-F, BIOS 3.3 10/28/2020 Sep 23 07:10:01 [ 179.654320][ C16] RIP: 0010:__get_vm_area_node+0x120/0x130 Sep 23 07:10:01 [ 179.655094][ C16] Code: ff bd ff ff ff ff 48 0f bd e8 b8 0c 00 00 00 ff c5 39 c5 0f 4c e8 b8 1e 00 00 00 39 c5 0f 4f e8 c4 e2 d1 f7 e9 e9 2e ff ff ff <0f> 0b 4c 89 ff e8 86 80 01 00 45 31 ff eb b6 90 41 57 41 56 41 55 Sep 23 07:10:01 [ 179.657748][ C16] RSP: 0000:ffff8f47c0b8f888 EFLAGS: 00010206 Sep 23 07:10:01 [ 179.658553][ C16] RAX: 00000000ffffffff RBX: 0000000000002b20 RCX: 0000000000000100 Sep 23 07:10:01 [ 179.659619][ C16] RDX: 0000000000000015 RSI: 0000000000200000 RDI: 0000000000400040 Sep 23 07:10:01 [ 179.660684][ C16] RBP: 0000000000000015 R08: ffff8f47c0000000 R09: ffffaf47bfffffff Sep 23 07:10:01 [ 179.661750][ C16] R10: 0000000000000010 R11: ffff8cce2ad46e40 R12: 0000000000000422 Sep 23 07:10:01 [ 179.662815][ C16] R13: 0000000000400040 R14: 0000000000000015 R15: fffffffffffffff5 Sep 23 07:10:01 [ 179.663880][ C16] FS: 00007ff4c21d1bc0(0000) GS:ffff8cd41fa00000(0000) knlGS:0000000000000000 Sep 23 07:10:01 [ 179.665077][ C16] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Sep 23 07:10:01 [ 179.665953][ C16] CR2: 0000000002492e98 CR3: 000000010ffb3002 CR4: 00000000003706e0 Sep 23 07:10:01 [ 179.667020][ C16] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Sep 23 07:10:01 [ 179.668085][ C16] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Sep 23 07:10:01 [ 179.669153][ C16] Call Trace: Sep 23 07:10:01 [ 179.669582][ C16] <TASK> Sep 23 07:10:01 [ 179.669962][ C16] __vmalloc_node_range+0x96/0x1e0 Sep 23 07:10:01 [ 179.670639][ C16] ? bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:01 [ 179.670672][ C6] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.670714][ C14] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.670716][ C14] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.671395][ C16] kvmalloc_node+0x92/0xb0 Sep 23 07:10:01 [ 179.672111][ C27] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.672112][ C27] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.672118][ C6] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.672839][ C16] ? bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:01 [ 179.676931][ C16] bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:01 [ 179.677423][ C29] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.677662][ C16] rhashtable_try_insert+0x3a4/0x440 Sep 23 07:10:01 [ 179.678385][ C29] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.679082][ C16] rhashtable_insert_slow+0x1b/0x30 Sep 23 07:10:01 [ 179.680319][ C4] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.680444][ C16] flow_offload_add+0x4d/0x130 [nf_flow_table] Sep 23 07:10:01 [ 179.681167][ C4] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.681985][ C16] nft_flow_offload_eval+0x22c/0x2ab [nft_flow_offload] Sep 23 07:10:01 [ 179.683598][ C16] ? dev_hard_start_xmit+0x95/0xe0 Sep 23 07:10:01 [ 179.684275][ C16] nft_do_chain+0x120/0x4c0 [nf_tables] Sep 23 07:10:01 [ 179.685017][ C16] ? __pppoe_xmit+0x11c/0x190 [pppoe] Sep 23 07:10:01 [ 179.685727][ C16] ? fib_validate_source+0x37/0xd0 Sep 23 07:10:01 [ 179.686403][ C16] ? __mkroute_input+0x102/0x310 Sep 23 07:10:01 [ 179.687054][ C16] ? ip_route_input_slow+0x394/0x8c0 Sep 23 07:10:01 [ 179.687753][ C16] ? nf_nat_ipv4_manip_pkt+0x53/0xe0 [nf_nat] Sep 23 07:10:01 [ 179.714083][ C16] nft_do_chain_inet+0x76/0xc0 [nf_tables] Sep 23 07:10:01 [ 179.740597][ C16] nf_hook_slow+0x36/0xa0 Sep 23 07:10:01 [ 179.767009][ C16] ip_forward+0x46c/0x4c0 Sep 23 07:10:01 [ 179.792958][ C16] ? lookup+0x82/0xf0 Sep 23 07:10:01 [ 179.818252][ C16] ? ip4_obj_hashfn+0xc0/0xc0 Sep 23 07:10:01 [ 179.843125][ C16] __netif_receive_skb_one_core+0x3f/0x50 Sep 23 07:10:01 [ 179.867784][ C16] process_backlog+0x7c/0x110 Sep 23 07:10:01 [ 179.881772][ C1] ------------[ cut here ]------------ Sep 23 07:10:01 [ 179.891767][ C16] __napi_poll+0x20/0x100 Sep 23 07:10:01 [ 179.891772][ C16] net_rx_action+0x26d/0x330 Sep 23 07:10:01 [ 179.915310][ C1] kernel BUG at mm/vmalloc.c:2437! Sep 23 07:10:01 [ 179.938247][ C16] __do_softirq+0xaf/0x1d7 Sep 23 07:10:01 [ 180.006648][ C16] __irq_exit_rcu+0x9a/0xd0 Sep 23 07:10:01 [ 180.029429][ C16] sysvec_call_function_single+0x32/0x80 Sep 23 07:10:01 [ 180.052361][ C16] asm_sysvec_call_function_single+0x16/0x20 Sep 23 07:10:01 [ 180.075020][ C16] RIP: 0033:0x4cd65e Sep 23 07:10:01 [ 180.097176][ C16] Code: 5b 41 5c 41 5d 41 5e 41 5f 5d c3 90 48 8b 50 08 48 81 fa 20 f7 58 00 74 d3 48 85 d2 74 ce 31 f6 0f 1f 40 00 48 89 d0 48 8b 12 <8b> 48 38 48 81 fa 20 f7 58 00 74 26 83 f9 01 75 09 83 48 54 02 be Sep 23 07:10:01 [ 180.144341][ C16] RSP: 002b:00007ffce18b4490 EFLAGS: 00000246 Sep 23 07:10:01 [ 180.168545][ C16] RAX: 00000000020edb60 RBX: 0000000000000000 RCX: 00000000020e3f10 Sep 23 07:10:01 [ 180.192866][ C16] RDX: 000000000058f720 RSI: 0000000000000000 RDI: 00000000020e3f10 Sep 23 07:10:01 [ 180.217209][ C16] RBP: 00007ffce18b44d0 R08: 00007ff4c24d5b20 R09: 00007ff4c24d5b20 Sep 23 07:10:01 [ 180.241690][ C16] R10: 0000000000000000 R11: 0000000000000020 R12: 00000000ffffffff Sep 23 07:10:01 [ 180.266212][ C16] R13: 00000000020edec0 R14: 00007ff4c2ae6340 R15: 00007ffce18b4540 Sep 23 07:10:01 [ 180.290653][ C16] </TASK> Sep 23 07:10:01 [ 180.315127][ C16] Modules linked in: nf_conntrack_netlink nft_limit pppoe pppox ppp_generic slhc nft_ct nft_flow_offload nf_flow_table_inet nf_flow_table nft_nat nft_chain_nat nf_tables netconsole coretemp bonding i40e nf_nat_sip nf_conntrack_sip nf_nat_pptp nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_ftp nf_conntrack_ftp nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler rtc_cmos Sep 23 07:10:04 [ 180.427060][ C14] invalid opcode: 0000 [#2] SMP Sep 23 07:10:04 [ 180.427076][ C16] ---[ end trace 0000000000000000 ]--- Sep 23 07:10:04 [ 180.456698][ C14] CPU: 14 PID: 0 Comm: swapper/14 Tainted: G D O 5.19.10 #1 Sep 23 07:10:04 [ 180.456702][ C14] Hardware name: Supermicro SYS-5038MR-H8TRF/X10SRD-F, BIOS 3.3 10/28/2020 Sep 23 07:10:04 [ 180.456703][ C14] RIP: 0010:__get_vm_area_node+0x120/0x130 Sep 23 07:10:04 [ 180.486304][ C16] RIP: 0010:__get_vm_area_node+0x120/0x130 Sep 23 07:10:04 [ 180.516494][ C14] Code: ff bd ff ff ff ff 48 0f bd e8 b8 0c 00 00 00 ff c5 39 c5 0f 4c e8 b8 1e 00 00 00 39 c5 0f 4f e8 c4 e2 d1 f7 e9 e9 2e ff ff ff <0f> 0b 4c 89 ff e8 86 80 01 00 45 31 ff eb b6 90 41 57 41 56 41 55 Sep 23 07:10:04 [ 180.516497][ C14] RSP: 0018:ffff8f47c0450950 EFLAGS: 00010206 Sep 23 07:10:04 [ 180.547033][ C16] Code: ff bd ff ff ff ff 48 0f bd e8 b8 0c 00 00 00 ff c5 39 c5 0f 4c e8 b8 1e 00 00 00 39 c5 0f 4f e8 c4 e2 d1 f7 e9 e9 2e ff ff ff <0f> 0b 4c 89 ff e8 86 80 01 00 45 31 ff eb b6 90 41 57 41 56 41 55 Sep 23 07:10:04 [ 180.577559][ C14] Sep 23 07:10:04 [ 180.577560][ C14] RAX: 00000000ffffffff RBX: 0000000000002b20 RCX: 0000000000000100 Sep 23 07:10:04 [ 180.577563][ C14] RDX: 0000000000000015 RSI: 0000000000200000 RDI: 0000000000400040 Sep 23 07:10:04 [ 180.607691][ C16] RSP: 0000:ffff8f47c0b8f888 EFLAGS: 00010206 Sep 23 07:10:04 [ 180.670750][ C14] RBP: 0000000000000015 R08: ffff8f47c0000000 R09: ffffaf47bfffffff Sep 23 07:10:04 [ 180.670752][ C14] R10: 0000000000000010 R11: ffff8cce08348180 R12: 0000000000000422 Sep 23 07:10:04 [ 180.670753][ C14] R13: 0000000000400040 R14: 0000000000000015 R15: fffffffffffffff5 Sep 23 07:10:04 [ 180.670755][ C14] FS: 0000000000000000(0000) GS:ffff8cd41f980000(0000) knlGS:0000000000000000 Sep 23 07:10:04 [ 180.702393][ C16] Sep 23 07:10:04 [ 180.767917][ C14] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Sep 23 07:10:04 [ 180.767920][ C14] CR2: 00007fd6aad174a0 CR3: 0000000106941004 CR4: 00000000003706e0 Sep 23 07:10:04 [ 180.767921][ C14] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Sep 23 07:10:04 [ 180.767922][ C14] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Sep 23 07:10:04 [ 180.767923][ C14] Call Trace: Sep 23 07:10:04 [ 180.800058][ C16] RAX: 00000000ffffffff RBX: 0000000000002b20 RCX: 0000000000000100 Sep 23 07:10:04 [ 180.833230][ C14] <IRQ> Sep 23 07:10:04 [ 180.833233][ C14] __vmalloc_node_range+0x96/0x1e0 Sep 23 07:10:04 [ 180.866187][ C16] RDX: 0000000000000015 RSI: 0000000000200000 RDI: 0000000000400040 Sep 23 07:10:04 [ 180.899166][ C14] ? bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:04 [ 180.932671][ C16] RBP: 0000000000000015 R08: ffff8f47c0000000 R09: ffffaf47bfffffff Sep 23 07:10:04 [ 180.966165][ C14] kvmalloc_node+0x92/0xb0 Sep 23 07:10:04 [ 180.999641][ C16] R10: 0000000000000010 R11: ffff8cce2ad46e40 R12: 0000000000000422 Sep 23 07:10:04 [ 181.033426][ C14] ? bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:04 [ 181.066801][ C16] R13: 0000000000400040 R14: 0000000000000015 R15: fffffffffffffff5 Sep 23 07:10:04 [ 181.101028][ C14] bucket_table_alloc.isra.0+0x47/0x140 Sep 23 07:10:04 [ 181.135829][ C16] FS: 00007ff4c21d1bc0(0000) GS:ffff8cd41fa00000(0000) knlGS:0000000000000000 Sep 23 07:10:04 [ 181.170624][ C14] rhashtable_try_insert+0x3a4/0x440 Sep 23 07:10:04 [ 181.205056][ C16] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Sep 23 07:10:04 [ 181.239090][ C14] rhashtable_insert_slow+0x1b/0x30 Sep 23 07:10:04 [ 181.274046][ C16] CR2: 0000000002492e98 CR3: 000000010ffb3002 CR4: 00000000003706e0 Sep 23 07:10:04 [ 181.308594][ C14] flow_offload_add+0x4d/0x130 [nf_flow_table] Sep 23 07:10:04 [ 181.343542][ C16] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Sep 23 07:10:04 [ 181.379181][ C14] nft_flow_offload_eval+0x22c/0x2ab [nft_flow_offload] Sep 23 07:10:04 [ 181.414802][ C16] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Sep 23 07:10:04 [ 181.451204][ C14] nft_do_chain+0x120/0x4c0 [nf_tables] Sep 23 07:10:04 [ 181.487413][ C16] Kernel panic - not syncing: Fatal exception in interrupt Sep 23 07:10:04 [ 181.524356][ C14] ? rt_cache_route+0xc5/0xe0 Sep 23 07:10:04 [ 181.561275][ C14] ? rt_set_nexthop.constprop.0+0x1bf/0x420 Sep 23 07:10:04 [ 181.598336][ C14] ? dst_alloc+0x133/0x150 Sep 23 07:10:04 [ 181.634807][ C14] ? __mkroute_input+0x192/0x310 Sep 23 07:10:04 [ 181.671737][ C14] ? ip_route_input_slow+0x394/0x8c0 Sep 23 07:10:04 [ 181.708488][ C14] ? nf_nat_ipv4_manip_pkt+0x53/0xe0 [nf_nat] Sep 23 07:10:04 [ 181.745869][ C14] nft_do_chain_inet+0x76/0xc0 [nf_tables] Sep 23 07:10:04 [ 181.783168][ C14] nf_hook_slow+0x36/0xa0 Sep 23 07:10:04 [ 181.821108][ C14] ip_forward+0x46c/0x4c0 Sep 23 07:10:04 [ 181.859196][ C14] ? lookup+0x82/0xf0 Sep 23 07:10:04 [ 181.897547][ C14] ? ip4_obj_hashfn+0xc0/0xc0 Sep 23 07:10:04 [ 181.935410][ C14] __netif_receive_skb_one_core+0x3f/0x50 Sep 23 07:10:04 [ 181.972790][ C14] process_backlog+0x7c/0x110 Sep 23 07:10:04 [ 182.009175][ C14] __napi_poll+0x20/0x100 Sep 23 07:10:04 [ 182.045007][ C14] net_rx_action+0x26d/0x330 Sep 23 07:10:04 [ 182.079976][ C14] __do_softirq+0xaf/0x1d7 Sep 23 07:10:04 [ 182.115176][ C14] do_softirq+0x5a/0x80 Sep 23 07:10:04 [ 182.150034][ C14] </IRQ> Sep 23 07:10:04 [ 182.184488][ C14] <TASK> Sep 23 07:10:04 [ 182.218568][ C14] flush_smp_call_function_queue+0x3f/0x60 Sep 23 07:10:04 [ 182.253213][ C14] do_idle+0xa6/0xc0 Sep 23 07:10:04 [ 182.287348][ C14] cpu_startup_entry+0x14/0x20 Sep 23 07:10:04 [ 182.320668][ C14] start_secondary+0xd6/0xe0 Sep 23 07:10:04 [ 182.353300][ C14] secondary_startup_64_no_verify+0xd3/0xdb Sep 23 07:10:04 [ 182.385163][ C14] </TASK> Sep 23 07:10:04 [ 182.416402][ C14] Modules linked in: nf_conntrack_netlink nft_limit Sep 23 07:10:04 [ 182.516786][ C16] Shutting down cpus with NMI Sep 23 07:10:04 [ 182.620973][ C16] Kernel Offset: 0x13000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff) Sep 23 07:10:04 [ 182.624896][ C16] Rebooting in 10 seconds.. Best regrads, Martin > On 22 Sep 2022, at 15:12, Florian Westphal <fw@xxxxxxxxx> wrote: > > Martin Zaharinov <micron10@xxxxxxxxx> wrote: >> This is bug report for flowtable and kernel 5.19.9 >> >> simple config nat + flowtable > > CC mm experts. I'm not sure that this is a bug in netfilter/rhashtable, > looks like mm problem perhaps? > > I am a bit confused wrt. kvzalloc+GFP_ATOMIC. This looks like following is happening: > > 5.19.9 kernel BUGs with: > >> Sep 22 07:43:49 [460691.305266][ C28] kernel BUG at mm/vmalloc.c:2437! > > [ BUG_ON(in_interrupt ] > >> Sep 22 07:43:50 [460692.031617][ C28] Call Trace: >> Sep 22 07:43:50 [460692.064498][ C28] <IRQ> >> Sep 22 07:43:50 [460692.096177][ C28] __vmalloc_node_range+0x96/0x1e0 >> Sep 22 07:43:50 [460692.128014][ C28] ? bucket_table_alloc.isra.0+0x47/0x140 >> Sep 22 07:43:50 [460692.160134][ C28] kvmalloc_node+0x92/0xb0 >> Sep 22 07:43:50 [460692.191885][ C28] ? bucket_table_alloc.isra.0+0x47/0x140 >> Sep 22 07:43:50 [460692.224234][ C28] bucket_table_alloc.isra.0+0x47/0x140 >> Sep 22 07:43:50 [460692.256840][ C28] rhashtable_try_insert+0x3a4/0x440 > > [ rest irrelevant ] > > AFAICS this is caused by kvzalloc(GFP_ATOMIC) which somehow ends up in > GFP_KERNEL-only territory? Looking at recent history I see > > commit a421ef303008b0ceee2cfc625c3246fa7654b0ca > Author: Michal Hocko <mhocko@xxxxxxxx> > Date: Fri Jan 14 14:07:07 2022 -0800 > > mm: allow !GFP_KERNEL allocations for kvmalloc > > before this, GFP_ATOMIC made sure we stay with plain kmalloc, but > now it appears that we can end up in places where GFP_ATOMIC isn't > allowed? > > Original bug report is here: > https://lore.kernel.org/netdev/09BE0B8A-3ADF-458E-B75E-931B74996355@xxxxxxxxx/T/#u