Greetings! This patch series adds an eBPF JIT for MIPS32. The approach taken first updates existing code to support MIPS64/MIPS32 systems, then refactors source into a common core and dependent MIPS64 JIT, and finally adds a MIPS32 eBPF JIT implementation using the common framework. Compared to writing a standalone MIPS32 JIT, this approach has benefits for long-term maintainability, but has taken much longer than expected. This RFC posting is intended to share progress, gather feedback, and raise some questions with BPF and MIPS experts (which I'll cover later). Code Overview ============= The initial code updates and refactoring exposed a number of problems in the existing MIPS64 JIT, which the first several patches fix. Patch #11 updates common code to support MIPS64/MIPS32 operation. Patch #12 separates the common core from the MIPS64 JIT code. Patch #13 adds a needed MIPS32 uasm opcode, while patch #14 adds the MIPS32 eBPF JIT. On MIPS32, 64-bit BPF registers are mapped to 32-bit register pairs, and all 64-bit operations are built on 32-bit subregister ops. The MIPS32 tailcall counter is stored on the stack however. Notable changes from the MIPS64 JIT include: * BPF_JMP32: implement all conditionals * BPF_JMP | JSET | BPF_K: drop bbit insns only usable on MIPS64 Octeon Since MIPS32 does not include 64-bit div/mod or atomic opcodes, these BPF insns are implemented by directly calling the built-in kernel functions: (with thanks to Luke Nelson for posting similar code online) * BPF_STX | BPF_DW | BPF_XADD * BPF_ALU64 | BPF_DIV | BPF_X * BPF_ALU64 | BPF_DIV | BPF_K * BPF_ALU64 | BPF_MOD | BPF_X * BPF_ALU64 | BPF_MOD | BPF_K Testing ======= Testing used LTS kernel 5.10.x and stable 5.13.x running under QEMU. The test suite included the 'test_bpf' module and 'test_verifier' from kselftests. Using 'test_progs' from kselftests is too difficult in general since cross-compilation depends on libbpf/bpftool, which does not support cross-endian builds. The matrix of test configurations executed for this series covered the expected register sizes, MIPS ISA releases, and JIT settings: WORDSIZE={64-bit,32-bit} x ISA={R2,R6} x JIT={off,on,hardened} On MIPS32BE and MIPS32LE there was general parity between the results of interpreter vs. JIT-backed tests with respect to the numbers of PASSED, SKIPPED, and FAILED tests. The same was also true of MIPS64 retesting. For example, the results below on MIPS32 are typical. Note that skipped tests 854 and 855 are "scale" tests which result in OOM on the QEMU malta MIPS32 test systems. root@OpenWrt:~# sysctl net.core.bpf_jit_enable=1 root@OpenWrt:~# modprobe test_bpf ... test_bpf: Summary: 378 PASSED, 0 FAILED, [366/366 JIT'ed] root@OpenWrt:~# ./test_verifier 0 853 ... Summary: 1127 PASSED, 0 SKIPPED, 89 FAILED root@OpenWrt:~# ./test_verifier 855 1149 ... Summary: 408 PASSED, 7 SKIPPED, 53 FAILED Open Questions ============== 1. As seen in the patch series, the static analysis used by the MIPS64 JIT tends to be fragile in the face of verifier, insn and patching changes. After tracking down and fixing several related bugs, I wonder if it were better to remove the static analysis and leave things more robust and maintainable going forward. Paul, Thomas, David, what are your views? Do you have thoughts on how best to do this? Would it be possible to replace the static analysis by accessing verifier analysis results from a JIT? Daniel, Alexei, or Andrii? 2. The series tries to correctly handle tailcall counter across bpf2bpf and tailcalls, and it would be nice to properly support mixing these, but this is still a WIP for me. Much of what I've read seems very specific to the x86_64 JIT. Is there a good summary of the required changes for a JIT in general? Note: I built a MIPS32LE 'test_progs' after some horrible, ugly hacking, and the 'tailcall' tests pass but the 'tailcall_bpf2bpf' tests fail cryptically. I can send a log and strace if someone helpful could kindly take a look. Is there an alternative, good standalone test available? Possible Next Steps =================== 1. Implementing the new BPF_ATOMIC insns *should* be straightforward on MIPS32. I'm less certain of MIPS64 given the static analysis and related zext/sext logic. 2. The BPF_JMP32 class is another big gap on MIPS64. Has anyone looked at this before? It also ties to the static analysis, but on first glance appears feasible. Thanks in advance for any feedback or suggestions! Tony Ambardar (14): MIPS: eBPF: support BPF_TAIL_CALL in JIT static analysis MIPS: eBPF: mask 32-bit index for tail calls MIPS: eBPF: fix BPF_ALU|ARSH handling in JIT static analysis MIPS: eBPF: support BPF_JMP32 in JIT static analysis MIPS: eBPF: fix system hang with verifier dead-code patching MIPS: eBPF: fix JIT static analysis hang with bounded loops MIPS: eBPF: fix MOD64 insn on R6 ISA MIPS: eBPF: support long jump for BPF_JMP|EXIT MIPS: eBPF: drop src_reg restriction in BPF_LD|BPF_DW|BPF_IMM MIPS: eBPF: improve and clarify enum 'which_ebpf_reg' MIPS: eBPF: add core support for 32/64-bit systems MIPS: eBPF: refactor common MIPS64/MIPS32 functions and headers MIPS: uasm: Enable muhu opcode for MIPS R6 MIPS: eBPF: add MIPS32 JIT Documentation/admin-guide/sysctl/net.rst | 6 +- Documentation/networking/filter.rst | 6 +- arch/mips/Kconfig | 4 +- arch/mips/include/asm/uasm.h | 1 + arch/mips/mm/uasm-mips.c | 4 +- arch/mips/mm/uasm.c | 3 +- arch/mips/net/Makefile | 8 +- arch/mips/net/ebpf_jit.c | 1935 ---------------------- arch/mips/net/ebpf_jit.h | 295 ++++ arch/mips/net/ebpf_jit_comp32.c | 1241 ++++++++++++++ arch/mips/net/ebpf_jit_comp64.c | 987 +++++++++++ arch/mips/net/ebpf_jit_core.c | 1118 +++++++++++++ 12 files changed, 3663 insertions(+), 1945 deletions(-) delete mode 100644 arch/mips/net/ebpf_jit.c create mode 100644 arch/mips/net/ebpf_jit.h create mode 100644 arch/mips/net/ebpf_jit_comp32.c create mode 100644 arch/mips/net/ebpf_jit_comp64.c create mode 100644 arch/mips/net/ebpf_jit_core.c -- 2.25.1