Re: [PATCH bpf-next] bpf, x64: Fix a jit convergence issue

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




On 8/28/24 4:44 PM, Alexei Starovoitov wrote:
On Wed, Aug 28, 2024 at 3:47 PM Yonghong Song <yonghong.song@xxxxxxxxx> wrote:
It can be je/je too, no?
Yes. It is possible.

so 128 - 4 instead of 128 - 3 ?
You probably mean "127 - 4 instead of 127 - 3" since
the maximum value is 127.
Yes, of course :)

I checked 127 - 4 = 0x7c and indeed we should. See below examples:

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX                   je     0x291
     213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     28d:    XX XX XX XX XX XX       je     0x212
     293:    bf 03 00 00 00          mov    edi,0x3

=>

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX XX XX XX XX       je     0x297 (0x293 - 0x213)
     217:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     291:    XX XX                   je     0x217 (0x217 - 0x293)
     293:    bf 03 00 00 00          mov    edi,0x3

=>

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX                   je     0x28f (0x293 - 0x217)
     213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     28d:    XX XX                   je     0x213 (0x213 - 0x293)  // -0x80 allowed
     293:    bf 03 00 00 00          mov    edi,0x3

=>

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX XX XX XX XX       je     0x28f (0x293 - 0x213)
     217:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     291:    XX XX                   je     0x217 (0x217 - 0x293)
     293:    bf 03 00 00 00          mov    edi,0x3

=>
     ...


Here 0x293 - 0x217 = 0x7c
How did you craft such a test?
Can we add it as a selftest somehow?

This is not from a complete test. I assumed some state during convergence
and from there to further derive states. But I will try to see whether
I can construct actual test cases or not.


+static bool is_imm8_cond_offset(int value)
+{
+       return value <= 124 && value >= -128;
the other side needs the same treatment, no ?
good question. From my understanding, the non-convergence in the
above needs both forward and backport conditions. The solution we
are using is based on putting a limitation on forward conditions
w.r.t. jit code gen.

Another solution is actually to put a limitation on backward
conditions. For example, let us say the above is_imm8_cond_offset()
has
         return value <= 127 && value > -124

See below example:

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX                   je     0x291
     213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     28d:    XX XX XX XX XX XX       je     0x212
     293:    bf 03 00 00 00          mov    edi,0x3

=>

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX XX XX XX XX       je     0x297 (0x293 - 0x213)
     217:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     291:    XX XX XX XX XX XX       je     0x21b (0x217 - 0x293)
     297:    bf 03 00 00 00          mov    edi,0x3

=>

     20e:    48 85 ff                test   rdi,rdi
     211:    XX XX XX XX XX XX       je     0x297 (0x297 - 0x217)
     217:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
     ...
     291:    XX XX XX XX XX XX       je     0x217 (0x217 - 0x297)
     297:    bf 03 00 00 00          mov    edi,0x3

converged here.

So I think we do not need to limit both sides. One side should be enough.
I see and agree when both sides are je/je.
What if the earlier one is a jmp ?

Then we can hit:
            if (nops != 0 && nops != 3) {
                      pr_err("unexpected jump padding: %d bytes\n",
                                              nops);
?

So one side of "jmp_cond padding" and the same side in "jump padding"
needs to do it?

I did some further experiments with pattern like
  jmp <-> je
and
  jmp <-> jmp

The below is the illustration (not from a complete test):

================

    211:    XX XX                   jmp     0x291
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28d:    XX XX XX XX XX XX       je     0x212
    293:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX XX XX XX          jmp     (0x293 - 0x213)
    216:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    291:    XX XX                   je      (0x216 - 0x293)
    293:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp     (0x293 - 0x216 = 0x7d)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28d:    XX XX                   je      (0x213 - 0x293)
    293:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX XX XX XX          jmp     (0x293 - 0x213)
    216:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    291:    XX XX                   je      (0x216 - 0x293)
    293:    bf 03 00 00 00          mov    edi,0x3

=>
    ...

not converged!

================

    211:    XX XX                   jmp     0x291
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28c:    XX XX XX XX XX XX       je     0x212
    292:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp     (0x292 - 0x213)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28c:    XX XX                   je     (0x213 - 0x292)
    28e:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp     (0x28e - 0x213 = 0x7b)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28c:    XX XX                   je     (0x213 - 0x28e)
    28e:    bf 03 00 00 00          mov    edi,0x3

converged!

=================

    211:    XX XX                   jmp     0x291
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28e:    XX XX XX XX XX          jmp     0x212
    293:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX XX XX XX          jmp    (0x293 - 0x213)
    216:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    292:    XX XX                   jmp    (0x216 - 0x293)
    294:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp    (0x294 - 0x216 = 0x7e)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28e:    XX XX XX XX XX          jmp    (0x213 - 0x294)
    294:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp    (0x294 - 0x216 = 0x7e)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28e:    XX XX XX XX XX          jmp    (0x213 - 0x294)
    294:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX XX XX XX          jmp    (0x294 - 0x213)
    216:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    292:    XX XX                   jmp    (0x216 - 0x294)
    294:    bf 03 00 00 00          mov    edi,0x3

=>
   ...

no converged!

===================================

    211:    XX XX                   jmp     0x291
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28d:    XX XX XX XX XX          jmp     0x212
    292:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp     (0x292 - 0x213)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28d:    XX XX                   jmp     (0x213 - 0x292)
    290:    bf 03 00 00 00          mov    edi,0x3

=>

    211:    XX XX                   jmp     (0x290 - 0x213 = 0x7d)
    213:    48 8b 77 00             mov    rsi,QWORD PTR [rdi+0x0]
    ...
    28d:    XX XX                   jmp     (0x213 - 0x290)
    290:    bf 03 00 00 00          mov    edi,0x3

converged!

So I emulated je <-> je, je <-> jmp, jmp <-> je and jmp <-> jmp.

So we need to apply the same checking is_imm8_cond_offset() to jmp insn.
This should cover all cases.

Hitting the following
           if (nops != 0 && nops != 3) {
                     pr_err("unexpected jump padding: %d bytes\n",
                                             nops);

is not due to the above illustration with 'jmp' insn as indeed
its insn length changes with 0 or 3. But it is due to some jmp/cond_jmp
insn inside je/jmp <-> je/jmp.





[Index of Archives]     [Linux Samsung SoC]     [Linux Rockchip SoC]     [Linux Actions SoC]     [Linux for Synopsys ARC Processors]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]


  Powered by Linux