On Tue, Apr 23, 2019 at 12:07:26PM -0700, Andy Lutomirski wrote: > On Tue, Apr 23, 2019 at 11:59 AM Sean Christopherson > <sean.j.christopherson@xxxxxxxxx> wrote: > > > > On Mon, Apr 22, 2019 at 06:29:06PM -0700, Andy Lutomirski wrote: > > > What's not tested here is running this code with EFLAGS.TF set and > > > making sure that it unwinds correctly. Also, Jarkko, unless I missed > > > something, the vDSO extable code likely has a bug. If you run the > > > instruction right before ENCLU with EFLAGS.TF set, then do_debug() > > > will eat the SIGTRAP and skip to the exception handler. Similarly, if > > > you put an instruction breakpoint on ENCLU, it'll get skipped. Or is > > > the code actually correct and am I just remembering wrong? > > > > The code is indeed broken, and I don't see a sane way to make it not > > broken other than to never do vDSO fixup on #DB or #BP. But that's > > probably the right thing to do anyways since an attached debugger is > > likely the intended recipient the 99.9999999% of the time. > > > > The crux of the matter is that it's impossible to identify whether or > > not a #DB/#BP originated from within an enclave, e.g. an INT3 in an > > enclave will look identical to an INT3 at the AEP. Even if hardware > > provided a magic flag, #DB still has scenarios where the intended > > recipient is ambiguous, e.g. data breakpoint encountered in the enclave > > but on an address outside of the enclave, breakpoint encountered in the > > enclave and a code breakpoint on the AEP, etc... > > Ugh. It sounds like ignoring the fixup for #DB is the right call. > But what happens if the enclave contains an INT3 or ICEBP instruction? > Are they magically promoted to #GP, perhaps? #UD for opt-out, a.k.a. non-debug, enclaves. Delivered "normally" for opt-in debug enclaves, except they're fault-like instead of trap-like. > As a maybe possible alternative, if we made it so that the AEX address > was not the same as the ENCLU, could we usefully distinguish these > exceptions based on RIP? Not really, because a user could set a code breakpoint on the AEX or insert an INT3, e.g. to break on exit from the enclave. Theoretically the kernel could cross-reference addresses to determine whether or not a DRx match occurred on an enclave address, but a) that'd be pretty ugly to implement and b) there would still be ambiguity, e.g. if there's a code breakpoint on the AEX and a #DB occurs in the enclave, then DR6 will record both the in-enclave DRx match and the AEX (non-enclave) DRx match. > I suppose it's also worth considering > whether page faults from *inside* the enclave should result in SIGSEGV > or result in a fixup. We certainly want page faults from the ENCLU > instruction itself to get fixed up, but maybe we want most exceptions > inside the enclave to work a bit differently. Of course, if we do > this, we need to make sure that the semantics of returning from the > signal handler are reasonable. Hmm, I'm pretty sure any fault that is 100% in the domain of the enclave should result in fixup. Here are a few use cases off the top of my head that would require the enclave's runtime to intercept the signal, either to reenter the enclave or to feed the fault into the enclave's handler: - Handle EPC invalidation, e.g. due to VM migration, while a thread is in the enclave since the resulting #PF can occur inside the enclave. - During enclave development, configure the runtime to call into the enclave on any exception so that the enclave can dump register state. - Implement copy-on-write or lazy allocation using SGX2 instructions, which would require feeding the #PF back into the enclave. Purely theoretical AFAIK, but lazy allocation in particular could be interesting, e.g. don't allocate .bss pages at startup time. - An enclave and its runtime might feed #UDs back into the enclave, e.g. to run an unmodified binary in an enclave by wrapping it in a shim of sorts.