On Wed, Oct 07, 2020 at 06:14:02AM +0300, Jarkko Sakkinen wrote: > On Tue, Oct 06, 2020 at 06:17:38PM -0700, Sean Christopherson wrote: > > On Wed, Oct 07, 2020 at 03:22:36AM +0300, Jarkko Sakkinen wrote: > > > > And then a third flavor comes along, e.g. Jethro's request interrupt case, > > > > and exit_reason can also return '2'. How do you handle that with only the > > > > leaf? > > > > > > I'm listening. How was that handled before? I saw only '0' and '1'. Can > > > you bring some context on that? I did read the emails that were swapped > > > when the run structure was added but I'm not sure what is the exact > > > differentiator. Maybe I'm missing something. > > > > https://patchwork.kernel.org/patch/11719889/ > > Thank you. > > There's aboslutely nothing that is blocking adding such support for such > AEP handling in the current implementation. SGX_SYNCHRONOUS_EXIT is just > another name for EEXIT. Sure. And SGX_EXCEPTION_EXIT is just another name for EENTER|ERESUME. > Even if that was in place, you'd need to separate normal and interrupt. > Tristate is useless here. Huh? You mean like adding SGX_INTERRUPT_EXIT and SGX_EXCEPTION_EXIT? > As far as I'm concerned, no bottlenecks have been created. There's no bottleneck, just an inflexible and kludgy API for userspace. if (run->leaf == EEXIT) return handle_eexit(); if (run->leaf == EENTER || run->leaf == ERESUME) return handle_exception(run->leaf); return -EIO; Let's say we come up with a clever opt-in scheme that allows exception fixup to inform the vDSO that the enclave was invalid, even on SGX1. Now we're in a scenario where we want to tell userspace that the enclave is lost, but userspace assumes any exit EENTER or ERESUME is an exception. if (run->leaf == EEXIT) return handle_eexit(); if (run->leaf == EENTER || run->leaf == ERESUME) return handle_invalid_enclave_or_maybe_exception(); return -EIO; We could add a new exit reason, but we'd still need to ensure EENTER|ERESUME means "exception" for old userspace. Or we could add exit_reason now and end up with (IMO) a sane and extensible interface. if (run->exit_reason == SGX_ENCLAVE_INVALID) return handle_invalid_enclave(); if (run->exit_reason == SGX_SYNCHRONOUS_EXIT) return handle_eexit(); if (run->exit_reason == SGX_EXCEPTION) return handle_exception(); return -EIO; And maybe we get really clever and figure out a way to (deterministically) redirect SIGALRM to the vDSO. Then we'd want: if (run->exit_reason == SGX_ENCLAVE_INVALID) return handle_invalid_enclave(); if (run->exit_reason == SGX_SYNCHRONOUS_EXIT) return handle_eexit(); if (run->exit_reason == SGX_ALARM) return handle_reschedule(); if (run->exit_reason == SGX_EXCEPTION) return handle_exception(); return -EIO; Even more hypothetical would be if Andy gets one of his wishes, and EENTER2 comes along that doesn't allow the enclave to dictate the exit point, "returns" an error code on enclave failure, and allows the kernel to auto-restart the enclave on IRQs/NMIs. That (very hypothetical) scenario fits nicely into the exit_reason handling. I'm not arguing that any of the above is even remotely likely. I just don't understand why we'd want an API that at best requires heuristics in userspace to determine why the enclave stopped running, and at worst will saddle us with an ugly mess in the future. All to save 4 bytes that no one cares about (they literally cost nothing), and a single MOV in a flow that is hundreds, if not thousands, of cycles.