On Sun, Oct 14, 2012 at 06:44:12PM +0200, Daniel Mack wrote: > On Oct 14, 2012 6:40 PM, "Al Viro" <viro@xxxxxxxxxxxxxxxxxx> wrote: > > > > On Sun, Oct 14, 2012 at 05:35:23PM +0200, Daniel Mack wrote: > > > > > I rebased my ARM development branch and figured that your patch 9fff2fa > > > ("arm: switch to saner kernel_execve() semantics") breaks the boot on my > > > board right after init is invoked via NFS: > > > > OK, revert it is, then. Nothing in the tree has dependencies on that > sucker > > and while it survives testing here, it's obviously not ready for mainline. > > So, with abject apologies to everyone involved, please revert. > > Reverting it is not straight forward, and half of this patch doesn't seem > to cause issues. > > I can resend my patch with an S-o-b if you want me to. Um... That's _really_ interesting. First of all, revert is absolutely straightforward; the only change in Kconfig is "remove the damn select" and it's not hard to resolve. But I actually wonder what the hell is going on with that breakage - the *only* thing your revert changes is that instead of letting the kernel_thread callback return all the way to returning 0 to ret_from_kernel_thread() on do_execve() success you have it do ret_from_kernel_execve() instead. Hmm... Could you try to print current_pt_regs()->ARM_r0 in kernel_execve() before calling ret_from_kernel_execve() with your patch applied? If that somehow got non-zero, we'd see trouble, all right, but I don't see any places where it could. Wait a minute... I think I see what might be going on, but I don't understand it at all. Look: arm start_thread() is #define start_thread(regs,pc,sp) \ ({ \ unsigned long *stack = (unsigned long *)sp; \ memset(regs->uregs, 0, sizeof(regs->uregs)); \ if (current->personality & ADDR_LIMIT_32BIT) \ regs->ARM_cpsr = USR_MODE; \ else \ regs->ARM_cpsr = USR26_MODE; \ if (elf_hwcap & HWCAP_THUMB && pc & 1) \ regs->ARM_cpsr |= PSR_T_BIT; \ regs->ARM_cpsr |= PSR_ENDSTATE; \ regs->ARM_pc = pc & ~1; /* pc */ \ regs->ARM_sp = sp; /* sp */ \ regs->ARM_r2 = stack[2]; /* r2 (envp) */ \ regs->ARM_r1 = stack[1]; /* r1 (argv) */ \ regs->ARM_r0 = stack[0]; /* r0 (argc) */ \ nommu_start_thread(regs); \ }) and the last 3 make no sense whatsoever. Note that on normal execve() we'll be going through the syscall return, so the userland will see 0 in there, no matter what do we do here. Theoretically, it might've been done for ptrace sake (it will be able to observe the values in those registers before the tracee reaches userland), but there's another oddity involved - "stack" is a userland pointer here. Granted, it's been recently written to, so we are not likely to hit a pagefault here, but... What happens if we are under enough memory pressure to swap those pages out? PF in the kernel mode with no exception table entries for those insns? -- To unsubscribe from this list: send the line "unsubscribe linux-arch" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html