Re: aarch64 clone() man page omission

Colin Ian King <colin.king@xxxxxxxxxxxxx> · Wed, 11 May 2016 16:25:56 +0100



On 11/05/16 16:22, Catalin Marinas wrote:
> On Wed, May 11, 2016 at 02:33:01PM +0100, Colin Ian King wrote:
>> On 11/05/16 14:18, Catalin Marinas wrote:
>>> On Tue, May 10, 2016 at 10:50:40PM -0400, Mike Frysinger wrote:
>>>> On 09 May 2016 22:40, Colin Ian King wrote:
>>>>> On 09/05/16 22:31, Mike Frysinger wrote:
>>>>>> On 25 Apr 2016 20:42, Colin Ian King wrote:
>>>>>>> currently, the aarch64 clone() system call requires the stack to be
>>>>>>> aligned at a 16 byte boundary, see arch/arm64/kernel/process.c,
>>>>>>> copy_thread():
>>>>>>>
>>>>>>>                 if (stack_start) {
>>>>>>>                         if (is_compat_thread(task_thread_info(p)))
>>>>>>>                                 childregs->compat_sp = stack_start;
>>>>>>>                         /* 16-byte aligned stack mandatory on AArch64 */
>>>>>>>                         else if (stack_start & 15)
>>>>>>>                                 return -EINVAL;
>>>>>>>                         else
>>>>>>>                                 childregs->sp = stack_start;
>>>>>>>                 }
>>>>>>>
>>>>>>>
>>>>>>> ..and returns -EINVAL if not aligned correctly.  This should be added to
>>>>>>> the manual page clone(2) as it took me a while to figure out why clone()
>>>>>>> was failing with -EINVAL for aarch64 but not on x86.
>>>>>>
>>>>>> seems weird for the kernel to be enforcing this.  is it just because of
>>>>>> the stated ABI ?  or is there some weird requirement in the kernel itself
>>>>>> that requires this ?  it's not like other arches have this check, and
>>>>>> there are def ABI requirements about stack alignments in C.
>>>>>
>>>>> The article here indicates it is an aarch64 convention:
>>>>>
>>>>> https://community.arm.com/groups/processors/blog/2015/11/19/using-the-stack-in-aarch32-and-aarch64
>>>>
>>>> that checks my point about the ABI having alignment requirements, but
>>>> that doesn't mean it needs to be checked/enforced in the kernel.  all
>>>> the limitations i see there can be seen in other arches, but we don't
>>>> have those arches do any stack alignment checking.  so should we be
>>>> dropping it from aarch64 ?  why does it need to be special here ?
>>>
>>> It is not just a software ABI requirement but a hardware one. If you try
>>> to access the stack with an unaligned SP value, you get a fault followed
>>> by a SIGBUS delivered to the user application. We decided to enforce
>>> this at the copy_thread() level, it is easier to catch such issue early
>>> than debugging SIGBUS delivered to a thread.
>>
>> Rather than returning -EINVAL would it be more useful re-align
>> stack_start to the 16 byte boundary in copy_thread as a silent but
>> useful fixup?
> 
> I wouldn't silently re-align the stack, it's a significant kernel ABI
> change. Even dropping -EINVAL in favour of a later SIGBUS is an ABI
> change, though not sure if any user apps or libraries would be affected
> (I wouldn't expect them to rely on the -EINVAL return).
> 
> It seems that musl does this alignment in its clone(2) implementation:
> 
> https://git.musl-libc.org/cgit/musl/tree/src/thread/aarch64/clone.s
> 
> IIUC, glibc does not.
> 
>> It took me a while to debug the -EINVAL on the clone() system call to
>> figure out what was wrong because I didn't realize aarch64 has this
>> constraint.
> 
> Would it have been easier to get a SIGBUS on the first stack access?

Not sure if that's a rhetorical question, but needless to say, a SIGBUS
on the stack would be more of a hint from userspace that can be debugged
without diving into the kernel than having -EINVAL IMHO.
> 
> It's worth posting a patch removing -EINVAL on linux-arm-kernel for
> wider discussion.
> 
Yup, good idea.

--
To unsubscribe from this list: send the line "unsubscribe linux-man" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html