Re: [RESEND PATCH v3 1/2] compiler.h: add _static_assert()

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 13/11/2024 at 05:26, Yury Norov wrote:
On Wed, Nov 13, 2024 at 04:08:39AM +0900, Vincent Mailhol wrote:
__builtin_constant_p() is known for not always being able to produce
constant expression [1] which lead to the introduction of
__is_constexpr() [2]. Because of its dependency on
__builtin_constant_p(), statically_true() suffers from the same
issues.

For example:

   void foo(int a)
   {
   	 /* fail on GCC */
   	BUILD_BUG_ON_ZERO(statically_true(a));

   	 /* fail both clang and GCC */
   	static char arr[statically_true(a) ? 1 : 2];
   }

For the same reasons why __is_constexpr() was created to cover
__builtin_constant_p() edge cases, __is_constexpr() can be used to
resolve statically_true() limitations.

Note that, somehow, GCC is not always able to fold this:

   __is_constexpr(x) && (x)

It is OK in BUILD_BUG_ON_ZERO() but not in array declarations or in
static_assert():

   void bar(int a)
   {
   	/* success */
   	BUILD_BUG_ON_ZERO(__is_constexpr(a) && (a));

   	/* fail on GCC */
   	static char arr[__is_constexpr(a) && (a) ? 1 : 2];

   	/* fail on GCC */
   	static_assert(__is_constexpr(a) && (a));
   }

Encapsulating the expression in a __builtin_choose_expr() switch
resolves all these failed test.

Declare a new _statically_true() macro which, by making use of the
__builtin_choose_expr() and __is_constexpr(x) combo, always produces a
constant expression.
So, maybe name it const_true() then?


OK. I pretty like the _statically_true() because the link with statically_true() was obvious and the _ underscore prefix hinted that this variant was "special". But I have to admit that the const_true() is also really nice, and I finally adopted it in the v4.

It should be noted that statically_true() still produces better
folding:

   statically_true(!(var * 8 % 8))

always evaluates to true even if var is unknown, whereas

   _statically_true(!(var * 8 % 8))

fails to fold the expression and return false.

For this reason, usage of _statically_true() be should the exception.
Reflect in the documentation that _statically_true() is less powerful
and that statically_true() is the overall preferred solution.

[1] __builtin_constant_p cannot resolve to const when optimizing
Link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=19449

[2] commit 3c8ba0d61d04 ("kernel.h: Retain constant expression output for max()/min()")

Signed-off-by: Vincent Mailhol <mailhol.vincent@xxxxxxxxxx>
---
Bonuses:

   - above examples, and a bit more:

       https://godbolt.org/z/zzqM1ajPj

   - a proof that statically_true() does better constant folding than _statically_true()

       https://godbolt.org/z/vK6KK4hMG
---
  include/linux/compiler.h | 14 ++++++++++++++
  1 file changed, 14 insertions(+)

diff --git a/include/linux/compiler.h b/include/linux/compiler.h
index 4d4e23b6e3e7..c76db8b50202 100644
--- a/include/linux/compiler.h
+++ b/include/linux/compiler.h
@@ -308,6 +308,20 @@ static inline void *offset_to_ptr(const int *off)
   */
  #define statically_true(x) (__builtin_constant_p(x) && (x))
+/*
+ * Similar to statically_true() but produces a constant expression
+ *
+ * To be used in conjunction with macros, such as BUILD_BUG_ON_ZERO(),
+ * which require their input to be a constant expression and for which
+ * statically_true() would otherwise fail.
+ *
+ * This is a tradeoff: _statically_true() is less efficient at
+ * constant folding and will fail to optimize any expressions in which
+ * at least one of the subcomponent is not constant. For the general
+ * case, statically_true() is better.
I agree with Rasmus. Would be nice to have examples where should I use
one vs another right here in the comment.


I rewrote the full set of examples in v4. I added the godbolt link in the patch description and I cherry picked what seems to me the two most meaningful examples and put them in the macro comment. Let me know what you think.

Yours sincerely,
Vincent Mailhol





[Index of Archives]     [Newbies FAQ]     [LKML]     [IETF Annouce]     [DCCP]     [Netdev]     [Networking]     [Security]     [Bugtraq]     [Yosemite]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux SCSI]     [Trinity Fuzzer Tool]

  Powered by Linux