Re: [PATCH] net: tls: enable __GFP_ZERO upon tls_init()

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2023/06/30 19:18, Ard Biesheuvel wrote:
> On Fri, 30 Jun 2023 at 12:11, Alexander Potapenko <glider@xxxxxxxxxx> wrote:
>>
>> On Fri, Jun 30, 2023 at 12:02 PM Ard Biesheuvel <ardb@xxxxxxxxxx> wrote:
>>>
>>> On Fri, 30 Jun 2023 at 11:53, Tetsuo Handa
>>> <penguin-kernel@xxxxxxxxxxxxxxxxxxx> wrote:
>>>>
>>>> On 2023/06/30 18:36, Ard Biesheuvel wrote:
>>>>> Why are you sending this now?
>>>>
>>>> Just because this is currently top crasher and I can reproduce locally.
>>>>
>>>>> Do you have a reproducer for this issue?
>>>>
>>>> Yes. https://syzkaller.appspot.com/text?tag=ReproC&x=12931621900000 works.
>>>>
>>>
>>> Could you please share your kernel config and the resulting kernel log
>>> when running the reproducer? I'll try to reproduce locally as well,
>>> and see if I can figure out what is going on in the crypto layer
>>
>> The config together with the repro is available at
>> https://syzkaller.appspot.com/bug?extid=828dfc12440b4f6f305d, see the
>> latest row of the "Crashes" table that contains a C repro.

Kernel is commit e6bc8833d80f of https://github.com/google/kmsan/commits/master .
Config is available in the dashboard page, but a smaller one is available at
https://I-love.SAKURA.ne.jp/tmp/config-6.4.0-rc7-kmsan .

I'm using a debug printk() patch shown below.

----------------------------------------
diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
index 1a53c8f481e9..b32bb015995c 100644
--- a/net/tls/tls_sw.c
+++ b/net/tls/tls_sw.c
@@ -1210,7 +1210,8 @@ static int tls_sw_do_sendpage(struct sock *sk, struct page *page,
 		if (!sk_stream_memory_free(sk))
 			goto wait_for_sndbuf;
 alloc_payload:
-		ret = tls_alloc_encrypted_msg(sk, required_size);
+		ret = tls_alloc_encrypted_msg(sk, required_size); /////
+		pr_info("required_size=%d ret=%d\n", required_size, ret);
 		if (ret) {
 			if (ret != -ENOSPC)
 				goto wait_for_memory;
@@ -1232,7 +1233,9 @@ static int tls_sw_do_sendpage(struct sock *sk, struct page *page,
 
 		tls_ctx->pending_open_record_frags = true;
 		if (full_record || eor || sk_msg_full(msg_pl)) {
-			ret = bpf_exec_tx_verdict(msg_pl, sk, full_record,
+			pr_info("full_record=%d eor=%d sk_msg_full(msg_pl)=%d copied=%d\n",
+				full_record, eor, sk_msg_full(msg_pl), copied);
+			ret = bpf_exec_tx_verdict(msg_pl, sk, full_record, /////
 						  record_type, &copied, flags);
 			if (ret) {
 				if (ret == -EINPROGRESS)
----------------------------------------

Output (on Ubuntu 22.04 on Oracle VM VirtualBox) is shown below.
Please check tendency of the sum of required_size= values up to the full_record= line.
It seems that the value of required_size= might vary depending on the timings, but
the sum of the values seems to have some rule.

  4125+8221+12317+16413=41076 (the lower 4 bits are 0100)
  2461+6557+10653+14749+16413=50833 (the lower 4 bits are 0001)
  2461+6573+10669+14765+16413=50881 (the lower 4 bits are 0001)

KMSAN reports this problem when the lower 4 bits became 0001 for the second time.
Unless KMSAN's reporting is asynchronous, maybe the reason of "for the second time"
part is that the previous state is relevant...

----------------------------------------
[  157.471712][ T3414] required_size=4125 ret=0
[  157.475879][ T3414] required_size=8221 ret=0
[  157.480471][ T3414] required_size=12317 ret=0
[  157.484604][ T3414] required_size=16413 ret=0
[  157.490499][ T3414] full_record=1 eor=0 sk_msg_full(msg_pl)=0 copied=4096
[  157.513772][ T3414] required_size=4125 ret=0
[  157.523782][ T3414] required_size=8221 ret=0
[  157.533658][ T3414] required_size=12317 ret=0
[  157.539579][ T3414] required_size=16413 ret=0
[  157.543785][ T3414] full_record=1 eor=0 sk_msg_full(msg_pl)=0 copied=4096
[  157.572869][ T3414] required_size=4125 ret=0
[  157.579350][ T3414] required_size=8221 ret=0
[  157.584699][ T3414] required_size=12317 ret=0
[  157.591756][ T3414] required_size=16413 ret=0
[  157.595891][ T3414] full_record=1 eor=0 sk_msg_full(msg_pl)=0 copied=4096
[  157.790734][ T3424] required_size=2461 ret=0
[  157.800725][ T3424] required_size=6557 ret=0
[  157.804560][ T3424] required_size=10653 ret=0
[  157.808433][ T3424] required_size=14749 ret=0
[  157.810125][ T3424] required_size=16413 ret=0
[  157.829564][ T3424] full_record=1 eor=0 sk_msg_full(msg_pl)=0 copied=1664
[  157.848397][ T3424] required_size=2461 ret=0
[  157.854875][ T3424] required_size=6573 ret=0
[  157.860883][ T3424] required_size=10669 ret=0
[  157.865463][ T3424] required_size=14765 ret=0
[  157.871794][ T3424] required_size=16413 ret=0
[  157.877333][ T3424] full_record=1 eor=0 sk_msg_full(msg_pl)=0 copied=1648
[  157.885187][ T3424] =====================================================
[  157.887262][ T3424] BUG: KMSAN: uninit-value in aes_encrypt+0x1692/0x1fa0
[  157.887262][ T3424]  aes_encrypt+0x1692/0x1fa0
[  157.887262][ T3424]  aesti_encrypt+0xe1/0x160
[  157.887262][ T3424]  crypto_cipher_encrypt_one+0x1d1/0x2e0
[  157.887262][ T3424]  crypto_cbcmac_digest_update+0x3ff/0x5a0
[  157.887262][ T3424]  shash_ahash_finup+0x79d/0xd00
[  157.887262][ T3424]  shash_async_finup+0xbf/0x110
[  157.887262][ T3424]  crypto_ahash_finup+0x244/0x500
[  157.887262][ T3424]  crypto_ccm_auth+0x14df/0x15a0
[  157.887262][ T3424]  crypto_ccm_encrypt+0x2ad/0x8b0
[  157.887262][ T3424]  crypto_aead_encrypt+0x116/0x1a0
[  157.887262][ T3424]  tls_push_record+0x2bbe/0x3bf0
[  157.887262][ T3424]  bpf_exec_tx_verdict+0x5ba/0x2530
[  157.887262][ T3424]  tls_sw_do_sendpage+0x1779/0x21f0
[  157.887262][ T3424]  tls_sw_sendpage+0x247/0x2b0
[  157.887262][ T3424]  inet_sendpage+0x1de/0x2f0
[  157.887262][ T3424]  kernel_sendpage+0x4cc/0x940
[  158.004827][ T3424]  sock_sendpage+0x162/0x220
[  158.004827][ T3424]  pipe_to_sendpage+0x3df/0x4f0
[  158.004827][ T3424]  __splice_from_pipe+0x5c7/0x1010
[  158.004827][ T3424]  generic_splice_sendpage+0x1c6/0x2a0
[  158.004827][ T3424]  do_splice+0x26d8/0x32f0
[  158.004827][ T3424]  __se_sys_splice+0x81f/0xba0
[  158.004827][ T3424]  __x64_sys_splice+0x1a1/0x200
[  158.004827][ T3424]  do_syscall_64+0x41/0x90
[  158.004827][ T3424]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  158.004827][ T3424] 
[  158.004827][ T3424] Uninit was stored to memory at:
[  158.004827][ T3424]  __crypto_xor+0x285/0x1700
[  158.004827][ T3424]  crypto_cbcmac_digest_update+0x2ba/0x5a0
[  158.004827][ T3424]  shash_ahash_finup+0x79d/0xd00
[  158.004827][ T3424]  shash_async_finup+0xbf/0x110
[  158.004827][ T3424]  crypto_ahash_finup+0x244/0x500
[  158.004827][ T3424]  crypto_ccm_auth+0x14df/0x15a0
[  158.004827][ T3424]  crypto_ccm_encrypt+0x2ad/0x8b0
[  158.004827][ T3424]  crypto_aead_encrypt+0x116/0x1a0
[  158.004827][ T3424]  tls_push_record+0x2bbe/0x3bf0
[  158.004827][ T3424]  bpf_exec_tx_verdict+0x5ba/0x2530
[  158.004827][ T3424]  tls_sw_do_sendpage+0x1779/0x21f0
[  158.004827][ T3424]  tls_sw_sendpage+0x247/0x2b0
[  158.004827][ T3424]  inet_sendpage+0x1de/0x2f0
[  158.004827][ T3424]  kernel_sendpage+0x4cc/0x940
[  158.004827][ T3424]  sock_sendpage+0x162/0x220
[  158.004827][ T3424]  pipe_to_sendpage+0x3df/0x4f0
[  158.004827][ T3424]  __splice_from_pipe+0x5c7/0x1010
[  158.004827][ T3424]  generic_splice_sendpage+0x1c6/0x2a0
[  158.004827][ T3424]  do_splice+0x26d8/0x32f0
[  158.004827][ T3424]  __se_sys_splice+0x81f/0xba0
[  158.004827][ T3424]  __x64_sys_splice+0x1a1/0x200
[  158.004827][ T3424]  do_syscall_64+0x41/0x90
[  158.004827][ T3424]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  158.004827][ T3424] 
[  158.004827][ T3424] Uninit was created at:
[  158.004827][ T3424]  __alloc_pages+0x925/0x1050
[  158.004827][ T3424]  alloc_pages+0xe30/0x11b0
[  158.004827][ T3424]  skb_page_frag_refill+0x362/0x910
[  158.004827][ T3424]  sk_page_frag_refill+0xa2/0x1c0
[  158.004827][ T3424]  sk_msg_alloc+0x278/0x1560
[  158.004827][ T3424]  tls_sw_do_sendpage+0xbec/0x21f0
[  158.004827][ T3424]  tls_sw_sendpage+0x247/0x2b0
[  158.004827][ T3424]  inet_sendpage+0x1de/0x2f0
[  158.004827][ T3424]  kernel_sendpage+0x4cc/0x940
[  158.004827][ T3424]  sock_sendpage+0x162/0x220
[  158.004827][ T3424]  pipe_to_sendpage+0x3df/0x4f0
[  158.004827][ T3424]  __splice_from_pipe+0x5c7/0x1010
[  158.004827][ T3424]  generic_splice_sendpage+0x1c6/0x2a0
[  158.260226][ T3424]  do_splice+0x26d8/0x32f0
[  158.260226][ T3424]  __se_sys_splice+0x81f/0xba0
[  158.260226][ T3424]  __x64_sys_splice+0x1a1/0x200
[  158.260226][ T3424]  do_syscall_64+0x41/0x90
[  158.260226][ T3424]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  158.260226][ T3424] 
[  158.260226][ T3424] CPU: 7 PID: 3424 Comm: a.out Not tainted 6.4.0-rc7-ge6bc8833d80f-dirty #26
[  158.260226][ T3424] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
[  158.260226][ T3424] =====================================================
[  158.260226][ T3424] Disabling lock debugging due to kernel taint
[  158.260226][ T3424] Kernel panic - not syncing: kmsan.panic set ...
[  158.260226][ T3424] CPU: 7 PID: 3424 Comm: a.out Tainted: G    B              6.4.0-rc7-ge6bc8833d80f-dirty #26
[  158.320898][ T3424] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
[  158.334186][ T3424] Call Trace:
[  158.334186][ T3424]  <TASK>
[  158.334186][ T3424]  dump_stack_lvl+0x1f6/0x280
[  158.334186][ T3424]  dump_stack+0x29/0x30
[  158.334186][ T3424]  panic+0x4e7/0xc60
[  158.334186][ T3424]  ? add_taint+0x185/0x210
[  158.334186][ T3424]  kmsan_report+0x2d1/0x2e0
[  158.334186][ T3424]  ? __msan_warning+0x98/0x120
[  158.334186][ T3424]  ? aes_encrypt+0x1692/0x1fa0
[  158.334186][ T3424]  ? aesti_encrypt+0xe1/0x160
[  158.334186][ T3424]  ? crypto_cipher_encrypt_one+0x1d1/0x2e0
[  158.334186][ T3424]  ? crypto_cbcmac_digest_update+0x3ff/0x5a0
[  158.334186][ T3424]  ? shash_ahash_finup+0x79d/0xd00
[  158.334186][ T3424]  ? shash_async_finup+0xbf/0x110
[  158.334186][ T3424]  ? crypto_ahash_finup+0x244/0x500
[  158.334186][ T3424]  ? crypto_ccm_auth+0x14df/0x15a0
[  158.334186][ T3424]  ? crypto_ccm_encrypt+0x2ad/0x8b0
[  158.334186][ T3424]  ? crypto_aead_encrypt+0x116/0x1a0
[  158.334186][ T3424]  ? tls_push_record+0x2bbe/0x3bf0
[  158.334186][ T3424]  ? bpf_exec_tx_verdict+0x5ba/0x2530
[  158.334186][ T3424]  ? tls_sw_do_sendpage+0x1779/0x21f0
[  158.334186][ T3424]  ? tls_sw_sendpage+0x247/0x2b0
[  158.334186][ T3424]  ? inet_sendpage+0x1de/0x2f0
[  158.334186][ T3424]  ? kernel_sendpage+0x4cc/0x940
[  158.334186][ T3424]  ? sock_sendpage+0x162/0x220
[  158.334186][ T3424]  ? pipe_to_sendpage+0x3df/0x4f0
[  158.334186][ T3424]  ? __splice_from_pipe+0x5c7/0x1010
[  158.334186][ T3424]  ? generic_splice_sendpage+0x1c6/0x2a0
[  158.334186][ T3424]  ? do_splice+0x26d8/0x32f0
[  158.334186][ T3424]  ? __se_sys_splice+0x81f/0xba0
[  158.334186][ T3424]  ? __x64_sys_splice+0x1a1/0x200
[  158.334186][ T3424]  ? do_syscall_64+0x41/0x90
[  158.334186][ T3424]  ? entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  158.334186][ T3424]  ? filter_irq_stacks+0xb9/0x230
[  158.334186][ T3424]  ? __stack_depot_save+0x22/0x490
[  158.334186][ T3424]  ? kmsan_internal_set_shadow_origin+0x66/0xe0
[  158.334186][ T3424]  ? kmsan_internal_chain_origin+0x110/0x120
[  158.334186][ T3424]  ? kmsan_get_shadow_origin_ptr+0x4d/0xa0
[  158.334186][ T3424]  __msan_warning+0x98/0x120
[  158.334186][ T3424]  aes_encrypt+0x1692/0x1fa0
[  158.334186][ T3424]  aesti_encrypt+0xe1/0x160
[  158.334186][ T3424]  crypto_cipher_encrypt_one+0x1d1/0x2e0
[  158.334186][ T3424]  ? aesti_set_key+0xb0/0xb0
[  158.334186][ T3424]  ? kmsan_get_shadow_origin_ptr+0x4d/0xa0
[  158.334186][ T3424]  crypto_cbcmac_digest_update+0x3ff/0x5a0
[  158.334186][ T3424]  ? crypto_cbcmac_digest_init+0x140/0x140
[  158.334186][ T3424]  shash_ahash_finup+0x79d/0xd00
[  158.334186][ T3424]  ? kmsan_get_shadow_origin_ptr+0x4d/0xa0
[  158.334186][ T3424]  shash_async_finup+0xbf/0x110
[  158.334186][ T3424]  crypto_ahash_finup+0x244/0x500
[  158.334186][ T3424]  ? shash_async_final+0x3d0/0x3d0
[  158.334186][ T3424]  crypto_ccm_auth+0x14df/0x15a0
[  158.334186][ T3424]  crypto_ccm_encrypt+0x2ad/0x8b0
[  158.334186][ T3424]  ? kmsan_get_shadow_origin_ptr+0x4d/0xa0
[  158.334186][ T3424]  ? crypto_ccm_setauthsize+0x100/0x100
[  158.334186][ T3424]  crypto_aead_encrypt+0x116/0x1a0
[  158.653332][ T3424]  tls_push_record+0x2bbe/0x3bf0
[  158.653332][ T3424]  bpf_exec_tx_verdict+0x5ba/0x2530
[  158.653332][ T3424]  ? _printk+0x181/0x1b0
[  158.653332][ T3424]  ? tls_sw_do_sendpage+0xc81/0x21f0
[  158.653332][ T3424]  tls_sw_do_sendpage+0x1779/0x21f0
[  158.653332][ T3424]  tls_sw_sendpage+0x247/0x2b0
[  158.653332][ T3424]  ? tls_sw_do_sendpage+0x21f0/0x21f0
[  158.653332][ T3424]  inet_sendpage+0x1de/0x2f0
[  158.653332][ T3424]  ? inet_sendmsg+0x1d0/0x1d0
[  158.653332][ T3424]  kernel_sendpage+0x4cc/0x940
[  158.653332][ T3424]  sock_sendpage+0x162/0x220
[  158.653332][ T3424]  pipe_to_sendpage+0x3df/0x4f0
[  158.653332][ T3424]  ? sock_fasync+0x240/0x240
[  158.653332][ T3424]  __splice_from_pipe+0x5c7/0x1010
[  158.653332][ T3424]  ? generic_splice_sendpage+0x2a0/0x2a0
[  158.653332][ T3424]  generic_splice_sendpage+0x1c6/0x2a0
[  158.653332][ T3424]  ? iter_file_splice_write+0x1a30/0x1a30
[  158.653332][ T3424]  do_splice+0x26d8/0x32f0
[  158.653332][ T3424]  ? kmsan_get_shadow_origin_ptr+0x4d/0xa0
[  158.653332][ T3424]  ? __se_sys_splice+0x292/0xba0
[  158.653332][ T3424]  ? __msan_metadata_ptr_for_load_8+0x24/0x40
[  158.653332][ T3424]  ? filter_irq_stacks+0xb9/0x230
[  158.653332][ T3424]  __se_sys_splice+0x81f/0xba0
[  158.870673][ T3424]  __x64_sys_splice+0x1a1/0x200
[  158.870673][ T3424]  do_syscall_64+0x41/0x90
[  158.870673][ T3424]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
[  158.870673][ T3424] RIP: 0033:0x7f6bbd51ea3d
[  158.895223][ T3424] Code: 5b 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c3 a3 0f 00 f7 d8 64 89 01 48
[  158.895223][ T3424] RSP: 002b:00007f6bbd731e08 EFLAGS: 00000246 ORIG_RAX: 0000000000000113
[  158.895223][ T3424] RAX: ffffffffffffffda RBX: 000055ccd9ea6080 RCX: 00007f6bbd51ea3d
[  158.895223][ T3424] RDX: 0000000000000004 RSI: 0000000000000000 RDI: 0000000000000003
[  158.895223][ T3424] RBP: 000055ccd9ea41f4 R08: 00080000fffffffc R09: 0000000000000000
[  158.895223][ T3424] R10: 0000000000000000 R11: 0000000000000246 R12: 0100000000000000
[  158.895223][ T3424] R13: e65b75b4ec4292eb R14: f2300cdb85a45425 R15: 000055ccd9ea6088
[  159.041467][ T3424]  </TASK>
[  159.041467][ T3424] Kernel Offset: disabled
[  159.041467][ T3424] Rebooting in 10 seconds..
----------------------------------------

> 
> Could you explain why that bug contains ~50 reports that seem entirely
> unrelated? AIUI, this actual issue has not been reproduced since
> 2020??

Multiple different bugs are reported as the same problem.
Reproducer is available for only bpf_exec_tx_verdict() one, and the reproducer still works.

> 
> 
>> Config: https://syzkaller.appspot.com/text?tag=KernelConfig&x=ee5f7a0b2e48ed66
>> Report: https://syzkaller.appspot.com/text?tag=CrashReport&x=1325260d900000
>> Syz repro: https://syzkaller.appspot.com/text?tag=ReproSyz&x=11af973e900000
>> C repro: https://syzkaller.appspot.com/text?tag=ReproC&x=163a1e45900000
>>
>> The bug is reproducible for me locally as well (and Tetsuo's patch
>> makes it disappear, although I have no opinion on its correctness).
> 
> What I'd like to do is run a kernel plus initrd locally in OVMF and
> reproduce the issue - can I do that without all the syzkaller
> machinery?

I'm using Ubuntu 22.04 on Oracle VM VirtualBox.
I don't know if this can be reproduced with kernel plus initrd only. But
since the C reproducer is standalone, syzkaller machinery is not involved.




[Index of Archives]     [Kernel]     [Gnu Classpath]     [Gnu Crypto]     [DM Crypt]     [Netfilter]     [Bugtraq]
  Powered by Linux