Re: GPF from target_put_nacl()

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Aug 6, 2017 at 4:24 PM, Nicholas A. Bellinger
<nab@xxxxxxxxxxxxxxx> wrote:
> On Mon, 2017-07-31 at 14:48 -0700, Justin Maggard wrote:
>> On Sun, Jul 30, 2017 at 5:15 PM, Nicholas A. Bellinger
>> <nab@xxxxxxxxxxxxxxx> wrote:
>> > On Thu, 2017-07-27 at 13:56 -0700, Justin Maggard wrote:
>> >> I ran across a GPF after disabling demo-mode on an iSCSI target.  The
>> >> backtrace pointed to target_put_nacl(), which was called from
>> >> transport_free_session().
>> >
>> > Can we see actual stack trace, and kernel version please..?
>> >
>>
>> Sure.
>
> <SNIP>
>
>> Thanks a lot, that clears up my misunderstanding.  I should have read
>> the kref documentation carefully first.
>>
>> So it looks like what's *really* happening is,
>> transport_free_session() does a list_del(acl_list). At least in my
>> case, this empties the list. Then, target_put_nacl() calls
>> target_complete_nacl(), which also does a list_del(acl_list) and
>> crashes.
>>
>
> Ugh.  I seem to recall the initial version of this patch was using
> list_del_init(), but review feedback listed in commit 01d4d6735589
> shows that it was dropped as unnecessary:
>
>    (Drop unnecessary list_del_init usage - HCH)
>
> Note the scenario in question is specific to generate_node_acls = 1 +
> cache_dynamic_acls = 0.
>
> Here's the patch I'm applying to target-pending/master, with a CC' to
> v4.1.y stable.
>
> Please verify on your end.
>
> Thank you.
>

Yes, that takes care of the issue for me.  Thanks!

Tested-by: Justin Maggard <jmaggard10@xxxxxxxxx>

> From 73f6806a8124357ca00fcadaa9d58f617e44947e Mon Sep 17 00:00:00 2001
> From: Nicholas Bellinger <nab@xxxxxxxxxxxxxxx>
> Date: Sun, 6 Aug 2017 16:10:03 -0700
> Subject: [PATCH] target: Fix node_acl demo-mode + uncached dynamic shutdown
>  regression
>
> This patch fixes a generate_node_acls = 1 + cache_dynamic_acls = 0
> regression, that was introduced by
>
>   commit 01d4d673558985d9a118e1e05026633c3e2ade9b
>   Author: Nicholas Bellinger <nab@xxxxxxxxxxxxxxx>
>   Date:   Wed Dec 7 12:55:54 2016 -0800
>
> which originally had the proper list_del_init() usage, but was
> dropped during list review as it was thought unnecessary by HCH.
>
> However, list_del_init() usage is required during the special
> generate_node_acls = 1 + cache_dynamic_acls = 0 case when
> transport_free_session() does a list_del(&se_nacl->acl_list),
> followed by target_complete_nacl() doing the same thing.
>
> This was manifesting as a general protection fault as reported
> by Justin:
>
> kernel: general protection fault: 0000 [#1] SMP
> kernel: Modules linked in:
> kernel: CPU: 0 PID: 11047 Comm: iscsi_ttx Not tainted 4.13.0-rc2.x86_64.1+ #20
> kernel: Hardware name: Intel Corporation S5500BC/S5500BC, BIOS S5500.86B.01.00.0064.050520141428 05/05/2014
> kernel: task: ffff88026939e800 task.stack: ffffc90007884000
> kernel: RIP: 0010:target_put_nacl+0x49/0xb0
> kernel: RSP: 0018:ffffc90007887d70 EFLAGS: 00010246
> kernel: RAX: dead000000000200 RBX: ffff8802556ca000 RCX: 0000000000000000
> kernel: RDX: dead000000000100 RSI: 0000000000000246 RDI: ffff8802556ce028
> kernel: RBP: ffffc90007887d88 R08: 0000000000000001 R09: 0000000000000000
> kernel: R10: ffffc90007887df8 R11: ffffea0009986900 R12: ffff8802556ce020
> kernel: R13: ffff8802556ce028 R14: ffff8802556ce028 R15: ffffffff88d85540
> kernel: FS:  0000000000000000(0000) GS:ffff88027fc00000(0000) knlGS:0000000000000000
> kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> kernel: CR2: 00007fffe36f5f94 CR3: 0000000009209000 CR4: 00000000003406f0
> kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> kernel: Call Trace:
> kernel:  transport_free_session+0x67/0x140
> kernel:  transport_deregister_session+0x7a/0xc0
> kernel:  iscsit_close_session+0x92/0x210
> kernel:  iscsit_close_connection+0x5f9/0x840
> kernel:  iscsit_take_action_for_connection_exit+0xfe/0x110
> kernel:  iscsi_target_tx_thread+0x140/0x1e0
> kernel:  ? wait_woken+0x90/0x90
> kernel:  kthread+0x124/0x160
> kernel:  ? iscsit_thread_get_cpumask+0x90/0x90
> kernel:  ? kthread_create_on_node+0x40/0x40
> kernel:  ret_from_fork+0x22/0x30
> kernel: Code: 00 48 89 fb 4c 8b a7 48 01 00 00 74 68 4d 8d 6c 24 08 4c
> 89 ef e8 e8 28 43 00 48 8b 93 20 04 00 00 48 8b 83 28 04 00 00 4c 89
> ef <48> 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 48 89 83 20
> kernel: RIP: target_put_nacl+0x49/0xb0 RSP: ffffc90007887d70
> kernel: ---[ end trace f12821adbfd46fed ]---
>
> To address this, go ahead and use proper list_del_list() for all
> cases of se_nacl->acl_list deletion.
>
> Reported-by: Justin Maggard <jmaggard01@xxxxxxxxx>
> Cc: Justin Maggard <jmaggard01@xxxxxxxxx>
> Cc: stable@xxxxxxxxxxxxxxx # 4.1+
> Signed-off-by: Nicholas Bellinger <nab@xxxxxxxxxxxxxxx>
> ---
>  drivers/target/target_core_tpg.c       | 4 ++--
>  drivers/target/target_core_transport.c | 4 ++--
>  2 files changed, 4 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/target/target_core_tpg.c b/drivers/target/target_core_tpg.c
> index 3691373..02e8a5d 100644
> --- a/drivers/target/target_core_tpg.c
> +++ b/drivers/target/target_core_tpg.c
> @@ -364,7 +364,7 @@ void core_tpg_del_initiator_node_acl(struct se_node_acl *acl)
>         mutex_lock(&tpg->acl_node_mutex);
>         if (acl->dynamic_node_acl)
>                 acl->dynamic_node_acl = 0;
> -       list_del(&acl->acl_list);
> +       list_del_init(&acl->acl_list);
>         mutex_unlock(&tpg->acl_node_mutex);
>
>         target_shutdown_sessions(acl);
> @@ -548,7 +548,7 @@ int core_tpg_deregister(struct se_portal_group *se_tpg)
>          * in transport_deregister_session().
>          */
>         list_for_each_entry_safe(nacl, nacl_tmp, &node_list, acl_list) {
> -               list_del(&nacl->acl_list);
> +               list_del_init(&nacl->acl_list);
>
>                 core_tpg_wait_for_nacl_pr_ref(nacl);
>                 core_free_device_list_for_node(nacl, se_tpg);
> diff --git a/drivers/target/target_core_transport.c b/drivers/target/target_core_transport.c
> index 97fed9a..836d552 100644
> --- a/drivers/target/target_core_transport.c
> +++ b/drivers/target/target_core_transport.c
> @@ -466,7 +466,7 @@ static void target_complete_nacl(struct kref *kref)
>         }
>
>         mutex_lock(&se_tpg->acl_node_mutex);
> -       list_del(&nacl->acl_list);
> +       list_del_init(&nacl->acl_list);
>         mutex_unlock(&se_tpg->acl_node_mutex);
>
>         core_tpg_wait_for_nacl_pr_ref(nacl);
> @@ -538,7 +538,7 @@ void transport_free_session(struct se_session *se_sess)
>                         spin_unlock_irqrestore(&se_nacl->nacl_sess_lock, flags);
>
>                         if (se_nacl->dynamic_stop)
> -                               list_del(&se_nacl->acl_list);
> +                               list_del_init(&se_nacl->acl_list);
>                 }
>                 mutex_unlock(&se_tpg->acl_node_mutex);
>
--
To unsubscribe from this list: send the line "unsubscribe target-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux SCSI]     [Kernel Newbies]     [Linux SCSI Target Infrastructure]     [Share Photos]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Device Mapper]

  Powered by Linux