Re: Hang in cm_destroy_id

Erez Shitrit <erezsh@xxxxxxxxxxxxxxxxxx> · Sun, 27 Mar 2016 08:53:01 +0300

Hi Nikolay,

looks like some cm event/packet was missing, no idea what/how we need
to debug this.

On Fri, Mar 25, 2016 at 2:35 PM, Nikolay Borisov <kernel@xxxxxxxx> wrote:
> Hello,
>
> I have an infiniband network which was running in connected mode but due
> to some misconfiguration the infiniband nodes couldn't communicate with
> the opensm and I was seeing a lot of messages such as:

What do you mean with misconfiguration? what was the configuration of
the sm before and after you changed that ?

something caused the cm to get out of sync in its one or more open cm
connections ..

>
> 2745676.137840] ib0: queue stopped 1, tx_head 599, tx_tail 556
> [2745677.137564] ib0: transmit timeout: latency 137170 msecs
> [2745677.137735] ib0: queue stopped 1, tx_head 599, tx_tail 556
>
> Now this is all fine and dandy until I started seeing splats such as:
>
> [2745677.407573] INFO: task kworker/u24:0:8332 blocked for more than 120
> seconds.
> [2745677.407748]       Tainted: P        W  O    4.4.1-clouder2 #69
> [2745677.407916] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs"
> disables this message.
> [2745677.408087] kworker/u24:0   D ffff8801445dbaf8     0  8332      2
> 0x00000000
> [2745677.408353] Workqueue: ipoib_wq ipoib_cm_rx_reap [ib_ipoib]
> [2745677.408565]  ffff8801445dbaf8 ffff88046d730d00 ffff880216218d00
> 0000000000000000
> [2745677.408916]  0000000000000004 0000000000000004 0000000000000004
> 000000000000049a
> [2745677.409267]  0000000000000090 0000000000000001 0000000000000002
> 0000000000000000
> [2745677.409612] Call Trace:
> [2745677.409773]  [<ffffffff8134e12b>] ? find_next_bit+0xb/0x10
> [2745677.409935]  [<ffffffff813399d1>] ? cpumask_next_and+0x21/0x40
> [2745677.410099]  [<ffffffff810d8628>] ? load_balance+0x1f8/0x8e0
> [2745677.410262]  [<ffffffff81679b67>] schedule+0x47/0x90
> [2745677.410422]  [<ffffffff8167cab6>] schedule_timeout+0x136/0x1c0
> [2745677.410585]  [<ffffffff8167ac03>] wait_for_completion+0xb3/0x120
> [2745677.410749]  [<ffffffff810cb5c0>] ? try_to_wake_up+0x3b0/0x3b0
> [2745677.410914]  [<ffffffffa02c6c0f>] cm_destroy_id+0x8f/0x310 [ib_cm]
> [2745677.411080]  [<ffffffffa02c6fd0>] ib_destroy_cm_id+0x10/0x20 [ib_cm]
> [2745677.411251]  [<ffffffffa0668357>]
> ipoib_cm_free_rx_reap_list+0xa7/0x110 [ib_ipoib]
> [2745677.411422]  [<ffffffffa06683d5>] ipoib_cm_rx_reap+0x15/0x20 [ib_ipoib]
> [2745677.411591]  [<ffffffff810bb798>] process_one_work+0x178/0x500
> [2745677.411759]  [<ffffffff810bcd92>] worker_thread+0x132/0x630
> [2745677.411928]  [<ffffffff810cb5d2>] ? default_wake_function+0x12/0x20
> [2745677.412096]  [<ffffffff810e08e6>] ? __wake_up_common+0x56/0x90
> [2745677.412263]  [<ffffffff810bcc60>] ? create_worker+0x1d0/0x1d0
> [2745677.412430]  [<ffffffff810bcc60>] ? create_worker+0x1d0/0x1d0
> [2745677.412598]  [<ffffffff810bcc60>] ? create_worker+0x1d0/0x1d0
> [2745677.412766]  [<ffffffff810c1777>] kthread+0xd7/0xf0
> [2745677.412933]  [<ffffffff810ca3ee>] ? schedule_tail+0x1e/0xd0
> [2745677.413100]  [<ffffffff810c16a0>] ?
> kthread_freezable_should_stop+0x80/0x80
> [2745677.413270]  [<ffffffff8167de2f>] ret_from_fork+0x3f/0x70
> [2745677.413439]  [<ffffffff810c16a0>] ?
> kthread_freezable_should_stop+0x80/0x80
>
>
> And having kernel.hung_task_panic sysctl set to 1 caused a lot of
> machines to reboot. In any case I don't think it's normal to have hung
> tasks when your network is out. This happens due to the
> wait_for_completion(&cm_id_priv->comp); never returning in cm_destroy_id
> function. I saw there is one place where the cm_id refcount is
> decremented via normal atomic_dec and not cm_deref_id under
> cm_req_handle's rejected label. I dunno if this is correct or now, but
> there definitely seems to be some refcounting problem. I've since moved
> away from CM due to this thing (and the network got eventually fixed)
> but I thought I'd do a bug-report so that this can be fixed
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html