Re: [PATCH] IB/mlx5: Fix outstanding_pi index for GSI qps

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Dec 15, 2019 at 10:55 AM Leon Romanovsky <leon@xxxxxxxxxx> wrote:
>
> On Thu, Dec 12, 2019 at 05:11:29PM -0700, Prabhath Sajeepa wrote:
> > b0ffeb537f3a changed the way how outstanding WRs are tracked for GSI QP. But the
> > fix did not cover the case when a call to ib_post_send fails and index
> > to track outstanding WRs need to be updated correctly.
> >
> > Fixes: b0ffeb537f3a ('IB/mlx5: Fix iteration overrun in GSI qps ')
> > Signed-off-by: Prabhath Sajeepa <psajeepa@xxxxxxxxxxxxxxx>
> > ---
> >  drivers/infiniband/hw/mlx5/gsi.c | 3 +--
> >  1 file changed, 1 insertion(+), 2 deletions(-)
> >
> > diff --git a/drivers/infiniband/hw/mlx5/gsi.c b/drivers/infiniband/hw/mlx5/gsi.c
> > index ac4d8d1..1ae6fd9 100644
> > --- a/drivers/infiniband/hw/mlx5/gsi.c
> > +++ b/drivers/infiniband/hw/mlx5/gsi.c
> > @@ -507,8 +507,7 @@ int mlx5_ib_gsi_post_send(struct ib_qp *qp, const struct ib_send_wr *wr,
> >               ret = ib_post_send(tx_qp, &cur_wr.wr, bad_wr);
> >               if (ret) {
> >                       /* Undo the effect of adding the outstanding wr */
> > -                     gsi->outstanding_pi = (gsi->outstanding_pi - 1) %
> > -                                           gsi->cap.max_send_wr;
> > +                     gsi->outstanding_pi--;
>
> I'm a little bit confused, what is the difference before and after
> except dropping "gsi->cap.max_send_wr"?
>
> Thanks
>
> >                       goto err;
> >               }
> >               spin_unlock_irqrestore(&gsi->lock, flags);
> > --
> > 2.7.4
> >

This patch needs to be considered in conjunction with the below patch
done by Slava Shwartsman

commit b0ffeb537f3a726931d962ab6d03e34a2f070ea4

Author: Slava Shwartsman <slavash@xxxxxxxxxxxx>

Date:   Sun Jul 3 06:28:19 2016

    IB/mlx5: Fix iteration overrun in GSI qps



    Number of outstanding_pi may overflow and as a result may indicate that

    there are no elements in the queue. The effect of doing this is that the

    MAD layer will get stuck waiting for completions. The MAD layer will

    think that the QP is full - because it didn't receive these completions.



    This fix changes it so the outstanding_pi number is increased

    with 32-bit wraparound and is not limited to max_send_wr so

    that the difference between outstanding_pi and outstanding_ci will

    really indicate the number of outstanding completions.


    Cc: Stable <stable@xxxxxxxxxxxxxxx>

    Fixes: ea6dc2036224 ('IB/mlx5: Reorder GSI completions')

    Signed-off-by: Slava Shwartsman <slavash@xxxxxxxxxxxx>

    Signed-off-by: Leon Romanovsky <leon@xxxxxxxxxx>

    Reviewed-by: Haggai Eran <haggaie@xxxxxxxxxxxx>

    Reviewed-by: Sagi Grimberg <sagi@xxxxxxxxxxx>

    Signed-off-by: Doug Ledford <dledford@xxxxxxxxxx>

diff --git a/drivers/infiniband/hw/mlx5/gsi.c b/drivers/infiniband/hw/mlx5/gsi.c

index 53e03c8..79e6309 100644

--- a/drivers/infiniband/hw/mlx5/gsi.c

+++ b/drivers/infiniband/hw/mlx5/gsi.c

@@ -69,15 +69,6 @@ static bool mlx5_ib_deth_sqpn_cap(struct mlx5_ib_dev *dev)

        return MLX5_CAP_GEN(dev->mdev, set_deth_sqpn);

 }



-static u32 next_outstanding(struct mlx5_ib_gsi_qp *gsi, u32 index)

-{

-       return ++index % gsi->cap.max_send_wr;

-}

-

-#define for_each_outstanding_wr(gsi, index) \

-       for (index = gsi->outstanding_ci; index != gsi->outstanding_pi; \

-            index = next_outstanding(gsi, index))

-

 /* Call with gsi->lock locked */

 static void generate_completions(struct mlx5_ib_gsi_qp *gsi)

 {

@@ -85,8 +76,9 @@ static void generate_completions(struct mlx5_ib_gsi_qp *gsi)

        struct mlx5_ib_gsi_wr *wr;

        u32 index;



-       for_each_outstanding_wr(gsi, index) {

-               wr = &gsi->outstanding_wrs[index];

+       for (index = gsi->outstanding_ci; index != gsi->outstanding_pi;

+            index++) {

+               wr = &gsi->outstanding_wrs[index % gsi->cap.max_send_wr];



                if (!wr->completed)

                        break;

@@ -430,8 +422,9 @@ static int mlx5_ib_add_outstanding_wr(struct
mlx5_ib_gsi_qp *gsi,

                return -ENOMEM;

        }



-       gsi_wr = &gsi->outstanding_wrs[gsi->outstanding_pi];

-       gsi->outstanding_pi = next_outstanding(gsi, gsi->outstanding_pi);

+       gsi_wr = &gsi->outstanding_wrs[gsi->outstanding_pi %

+                                      gsi->cap.max_send_wr];

+       gsi->outstanding_pi++;



        if (!wc) {

                memset(&gsi_wr->wc, 0, sizeof(gsi_wr->wc));



The above fix was incomplete since it did not fix the ib_post_send
failure case, which is fixed by the patch I submitted.


-- 
Thanks,
Prabhath



[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]

  Powered by Linux