Re: [PATCH] vhost: get 2% performance improved by reducing spin_lock race in vhost_work_queue

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



From: Chuanyu Qin <qinchuanyu@xxxxxxxxxx>
Subject: [PATCH] get 2% or more performance improved by reducing spin_lock race in vhost_work_queue

the wake_up_process func is included by spin_lock/unlock in vhost_work_queue, 
but it could be done outside the spin_lock. 
I have test it with kernel 3.0.27 and guest suse11-sp2 using iperf, the num as below.
                 orignal                   modified
thread_num  tp(Gbps)   vhost(%)  |  tp(Gbps)     vhost(%)
1           9.59         28.82   |      9.59        27.49
8            9.61        32.92   |      9.62        26.77
64            9.58        46.48  |     9.55        38.99
256            9.6        63.7   |      9.6         52.59

Signed-off-by: Chuanyu Qin <qinchuanyu@xxxxxxxxxx>
---
 drivers/vhost/vhost.c |    5 +++--
 1 files changed, 3 insertions(+), 2 deletions(-)

diff --git a/drivers/vhost/vhost.c b/drivers/vhost/vhost.c
index 94dbd25..8bee109 100644
--- a/drivers/vhost/vhost.c
+++ b/drivers/vhost/vhost.c
@@ -146,9 +146,10 @@ static inline void vhost_work_queue(struct vhost_dev *dev,
 	if (list_empty(&work->node)) {
 		list_add_tail(&work->node, &dev->work_list);
 		work->queue_seq++;
+		spin_unlock_irqrestore(&dev->work_lock, flags);
 		wake_up_process(dev->worker);
-	}
-	spin_unlock_irqrestore(&dev->work_lock, flags);
+	} else
+		spin_unlock_irqrestore(&dev->work_lock, flags);
 }
 
 void vhost_poll_queue(struct vhost_poll *poll)
-- 
1.7.3.1.msysgit.0


> On 05/20/2013 12:22 PM, Qinchuanyu wrote:
> > The patch below is base on
> > https://git.kernel.org/cgit/linux/kernel/git/next/linux-
> next.git/tree/drivers/vhost/vhost.c?id=refs/tags/next-20130517
> >
> > Signed-off-by: Chuanyu Qin <qinchuanyu@xxxxxxxxxx>
> > --- a/drivers/vhost/vhost.c     2013-05-20 11:47:05.000000000 +0800
> > +++ b/drivers/vhost/vhost.c     2013-05-20 11:48:24.000000000 +0800
> > @@ -154,9 +154,10 @@
> >         if (list_empty(&work->node)) {
> >                 list_add_tail(&work->node, &dev->work_list);
> >                 work->queue_seq++;
> > +               spin_unlock_irqrestore(&dev->work_lock, flags);
> >                 wake_up_process(dev->worker);
> > -       }
> > -       spin_unlock_irqrestore(&dev->work_lock, flags);
> > +       } else
> > +               spin_unlock_irqrestore(&dev->work_lock, flags);
> >  }
> >
> >  void vhost_poll_queue(struct vhost_poll *poll)
> >
> > I did the test by using iperf in 10G environment, the test num as
> below:
> >                  orignal                   modified
> > thread_num  tp(Gbps)   vhost(%)  |  tp(Gbps)     vhost(%)
> > 1           9.59         28.82   |      9.59        27.49
> > 8            9.61        32.92   |      9.62        26.77
> > 64            9.58        46.48  |     9.55        38.99
> > 256            9.6        63.7   |      9.6         52.59
> >
> > The cost of vhost reduced while the throughput is almost unchanged.
> 
> Thanks, and please generate a formal patch based on
> Documentation/SubmittingPatches (put the description and perf numbers
> in the commit log). Then resubmit it to let the maintainer apply it.

��.n��������+%������w��{.n�����o�^n�r������&��z�ޗ�zf���h���~����������_��+v���)ߣ�


[Index of Archives]     [KVM ARM]     [KVM ia64]     [KVM ppc]     [Virtualization Tools]     [Spice Development]     [Libvirt]     [Libvirt Users]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite Questions]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux