Re: [PATCH V2] Expose all CPU features in host definition

Don Dugger <n0ano@xxxxxxxxx> · Sun, 14 Jul 2013 23:43:57 -0600

On Mon, Jul 01, 2013 at 09:43:58AM -0600, Don Dugger wrote:
> On Mon, Jul 01, 2013 at 09:44:52AM +0100, Daniel P. Berrange wrote:
> > On Fri, Jun 28, 2013 at 02:26:02PM -0600, Don Dugger wrote:
> > > On Fri, Jun 28, 2013 at 09:03:55PM +0100, Daniel P. Berrange wrote:
> > > > On Fri, Jun 28, 2013 at 11:51:32AM -0600, Don Dugger wrote:
> > > > > On Fri, Jun 28, 2013 at 10:24:48AM +0100, Daniel P. Berrange wrote:
> > > > > > On Thu, Jun 27, 2013 at 10:35:58AM -0600, Don Dugger wrote:
> > > > > > > On Mon, Jun 17, 2013 at 10:27:36AM +0200, Jiri Denemark wrote:
> > > > > > > > On Fri, Jun 14, 2013 at 12:32:35 -0600, Don Dugger wrote:
> > > > > > > > > On Fri, Jun 14, 2013 at 03:06:52PM +0200, Jiri Denemark wrote:
> > > > > > > > > > I was just trying to say that it doesn't provide anything more than
> > > > > > > > > > "it's supported by the host CPU", which gives mostly no value in the
> > > > > > > > > > context of libvirt. Can you explain more what the use case is in which a
> > > > > > > > > > virt client would need to know what specific feature are supported by
> > > > > > > > > > host CPU? I feel like we should avoid people from being under the
> > > > > > > > > > impression that they can actually use the CPU capabilities for checking
> > > > > > > > > > whether a host can run guests that require specific CPU features.
> > > > > > > > > 
> > > > > > > > > The specific use case I'm trying to address is a cloud environment where,
> > > > > > > > > with hundreds of hosts, you might want to schedule an instance to a host
> > > > > > > > > that supports a particular HW acceleration, like AES/NI.  I agree, what
> > > > > > > > > I `really` want is an API that shows the capabilities of a specific guest
> > > > > > > > > that could be created on the host but, absent that API, at least knowing
> > > > > > > > > that a host doesn't support the feature is more information than I can get
> > > > > > > > > right now.
> > > > > > > > 
> > > > > > > > Hmm, fair enough. Let's wait a few days for Daniel to return from
> > > > > > > > vacation in case he wants to express his opinion here.
> > > > > > > 
> > > > > > > So, any progress here?
> > > > > > 
> > > > > > I believe the cloud use case above is approaching the problem in the wrong
> > > > > > way. We designed our APIs such that apps should never need to write logic
> > > > > > for comparing CPU features directly. If an application has a set of CPU
> > > > > > features it requires from the host, then it should use a libvirt API to
> > > > > > query whether those features are available, not try to write the CPU
> > > > > > fetaure comparison logic itself.
> > > > > > 
> > > > > > You can already pretty much do this with te virConnectCompareCPU()
> > > > > > method, by passing an XML document which specifies the AES/NI feature
> > > > > > flag that you want to check for support of. Then libvirt will tell
> > > > > > you whether the host CPU can support it. It is entirely possible to
> > > > > > make use of this capability as is in OpenStack.
> > > > > 
> > > > > I don't think this would work with the way scheduling in OpenStack works.
> > > > > The problem is that the OpenStack scheduler doesn't want to query each node
> > > > > in the system on every schedule request (with 100s if not 1,000s of compute
> > > > > nodes this would not be practical).  Instead the scheduler maintains info
> > > > > about all of the compute nodes and, when a request comes in, the scheduler
> > > > > identifies the best compute node for the request and then causes the VM
> > > > > to be started on that node.  Apriori the scheduler doesn't even know which
> > > > > CPU features users are interested in, that information only becomes available
> > > > > when a schedule request comes in so trying to do a `virConnectCompareCPU()'
> > > > > call at that point in time is too late.
> > > > 
> > > > I think your model for user interaction is wrong here. I don't believe
> > > > that OpenStack should be directly exposing the ability for a user to
> > > > explicitly request individual CPU flags for individual VMs. This is
> > > > too risky from a cloud administrator POV, because it could result in
> > > > a user monopolizing a small subset of machines in the guest with
> > > > particular features.  Instead an administrator should be defining
> > > > new flavours with specific CPU feature characteristics. The user can
> > > 
> > > That's the exact mechanism that is being proposed, the ability to define
> > > a flavor that specifies capabilities that are required.  The issue is
> > > that the flavor is defined independently from the scheduler, it's only
> > > when a schedule request is made that the flavor is presented to the scheduler
> > > and then the scheduler needs to identify which of 1,000s of nodes can
> > > satisfy that flavor.
> > 
> > Every 60 seconds or so, every node issues an update indicating what its
> > capabilities are. In that update the nodes could do the CPU compatbility
> > checks and then include the list of which flavours they are capable of
> > executing in their capability update, so that it is then available to
> > the schedular when needed
> 
> This doesn't work for multiple reasons.
> 
>   1.  Ultimately, I want to remove the periodic capability update completely.
>       The better technique is to update compute node state when the state
>       changes, periodic updates are just extra overhead.
>   2.  There's no concept of which nodes support which flavors so this is
>       a completely new infrastructure that would have to be added to the
>       scheduler.
>   3.  There's no easy way for the compute node to know which flavors it
>       supports.  It doesn't know which filters are enabled in the scheduler
>       so it doesn't know which clauses of a flavor actually apply (ignoring
>       that the compute node would now have to duplicate the filtering
>       mechanism from the scheduler even if it knew which filters were
>       enabled).
> 
> The virConnectGetCapabilities already returns a list of CPU features, all
> my patch does is have it explicitly return a complete set of features, which
> I think is the right thing to do and certainly simplifies my specific use
> case.

So have you had any more thoughts on this?  I'd still like to get my patch
incoporated into the tree.

> 
> > 
> > > > then choose the flavour with the CPU characteristics. In this way the
> > > > system can know ahead of time what flavours are possible on what
> > > > host, and do you don't need to query all nodes at scheduling time.
> > > 
> > > Note I am not proposing that we make a query at schedule time.  The
> > > system is already setup to have the compute nodes send configuration
> > > info to the scheduler, all I want to do is sent complete info (e.g. all
> > > of the CPU features).
> > 
> > 
> > Regards,
> > Daniel
> > -- 
> > |: http://berrange.com      -o-    http://www.flickr.com/photos/dberrange/ :|
> > |: http://libvirt.org              -o-             http://virt-manager.org :|
> > |: http://autobuild.org       -o-         http://search.cpan.org/~danberr/ :|
> > |: http://entangle-photo.org       -o-       http://live.gnome.org/gtk-vnc :|
> > 
> 
> -- 
> Don Dugger
> "Censeo Toto nos in Kansa esse decisse." - D. Gale
> n0ano@xxxxxxxxx
> Ph: 303/443-3786
> 
> --
> libvir-list mailing list
> libvir-list@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/libvir-list
> 

-- 
Don Dugger
"Censeo Toto nos in Kansa esse decisse." - D. Gale
n0ano@xxxxxxxxx
Ph: 303/443-3786

--
libvir-list mailing list
libvir-list@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/libvir-list