On Wed, 2012-01-11 at 10:55 +1030, Rusty Russell wrote: > On Tue, 10 Jan 2012 19:03:36 +0200, "Michael S. Tsirkin" <mst@xxxxxxxxxx> wrote: > > On Wed, Dec 21, 2011 at 11:03:25AM +1030, Rusty Russell wrote: > > > Yes. The idea that we can alter fields in the device-specific config > > > area is flawed. There may be cases where it doesn't matter, but as an > > > idea it was holed to begin with. > > > > > > We can reduce probability by doing a double read to check, but there are > > > still cases where it will fail. > > > > Okay - want me to propose an interface for that? > > Had a brief chat with BenH (CC'd). Thanks Rusty :-) > I think we should deprecate writing to the config space. Only balloon > does it AFAICT, and I can't quite figure out *why* it has an 'active' > field. This solves half the problem, of sync guest writes. For the > other half, I suggest a generation counter; odd means inconsistent. The > guest can poll. Agreed. Commands are the way to go. > BenH also convinced me we should finally make the config space LE if > we're going to change things. Since PCI is the most common transport, > guest-endian confuses people. And it sucks for really weird machines. I have a few more "good" reasons but yes, basically it confuses things. Just see the bugs we are still fixing in qemu :-) A fixed endian makes everything simpler, there is no need to play games when using virtio in heterogenous environment (slave CPUs), no need to handle variable endian on CPUs that support both etc... and the cost of byteswap is negligible (we do it routinely for normal PCI hardware and it's never been a hot spot). > We should also change the ring (to a single ring, I think). Descriptors > to 24 bytes long (8 byte cookie, 8 byte addr, 4 byte len, 4 byte flags). Do we care about space waste ? 32 bytes is a nicely round power of two and would make general math, but also debugging (visual inspection of the ring) etc... easier. If we support immediate data, that means we'd have enough for most common commands and status straight in the ring. Another approach that may or may not be worthwhile is to have the size of the ring elements configurable (as power of two) so that it can be adjusted to contain both immediate and a descriptor. This is done typically in network hardware to put the routing headers in the immediate part and the rest of the packet elsewhere which improves cache affinity when processing large amount of packets. > We might be able to squeeze it into 20 bytes but that means packing. We > should support inline, chained or indirect. Let the other side ack by > setting flag, cookie and len (if written). > > Moreover, I think we should make all these changes at once (at least, in > the spec). That makes it a big change, and it'll take longer to > develop, but makes it easy in the long run to differentiate legacy and > modern virtio. Agreed, Cheers, Ben. _______________________________________________ Virtualization mailing list Virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx https://lists.linuxfoundation.org/mailman/listinfo/virtualization