Ok, question; Does TAP:AIO work on networked filesystems .. my testing says not. (I've tried on local filesystems and seem to have it partially working) XEN without a good underlying cluster filesystem is to say the least "limited". If you can't use AIO in this sort of environment, it also becomes limited .. and if libvirt requires this, it means libvirt can't be used on large-scale roll-outs on (certain?) network filesystems. (I'm using gluster) Interestingly (looking at the Xen list re; performance problems) I'm getting > 65Mb/sec on my DomU's .vs. 70Mb/sec on my host nodes Dom0 ... and it's proving to be reliable atm .. Be interesting to know why Xen only documents "file" given the critical nature / apparent flaws in the driver. fyi; I run root fs's off DomU's but store data on separately mounted shared filesystems .. (more efficient when it comes to fail-over / file-system self-heal in the event of a filesystem node outage) Is there "no" way of forcing a loopback interface to flush itself? Gareth. ----- Original Message ----- step 3.: "Daniel P. Berrange" <berrange@xxxxxxxxxx> To: "Gareth Bult" <gareth@xxxxxxxxxxxxx> Cc: "libvir-list" <libvir-list@xxxxxxxxxx> Sent: 27 January 2008 02:14:26 o'clock (GMT) Europe/London Subject: Re: Re; virDomainBlockStats On Sat, Jan 26, 2008 at 11:29:54PM +0000, Gareth Bult wrote: > Is there any documentation to the effect that "file" is bad?? > The official XEN documentation lists "file" as the standard and makes no mention of "aio". The upstream documentation leaves alot to be desired. First of all I should say, there is a difference between file: with PV vs file: with HVM. file: with HVM has no problem, because all IO is handled by QEMU. It is only file: with PV guests that is flawed. This is because it uses loopback devices to acess the file. Loop devices cache data writes in memory for an undefined amount of time, even if the guest requests a sycn to disk. The result is that if the host OS crashes your guest disks can loss huge amounts of data that they thought were already synced to disk. Not even a journalling FS helps, because there's no write ordering guarentees. tap:aio: addresses all these issues by doing Direct IO + Async I/O to allow you to have multiple outstanding I/O operations, to avoid the host OS buffer cache, and to provide firm guarentee that the data is on disk when the guest asks for it to be. > On Sat, Jan 19, 2008 at 05:18:58PM +0000, Gareth Bult wrote: > > Mmm, > > > > Interesting .. > > > > First off, xentop doesn't display block device stats for tap:aio based systems and it does for file. > > Second, tap:aio generated kernel Ooops's when you shutdown a DomU. > > > > Not exactly what I'd call mainstream (!) > > That must be a flaw in Ubuntu's kernels. tap:aio works flawlessly > in Fedora / RHEL and is the only supported option, because file: > has catastrophic data loss issues during host crashes. Dan. -- |=- Red Hat, Engineering, Emerging Technologies, Boston. +1 978 392 2496 -=| |=- Perl modules: http://search.cpan.org/~danberr/ -=| |=- Projects: http://freshmeat.net/~danielpb/ -=| |=- GnuPG: 7D3B9505 F3C9 553F A1DA 4AC2 5648 23C1 B3DF F742 7D3B 9505 -=| -- Libvir-list mailing list Libvir-list@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/libvir-list