Seems like virtio (kvm 1.0) doesnt expose timeout on the guest side (ubuntu 12.04 on host and guest). So, how can i adjust the tinmeout on the guest ? This solution is the most logical one, but i cannot apply it! thanks for all the responses! regards Alejandro Comisario MercadoLibre Cloud Services Arias 3751, Piso 7 (C1430CRG) Ciudad de Buenos Aires - Argentina Cel: +549(11) 15-3770-1857 Tel : +54(11) 4640-8443 On Thu, Mar 27, 2014 at 5:53 AM, Stefan Hajnoczi <stefanha@xxxxxxxxx> wrote: > On Thu, Mar 27, 2014 at 10:10:40AM +0200, Michael S. Tsirkin wrote: >> On Thu, Mar 27, 2014 at 08:36:57AM +0100, Markus Armbruster wrote: >> > "Michael S. Tsirkin" <mst@xxxxxxxxxx> writes: >> > >> > > On Wed, Mar 26, 2014 at 11:08:03PM -0300, Alejandro Comisario wrote: >> > >> Hi List! >> > >> Hope some one can help me, we had a big issue in our cloud the other >> > >> day, a couple of our openstack regions ( +2000 kvm guests with qcow2 ) >> > >> went read only filesystem from the guest side because the backing >> > >> files directory (the openstack _base directory) was compromised and >> > >> the data was lost, when we realized the data was lost, it took us 5 >> > >> mins to restore the backup of the backing files, but by that time all >> > >> the kvm guests received some kind of IO error from the hypervisor >> > >> layer, and went read only on root filesystem. >> > >> >> > >> My question would be, is there a way to hold the IO operations against >> > >> the backing files ( i thought that would be 99% READ operations ) for >> > >> a little longer ( im asking this because i dont quite understand what >> > >> is the process and when it raises the error ) in a case the backing >> > >> files are missing (no IO possible) but is recoverable within minutes ? >> > >> >> > >> Any tip on how to achieve this if possible, or information about how >> > >> backing files works on kvm, will be amazing. >> > >> Waiting for feedback! >> > >> >> > >> kindest regards. >> > >> Alejandro Comisario >> > > >> > > >> > > I'm guessing this is what happened: guests timed out meanwhile. >> > > You can increase the timeout within the guest: >> > > echo 600 > /sys/block/sda/device/timeout >> > > to timeout after 10 minutes. >> > > >> > > If you have installed qemu guest agent on your system, you can do this >> > > from the host. Unfortunately by default it's memory can be pushed out to swap >> > > and then on disk error access there might will fail :( >> > > Maybe we should consider mlock on all its memory at least as an option. >> > > >> > > You could pause your guests, restart them after the issue is resolved, >> > > and we could I guess add functionality to pause VM on disk errors >> > > automatically. >> > > Stefan? >> > >> > Would -drive rerror=stop do? >> >> I think it will. It's a pity it doesn't appear in --help output - >> would make it easier to find. > > It is documented on the man page. I'll send a patch to document it in > the --help output too. > > But there's still a problem because the guest can have a shorter timeout > or the image may be NFS mounted on the host. In that case the guest may > give up on the request before the host. Then there is nothing QEMU can > do to avoid an error being returned to the application or the guest file > system going into read-only mode. > > So make sure the timeout inside the guest is high. > > Stefan -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html