Re: [RFC]VM live snapshot proposal

Stefan Hajnoczi <stefanha@xxxxxxxxx> · Mon, 3 Mar 2014 13:32:34 +0100

On Mon, Mar 03, 2014 at 01:13:41AM +0000, Huangpeng (Peter) wrote:

Just to summarize the idea of live savevm for people joining the
discussion:

It should be possible to save a snapshot of the guest (including memory,
devices, and disk) without noticable downtime.

The 'savevm' command pauses the guest until the snapshot has been
completed and therefore doesn't meet the requirements.

> Here I have another proposal, based on the live-migration scheme, add consistent 
> memory state tracking and saving.
> The idea is simple:
> 1.First round use live-migration to save all memory to a snapshot file.
> 2.intercept the action of memory-modify, save old pages to a temporary file and mark dirty-bits,
> 3.Merge temporary file to the original snapshot file
> 
> Detailed process:
> (1)Pause VM
> (2) Save the device status to a temporary file (live-migration already supported )
> (3) Make disk snapshot
> (4) Enable page dirty log and old dirty pages save function(which we need to add)
> (5) Resume VM
> (6) Begin the first round of iteration, we save the entire contents of the VM memory pages
> to the snapshot file
> (7) In the second round of iteration , we save the old page to the snapshot file
> (8) Merge data of device status which is pre-saved in temporary files to the snapshot file
> (8) End ram snapshot and some cleanup work
> 
> Due to memory-modifications may happen in kvm, qemu, or vhost, the key-part is how we
> can provide common page-modify-tracking-and-saving api, we completed a prototype by 
> simply add modified-page tracking/saving function in qemu, and it seems worked fine.

Yes, this is the tricky part.  To be honest, I think this is the reason
no one has submitted patches - it's a hard task and the win isn't that
great (you can already migrate to file).

But back to the options:

If the host has enough free memory to fork QEMU, a small helper process
can be used to save the copy-on-write memory snapshot (thanks to fork(2)
semantics).  The hard part about the fork(2) approach is that QEMU isn't
really designed to fork, so work is necessary to reach a quiescent state
for the child process.

If there is not enough memory to fork, then a synchronous approach to
catching guest memory writes is needed.  I'm not sure if a good
mechanism for that exists but the simplest would be mprotect(2) and a
signal handler (which will make the guest run very slowly).

Stefan
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html