Re: [PATCH v8 03/21] backup: Introduce virDomainCheckpoint APIs

Peter Krempa <pkrempa@xxxxxxxxxx> · Thu, 25 Apr 2019 16:36:54 +0200

On Wed, Apr 24, 2019 at 13:43:03 -0500, Eric Blake wrote:
> On 4/24/19 8:26 AM, Peter Krempa wrote:
> > On Wed, Apr 17, 2019 at 09:09:03 -0500, Eric Blake wrote:
> >> Introduce a bunch of new public APIs related to backup checkpoints.
> >> Checkpoints are modeled heavily after virDomainSnapshotPtr (both
> >> represent a point in time of the guest), although a snapshot exists
> >> with the intent of rolling back to that state, while a checkpoint
> >> exists to make it possible to create an incremental backup at a later
> >> time.
> >>
> >> The following map shows the API relations to snapshots, with new APIs
> >> on the right:
> >>
> >> Operate on a domain object to create/redefine a child:
> >> virDomainSnapshotCreateXML          virDomainCheckpointCreateXML
> >>
> >> Operate on a child object for lifetime management:
> >> virDomainSnapshotDelete             virDomainCheckpointDelete
> >> virDomainSnapshotFree               virDomainCheckpointFree
> >> virDomainSnapshotRef                virDomainCheckpointRef
> >>
> >> Operate on a child object to learn more about it:
> >> virDomainSnapshotGetXMLDesc         virDomainCheckpointGetXMLDesc
> >> virDomainSnapshotGetConnect         virDomainCheckpointGetConnect
> >> virDomainSnapshotGetDomain          virDomainCheckpointGetDomain
> >> virDomainSnapshotGetName            virDomainCheckpiontGetName
> >> virDomainSnapshotGetParent          virDomainCheckpiontGetParent
> >> virDomainSnapshotHasMetadata        virDomainCheckpointHasMetadata

One additional thing I remembered is that snapshots without metadata are
a dead end and don't make much sense. While backups without metadata can
be at least somewhat more usable since we don't really need a VM
definition to go with them I'd consider dropping that notion completely.

If users wan't to use backups, metadata should be present at all time.

> >> virDomainSnapshotIsCurrent          virDomainCheckpointIsCurrent
> > 
> > The 'current' checkpoint has very sparse documentation. While it makes
> > some sense for the current snapshot to exist I'm not persuaded we need
> > this for checkpoints.
> > 
> > In case of checkpoints the bitmaps backing it track the state even if
> > you create a new bitmap unlike checkpoints. Thus it does not seem to
> > make sense to me.
> > 
> > If you think it does please elaborate, ideally in form of documentation.
> 
> I can add documentation, but the short answer is that there are two ways
> to implement differential backups:
> 
> 1. One active bitmap for every point in time that you might ever need a
> differential backup from; if you have 20 checkpoints, then you have 20
> active bitmaps, and each guest write has to potentially modify all 20
> bitmaps, but backups can directly use the desired bitmap and nothing
> else for that particular backup:
> 
> Time:    T1    T2     T3     T4   present
> Bitmap1: --------------------------------
> Bitmap2:       --------------------------
> Bitmap3:              -------------------
> Bitmap4:                     ------------
> 
> 2. One disabled bitmap for every delta between two consecutive points in
> time, plus one active bitmap since the most recent point in time; if you
> have 20 checkpoints, then you have 19 disabled bitmap and 1 active 1,
> where each guest write only has to modify 1 bitmap, but backups have to
> construct a temporary bitmap that contains the union of all bitmaps that
> span the time in question:
> 
> Time:    T1    T2     T3     T4   present
> Bitmap1: ------
> Bitmap2:       -------
> Bitmap3:              -------
> Bitmap4:                     ------------
> 
> Given that guest writes are more frequent than backup operations, and
> that performing the creation of a temporary bitmap to merge together
> other bitmaps at the start of a bitmap is not all that onerous, the code
> goes with approach 2. And, since there is always 1 active bitmap while
> the rest are disabled, it lends itself well to the notion of a current
> checkpoint (one that is tracking active guest writes).

This makes sense as long as other implementations do/allow the same. If
you have a hypervisor that implements only type 1 you won't have another
option.

In that case though we should model it differently. The current or
active bitmap/checkpoint should be that one that actually receives
bitmap updates. That means that the value should reflec the actual state
from the VM rather than something saved. The problem is that this is not
compatible with the mode described above. As the notion of the 'current'
checkpoint supports only 1, while there can be possibly more of them
that receive updates.

Also it seems to me that the user does not need to care which one is
active at all. It's just internal implementation. User only cares about
the times when they were created and whether they can do a backup of
differences between a time in the past and now.

Also it's probably good to mention that a bitmap currently will not
support a difference between two points in the past, that means that
from the top level, both behave identically.

I just don't see enough solid reasons to have 'current' as a property of
the checkpoint.

> >> Operate on a domain object to list all children:
> >> virDomainSnapshotNum                (no counterpart, this is the old
> >> virDomainSnapshotListNames           racy interface)
> >> virDomainSnapshotListAllSnapshots   virDomainListAllCheckpoints
> >>
> >> Operate on a child object to list descendents:
> >> virDomainSnapshotNumChildren        (no counterpart, this is the old
> >> virDomainSnapshotListChildrenNames   racy interface)
> >> virDomainSnapshotListAllChildren    virDomainCheckpointListAllChildren
> >>
> >> Operate on a domain to locate a particular child:
> >> virDomainSnapshotLookupByName       virDomainCheckpointLookupByName
> >> virDomainHasCurrentSnapshot         virDomainHasCurrentCheckpoint
> >> virDomainSnapshotCurrent            virDomainCheckpointCurrent
> >>
> >> Operate on a snapshot to roll back to earlier state:
> >> virDomainSnapshotRevert             (no counterpart, instead checkpoints
> >>                                      are used in incremental backups via
> > 
> > This patch or a different one should also add docs to virDomainSnapshotRevert
> > that outline what happens to the checkpoints when reverting snapshots.
> 
> Right now, it needs to be an error (until we actually design how the two
> should play together nicely, the most conservative approach is to state
> that reverting a domain with checkpoints risks enough confusion to be
> forbidden in the initial implementation, even if we lift that
> restriction later when we iron out how it should work).

I think we need more than that. At the point when a snapshot is taken it
needs also snapshot the state of checkpoints. Otherwise we won't be able
to reconstruct the state from the point when the snapshot was taken.
Attachment:
signature.asc

Description: PGP signature
--
libvir-list mailing list
libvir-list@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/libvir-list