Re: Snapshots of consistency groups

Jason Dillaman <jdillama@xxxxxxxxxx> · Tue, 16 Aug 2016 09:26:21 -0400

Way back in April when we had the CDM, I was originally thinking we
should implement option 3. Essentially, you have a prepare group
snapshot RPC message that extends a "paused IO" lease to the caller.
When that lease expires, IO would automatically be resumed even if the
group snapshot hasn't been created yet.  This would also require
commit/abort group snapshot RPC messages.

However, thinking about this last night, here is another potential option:

Option 4 - require images to have the exclusive lock feature before
they can be added to a consistency group (and prevent disabling of
exclusive-lock while they are part of a group). Then librbd, via the
rbd CLI (or client application of the rbd consistency group snap
create API), can co-operatively acquire the lock from all active image
clients within the group (i.e. all IO has been flushed and paused) and
can proceed with snapshot creation. If the rbd CLI dies, the normal
exclusive lock handling process will automatically take care of
re-acquiring the lock from the dead client and resuming IO.

This option not only re-uses existing code, it would also eliminate
the need to add/update the RPC messages for prepare/commit/abort
snapshot creation to support group snapshots (since it could all be
handled internally).

On Mon, Aug 15, 2016 at 7:46 PM, Victor Denisov <vdenisov@xxxxxxxxxxxx> wrote:
> Gentlemen,
>
> I'm writing to you to ask for your opinion regarding quiescing writes.
>
> Here is the situation. In order to take snapshots of all images in a
> consistency group,
> we first need to quiesce all the image writers in the consistency group.
> Let me call
> group client - a client which requests a consistency group to take a snapshot.
> Image client - the client that writes to an image.
> Let's say group client starts sending notify_quiesce to all image
> clients that write to the images in the group. After quiescing half of
> the image clients the group client can die.
>
> It presents us with a dilemma - what should we do with those quiesced
> image clients.
>
> Option 1 - is to wait till someone manually runs recover for that
> consistency group.
> We can show warning next to those unfinished groups when user runs
> group list command.
> There will be a command like group recover, which allows users to
> rollback unsuccessful snapshots
> or continue them using create snapshot command.
>
> Option 2 - is to establish some heart beats between group client and
> image client. If group client fails to heart beat then image client
> unquiesces itself and continues normal operation.
>
> Option 3 - is to have a timeout for each image client. If group client
> fails to make a group snapshot within this timeout then we resume our
> normal operation informing group client of the fact.
>
> Which of these options do you prefer? Probably there are other options
> that I miss.
>
> Thanks,
> Victor.

-- 
Jason
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html