... one more thing: I was also thinking that we need a new RBD feature bit to be used to indicate that an image is part of a consistency group to prevent older librbd clients from removing the image or group snapshots. This could be a RBD_FEATURES_RW_INCOMPATIBLE feature bit so older clients can still open the image R/O while its part of a group. On Tue, Aug 16, 2016 at 9:26 AM, Jason Dillaman <jdillama@xxxxxxxxxx> wrote: > Way back in April when we had the CDM, I was originally thinking we > should implement option 3. Essentially, you have a prepare group > snapshot RPC message that extends a "paused IO" lease to the caller. > When that lease expires, IO would automatically be resumed even if the > group snapshot hasn't been created yet. This would also require > commit/abort group snapshot RPC messages. > > However, thinking about this last night, here is another potential option: > > Option 4 - require images to have the exclusive lock feature before > they can be added to a consistency group (and prevent disabling of > exclusive-lock while they are part of a group). Then librbd, via the > rbd CLI (or client application of the rbd consistency group snap > create API), can co-operatively acquire the lock from all active image > clients within the group (i.e. all IO has been flushed and paused) and > can proceed with snapshot creation. If the rbd CLI dies, the normal > exclusive lock handling process will automatically take care of > re-acquiring the lock from the dead client and resuming IO. > > This option not only re-uses existing code, it would also eliminate > the need to add/update the RPC messages for prepare/commit/abort > snapshot creation to support group snapshots (since it could all be > handled internally). > > On Mon, Aug 15, 2016 at 7:46 PM, Victor Denisov <vdenisov@xxxxxxxxxxxx> wrote: >> Gentlemen, >> >> I'm writing to you to ask for your opinion regarding quiescing writes. >> >> Here is the situation. In order to take snapshots of all images in a >> consistency group, >> we first need to quiesce all the image writers in the consistency group. >> Let me call >> group client - a client which requests a consistency group to take a snapshot. >> Image client - the client that writes to an image. >> Let's say group client starts sending notify_quiesce to all image >> clients that write to the images in the group. After quiescing half of >> the image clients the group client can die. >> >> It presents us with a dilemma - what should we do with those quiesced >> image clients. >> >> Option 1 - is to wait till someone manually runs recover for that >> consistency group. >> We can show warning next to those unfinished groups when user runs >> group list command. >> There will be a command like group recover, which allows users to >> rollback unsuccessful snapshots >> or continue them using create snapshot command. >> >> Option 2 - is to establish some heart beats between group client and >> image client. If group client fails to heart beat then image client >> unquiesces itself and continues normal operation. >> >> Option 3 - is to have a timeout for each image client. If group client >> fails to make a group snapshot within this timeout then we resume our >> normal operation informing group client of the fact. >> >> Which of these options do you prefer? Probably there are other options >> that I miss. >> >> Thanks, >> Victor. > > > > -- > Jason -- Jason -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html