On Tue, 17 Apr 2012 11:14:11 -0400 Jeff Layton <jlayton@xxxxxxxxxx> wrote: > On Tue, 17 Apr 2012 16:34:48 +0200 > Lukas Hejtmanek <xhejtman@xxxxxxxxxxx> wrote: > > > Hi, > > > > On Tue, Apr 10, 2012 at 09:13:21AM -0400, Jeff Layton wrote: > > > Nope. It'll all work just great...until it doesn't. I don't have any > > > specific failure scenarios, but most of the problems will be issues > > > with state recovery when a server node is restarted. > > > > > > That may manifest in different ways -- problems reclaiming locks for > > > instance, or even silent data corruption depending on the application. > > > > would it work if I relax active-active scenario to just active-passive in the > > following way: > > > > Server A actively exports /export/A > > Server B actively exports /export/B > > > > Server B is passive backup for Server A > > Server A is passive backup for Server B > > > > would it work to migrate the failed Server B to Server A so that Server A will > > server both /export/A and /export/B? > > > > There will be a problem with v4recovery dir. Would it be possible just to > > merge v4recovery from Server B to Server A (nfs export would be stopped while > > merging v4recovery). > > > > It seems that cp -r B/v4recovery/* A/v4recovery/ would do all the things. Am > > I right? > > > > Do I need to copy recovery state if I delay migration of the failed Server B to > > Server A for 91 secs? I.e., longer than lease expiry time.. Or do I still need > > a record for the client in v4recovery dir in such a case? > > > > That'll still be dangerous. Suppose (for instance) that a client1 lost > communication with server B for a period of time and then it expired > the lease and handed out a lock to client2 that it was holding > previously. client2 modifies the file and drops the lock. At the same > time, client1 has uninterrupted communication with serverA, and holds > state on it. > > Eventually, you fail over server B and merge the directories. client1 > attempts to renew its lease, but gets back an error and starts > reclaiming things. Now, server B would have denied reclaim of that lock > -- its lease had expired, but in this case it's allowed because you > merged the directory and it client1 held state on serverA. client1 > reclaims the lock and thinks that it's held the lock the entire time -- > data corruption and other hilarity ensues... > Now that I've had some time to think about this, you may actually be OK to just merge those directories when you fail over. The caveat is that you need to know for certain that the clients are using non-uniform clientid strings when they talk to the server. When a client makes a SETCLIENTID call to the server, it sends an opaque identifier string to the server. Traditionally (and I think per a SHOULD in the RFC) Linux clients have varied that string based on the IP address of the server. That's called the non-UCS (uniform client string) based model. There is some debate on this practice though, as it makes it difficult to identify clients for recovery purposes in migration scenarios (Dave Novak has a paper on this). In order to facilitate that, we're considering moving to a UCS based model in the linux client. The upshot here is that if you do it that way, then a client that holds state on both server addresses will look like two different clients even after the service floats to the backup server. In that case, you'd have no problems with reclaim (in principle, of course!). The catch here is that if any clients have a UCS based model for generating client strings (where the client string is invariant vs. the server's IP address), then you'll be subject to the scenario above. Still, merging those directories is enough of an uncharted territory that I'd advise against it even if it would theoretically work. -- Jeff Layton <jlayton@xxxxxxxxxx> -- To unsubscribe from this list: send the line "unsubscribe linux-nfs" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html