On Thu, 14 Jul 2011, na an wrote: > hi all: > > when MDS is to restart , it will replay journal, and client will > replay 'unsafe requests' that have replyed from that MDS while the > metadata change maybe have not been writen to disk actually. > I have a question that if there are many clients, how ceph keeps > consistence? because multiple clients may operate on the same file, > just like one to create a file and another to delete it. > And how the ceph deal with such a situation that one or more > clients fail in the process of clientsreplay ? once it happen, the > whole system may be untrusty. > if we have hundreds and thousands of clients , does it may take > a long time to recover? The MDS locking operates such that no side-effect of one client's actions are exposed to any other client until they have committed to the journal. That means you won't have dependent operations being replayed by the clients (unless they also committed on the MDS and the client just didn't know that yet, in which case the replay is a no-op). There is currently an annoyance with the clientreplay that forces a log flush on each operation due to a subtle interaction with the locking/caps code, but it's fixable. See #90. sage -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html