Hi Bill, 2011/12/18 Bill Hastings <bllhastings@xxxxxxxxx>: > I am trying to get my feet wet with Ceph and RADOS. My aim is to use > it as a block device for KVM instances. My understanding is that > virtual disks get striped at 1 MB boundaries by default. Does that > mean that there are going to be 1MB files on disks? Yes, the virtual disk is striped over multiple objects. By default they have a size of 4MB (not 1MB). Ceph is storing objects, but in the end they will be written as files on the different object stores. > Let's say I want > to update a particular vdisk with 16 bytes of data at offset 4096. > This would mean I want to update the first 1MB chunk. Yes, but you don't need to write the whole chunk again. You can update the 16 bytes withour rewriting everything. (In fact rbd is using sparse objects by default - "thin provisioning"). > Let us assume I > have 3 way replication and the replicas are A, B and C. The write may > succeed at A and B and fail at C. Is there any state kept in the > metadata indicating at which replicas the write succeeded? Objects are grouped into placement groups (PGs).The ceph monitor is tracking the state of the PGs. With this information, the clients will be directed to the working replicas. When an object store is failing, it will start rebuilding the missing objects on other object stores. Regards, Christian -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html