Hello! I'm a GSoC student this year and my job is to introduce Missing Rate Curve (or reuse distance exactly) of objects into OSD. Now I'm trying to find a proper algorithm to implement but there is a problem: Should I take the number of objects tracked in an OSD as infinite or constant? The point is that there is an algorithm that use hash to sample only constant number of references to do the analysis and is proved to be accurate, which makes it possible to do online MRC construction. That accuracy is supported by the fact that the memory addresses is bounded, while objects can be deleted and created again and again in Ceph. Is is reasonable to think that an OSD only serves bounded number of objects in its life time (or the time period that we want to compute MRC)? Any other comment about this project is also welcomed :) -- Li Peilun (李沛伦) Yao Class J10 Institute for Interdisciplinary Information Sciences Tsinghua University Beijing,100084 P.R.China Tel:86-18810671857 E-mail: lpl6338236@xxxxxxxxx -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html