My problems were memory pressure plus an XFS bug, so it took a while to manifest.
The following (long, ongoing) thread on linux-mm discusses our [severe] problems with memory pressure taking out entire OSD servers. The upstream problems are still unresolved as at Linux 3.18, but anyone running Ceph on XFS over especially Infiniband or *anything* that does custom allocation in the kernel should probably be aware of this.
http://marc.info/?l=linux-mm&m=141605213522925&w=2
AfC
Sydney
-- Andrew Frederick Cowie Head of Engineering Anchor Systems afcowie anchor hosting |
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com