Hi,

Quoting Stefan Kooman (stefan@xxxxxx):
> Hi,
>
> TL;DR: we see "used" memory grow indefinitely on our OSD servers,
> until the point that either 1) an OSD process gets killed by the OOM killer,
> or 2) an OSD aborts (probably because malloc cannot provide more RAM). I
> suspect a memory leak in the OSDs.

I got quite a bit of feedback on this thread, thanks for that!

I'm pretty sure we were not hit by a Ceph memory leak, but by a memory leak
in the Intel i40e driver, specifically in Linux kernel 4.13 (Ubuntu Xenial
HWE), see [1].

Running a 4.13 kernel with an Intel X710? You will definitely want to update
to 4.13.0-38, where this issue is fixed. We have been running this kernel for
a week or so now and memory is "under control". Now it's time to crank up the
BlueStore cache again :-).

FYI.

Gr. Stefan

[1]: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1748408

-- 
| BIT BV  http://www.bit.nl/  Kamer van Koophandel 09090351
| GPG: 0xD14839C6  +31 318 648 688 / info@xxxxxx
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com