On Fri, Jul 22, 2016 at 09:57:26AM +1000, Brad Hubbard wrote: > On Thu, Jul 21, 2016 at 08:41:42PM +0200, Wido den Hollander wrote: > > > > > Op 21 juli 2016 om 20:01 schreef Gregory Farnum <gfarnum@xxxxxxxxxx>: > > > > > > > > > On Thu, Jul 21, 2016 at 8:49 AM, Wido den Hollander <wido@xxxxxxxx> wrote: > > > > Hi, > > > > > > > > On a CentOS 7 system with a Jewel 10.2.2 cluster I'm trying to create a pool which fails. > > > > > > > > Any pool I try to create, with or without a ruleset applied to it fails with this error: > > > > > > > > "Error ENOMEM: crushtool check failed with -12: failed run crushtool: fork failed: (12) Cannot allocate memory" > > > > > > > > At first I thought it was a package version mismatch, but it doesn't seem to be the case. > > > > > > > > There are other commands like 'radosgw-admin' I see fail with -12 error codes as well. > > > > > > > > Any ideas what might be going on here? The system has roughly 29GB of free memory, so that should be sufficient. > > > > > > ulimits? > > > > Good suggestion, didn't check that, but after looking at them I don't think they are: > > > > [root@srv-zmb16-21 ~]# ulimit -a > > core file size (blocks, -c) 0 > > data seg size (kbytes, -d) unlimited > > scheduling priority (-e) 0 > > file size (blocks, -f) unlimited > > pending signals (-i) 128505 > > max locked memory (kbytes, -l) 64 > > max memory size (kbytes, -m) unlimited > > open files (-n) 65536 > > pipe size (512 bytes, -p) 8 > > POSIX message queues (bytes, -q) 819200 > > real-time priority (-r) 0 > > stack size (kbytes, -s) 8192 > > cpu time (seconds, -t) unlimited > > max user processes (-u) 4096 > > virtual memory (kbytes, -v) unlimited > > file locks (-x) unlimited > > [root@srv-zmb16-21 ~]# > > > > Wouldn't you say? > > > As a test try doubling vm.max_map_count. We've seen the ENOMEM before in cases > where the number of memory allocations mapped by a process exceeded this value. > Note that if this is the issue it likely indicates somewhere in excess of > 32700 threads are being created so you may want to look at just how many > threads *are* being created when this issue is seen as well as taking a look > at the /proc/<PID>/maps file for the process to verify the number of > allocations. If you are seeing > 32700 threads created we should look at > whether that number makes sense in your environment. Sorry, I looked back at the actual issue I recalled and the error in that case was "Resource temporarily unavailable" or EAGAIN/EWOULDBLOCK. Might still be worth looking at the number of threads we are dealing with here and the number, and type, of memory allocations since it's fork that's failing after all. > > HTH, > Brad > > > > > I also checked, SELinux is disabled. 'setenforce 0'. > > > > Wido > > > > > -- > > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > > > the body of a message to majordomo@xxxxxxxxxxxxxxx > > > More majordomo info at http://vger.kernel.org/majordomo-info.html > > -- > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > > the body of a message to majordomo@xxxxxxxxxxxxxxx > > More majordomo info at http://vger.kernel.org/majordomo-info.html -- Cheers, Brad -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html