On Mon, 5 Jun 2017, Leon Romanovsky wrote: > It is rough calculation for 1MB, when I asked Ahmad about this > limitation (4K) he explained to me that exposed device memory is less > than 1MB. Still doesnt that mean more than 256 MPI instances or so per node? The use case for a semaphore indicates that a 4k page would be shared between multiple processes? Therefore there is even less of a need of multiple pages. You may not be able to avoid the 4k page since page protection works only on a 4k level. The kernel futexes rely on 4k page protection tricks. Please come up with a reasonable use case here.... We do not run MPI but our use cases work fine with mmapped 4k pages. There are some who actually would like 2M pages for that use case since some of the adapters have quite a bit of memory available. A small object allocator with the need to go through an intermediate layer seems to be not very productive. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html