Optimizing mmap_queue on AVX/AVX2 CPUs

Rebecca Cran <rebecca@xxxxxxxxxxxx> · Fri, 25 Aug 2017 19:46:10 -0600

I'm not sure how far we want to go in optimizing fio for specific CPUs.

I've done some testing and found that when running the mmap ioengine
against an NVDIMM-N on a modern Intel CPU, I can gain a few hundred MB/s
by doing the memory copy with AVX/AVX2 instructions instead of the
system's memcpy implementation.
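
To make the idea concrete, here is a minimal sketch of that kind of copy
routine. It is not the actual patch: the names copy_avx2 and copy_dispatch
are hypothetical, it assumes GCC or Clang on x86, and it falls back to
memcpy when the CPU does not report AVX2. The load/store intrinsics used
here only need AVX; the sketch checks for AVX2 to stay on the conservative
side.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/*
 * Hypothetical helper, not part of fio: copy len bytes using 256-bit
 * unaligned vector loads and stores. The target attribute lets this one
 * function use AVX2 code generation without building the whole file
 * with -mavx2.
 */
__attribute__((target("avx2")))
static void copy_avx2(void *dst, const void *src, size_t len)
{
	uint8_t *d = dst;
	const uint8_t *s = src;

	/* Move 32 bytes per iteration. */
	while (len >= 32) {
		__m256i v = _mm256_loadu_si256((const __m256i *)s);

		_mm256_storeu_si256((__m256i *)d, v);
		s += 32;
		d += 32;
		len -= 32;
	}

	/* Fewer than 32 bytes remain; let memcpy handle the tail. */
	if (len)
		memcpy(d, s, len);
}
#endif

/*
 * Hypothetical dispatcher: use the vector path only when the running
 * CPU reports AVX2 support, otherwise use the system memcpy.
 */
static void copy_dispatch(void *dst, const void *src, size_t len)
{
	assert(len == 0 || (dst != NULL && src != NULL));

#if defined(__x86_64__) || defined(__i386__)
	if (__builtin_cpu_supports("avx2")) {
		copy_avx2(dst, src, len);
		return;
	}
#endif
	memcpy(dst, src, len);
}
```

Checking the CPU at run time rather than at build time means a single
binary still works on machines without AVX2. Whether a loop like this
beats the libc memcpy depends on the CPU and on the memory being copied
to, so any gain would have to be measured on the target system.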

Should I proceed with submitting a patch, or do we want to avoid getting
into this sort of optimization?

--
Rebecca
