Optimizing mmap_queue on AVX/AVX2 CPUs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I'm not sure how far we want to get into optimizing fio for specific CPUs?

I've done some testing and found that when running the mmap ioengine against an NVDIMM-N on a modern Intel CPU I can gain a few hundred MB/s by optimizing the memory copy using avx/avx2 versus the system's memcpy implementation.


Should I proceed with submitting a patch, or do we want to avoid getting into these sort of optimizations?


--
Rebecca

--
To unsubscribe from this list: send the line "unsubscribe fio" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Kernel]     [Linux SCSI]     [Linux IDE]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux SCSI]

  Powered by Linux