> -----Original Message-----
> From: Gavriliuk, Anton (HPS Ukraine)
> Sent: Monday, December 4, 2017 11:12 AM
...
> I have run next command several times,
>
> perf record /usr/local/bin/fio --filename=/dev/dax0.0 --rw=randrw
> --refill_buffers --norandommap --randrepeat=0 --ioengine=mmap
> --bssplit=4k/4:8k/7:16k/7:32k/15:64k/65:128k/1:256k/1 --rwmixread=5
> --iodepth=1 --numjobs=16 --runtime=1800 --group_reporting
> --name=4-rand-rw-3xx --size=290g --numa_cpu_nodes=0
>
> each perf report shows me toppest clock_thread_fn(),
>
> Samples: 8M of event 'cycles:ppp', Event count (approx.): 23391179916235
> Overhead  Command  Shared Object  Symbol
>   75.27%  fio      fio            [.] clock_thread_fn
>   13.68%  fio      libc-2.22.so   [.] __memcpy_avx_unaligned
>   10.05%  fio      fio            [.] fill_random_buf

This just runs one pass through memory, so the startup overhead is
magnified. Without time_based, fio stops as soon as each job has
transferred its --size worth of data, even if --runtime has not elapsed.
Add --time_based=1 so it really runs for 1800 s.
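
For illustration, the same options can be written as a fio job file with
time_based added. This is only a sketch of the quoted command line, not a
tested configuration; the file name dax-randrw.fio is hypothetical, and
numa_cpu_nodes assumes a fio build with libnuma support.

```ini
; dax-randrw.fio (hypothetical file name)
; Same options as the quoted command line, plus time_based.
[4-rand-rw-3xx]
filename=/dev/dax0.0
rw=randrw
refill_buffers
norandommap
randrepeat=0
ioengine=mmap
bssplit=4k/4:8k/7:16k/7:32k/15:64k/65:128k/1:256k/1
rwmixread=5
iodepth=1
numjobs=16
; runtime is in seconds; time_based keeps the job running for the
; full runtime instead of stopping after one pass over size
runtime=1800
time_based
group_reporting
size=290g
numa_cpu_nodes=0
```

It would then be profiled the same way as before, for example with
"perf record /usr/local/bin/fio dax-randrw.fio".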