On Mon, 2018-02-05 at 14:16 +0200, Sagi Grimberg wrote: > - Your latency measurements are surprisingly high for a null target > device (even for low end nvme device actually) regardless of the > transport implementation. > > For example: > - QD=1 read latency is 648.95 for ibnbd (I assume usecs right?) which is > fairly high. on nvme-rdma its 1058 us, which means over 1 millisecond > and even 1.254 ms for srp. Last time I tested nvme-rdma read QD=1 > latency I got ~14 us. So something does not add up here. If this is > not some configuration issue, then we have serious bugs to handle.. > > - QD=16 the read latencies are > 10ms for null devices?! I'm having > troubles understanding how you were able to get such high latencies > (> 100 ms for QD>=100) > > Can you share more information about your setup? It would really help > us understand more. I would also appreciate it if more information could be provided about the measurement results. In addition to answering Sagi's questions, would it be possible to share the fio job that was used for measuring latency? In https://events.static.linuxfound.org/sites/events/files/slides/Copy%20of%20IBNBD-Vault-2017-5.pdf I found the following: iodepth=128 iodepth_batch_submit=128 If you want to keep the pipeline full I think that you need to set the iodepth_batch_submit parameter to a value that is much lower than iodepth. I think that setting iodepth_batch_submit equal to iodepth will yield suboptimal IOPS results. Jens, please correct me if I got this wrong. Thanks, Bart.