We recently made the async messenger the default in master. After doing a
bunch of bisection, it turns out this caused a fairly dramatic decrease
in bluestore random read performance. The test cluster has fairly fast
NVMe cards and 16 OSDs across 4 OSD hosts, running Ceph master with
bluestore. There are 8 fio client processes with 32 concurrent threads
each.
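Roughly, each of the 8 fio client jobs looks something like this (a sketch
only; the librbd engine, the client/pool/image names, the 4K block size,
and the runtime are placeholders I'm filling in, not the exact job file):

[global]
# assumption: librbd engine with placeholder client/pool/image names
ioengine=rbd
clientname=admin
pool=rbd
rbdname=fio-test
# small random reads; 4K block size and 5-minute runtime are assumptions
rw=randread
bs=4k
time_based
runtime=300

[randread]
# "32 concurrent threads" per client process, expressed here as queue depth
iodepth=32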
Parameters tweaked:
ms_async_send_inline
ms_async_op_threads
ms_async_max_op_threads
simple messenger:       168K IOPS

async, send_inline = true:
  3/5 threads:    111K IOPS
  4/8 threads:    125K IOPS
  8/16 threads:   128K IOPS
  16/32 threads:  128K IOPS
  24/48 threads:  128K IOPS
  25/50 threads:  segfault
  26/52 threads:  segfault
  32/64 threads:  segfault

async, send_inline = false:
  3/5 threads:    153K IOPS
  4/8 threads:    153K IOPS
  8/16 threads:   152K IOPS
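For reference, the "N/M threads" notation above is ms_async_op_threads /
ms_async_max_op_threads, so the best async run corresponds to roughly these
ceph.conf overrides (a sketch; putting them under [global] is my assumption,
and ms_type = async is redundant now that it's the default in master):

[global]
ms_type = async
# best async result above: 3/5 threads with inline sends disabled
ms_async_send_inline = false
ms_async_op_threads = 3
ms_async_max_op_threads = 5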
So definitely setting send_inline to false helps pretty dramatically,
though we're still a little slower for small random reads than simple
messenger. Haomai, regarding the segfaults: I took a quick look at the
core file with gdb but didn't see anything immediately obvious. It might
be worth seeing if you can reproduce them on your end.
On the performance front, I'll see whether anything obvious shows up in
perf.
Mark