rados bench single instance vs. multiple instances

"Deneau, Tom" <tom.deneau@xxxxxxx> · Mon, 11 May 2015 14:18:21 +0000

I have noticed the following while running rados bench seq read tests with a 40M object size

    single rados bench, 4 concurrent ops,                     bandwidth = 190 MB/s
    4 copies of rados bench, 1 concurrent op each,  aggregate bandwidth = 310 MB/s

and in fact the single rados bench seems limited to 190, no matter how many concurrent ops.

I don't see this kind of behavior with a 4M object size.

(The above are with caches dropped on the osd targets)

It doesn't seem to be related to the total number of bytes being processed by the single 
because if I don't drop the caches, both the single rados bench and the 4-copy rados bench
get much higher numbers (600 vs. 900) but still the single rados bench appears limited, no matter
how many concurrent ops are used.

Is there kind of throttling going on by design here?

-- Tom Deneau, AMD
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html