On Wed, 28 Sep 2011, mowang da wrote:
> Hi all,
>
> I used the ping_pong tool to test the following in Ceph:
> 1. Whether it supports coherent byte-range locks between cluster nodes
> 2. How fast it handles lock contention
> 3. Whether it supports coherent read/write IO between nodes
> 4. How fast it handles contended IO between nodes
> 5. Whether it supports coherent mmap between nodes
> 6. How fast the mmap coherence works
>
> The ping_pong code is here:
> http://junkcode.samba.org/ftp/unpacked/junkcode/ping_pong.c
>
> I started by running ping_pong on just one of the kernel client mount
> points, like this:
>
> ping_pong /mnt/test.dat 3
>
> It locks very fast:
>
> T02-OSD161:/usr/src/ping_pong# ./ping_pong /mnt/test.dat 3
> 1664 locks/sec
>
> But when I start a second copy of ping_pong on another kernel client
> node in my cluster, I get errors:
>
> T02-OSD186:/usr/src/getosd# ./ping_pong /mnt/test.dat 3
> lock at 0 failed! - Interrupted system call
> lock at 0 failed! - Interrupted system call
> lock at 0 failed! - Interrupted system call
> lock at 2 failed! - Interrupted system call
> lock at 1 failed! - Interrupted system call
> lock at 2 failed! - Interrupted system call
>
> Is this a bug in Ceph, or am I doing something wrong?

It's a bug. We've seen isolated reports, but didn't have an easy way to
reproduce it. This tool looks like exactly what we need! Added info to

	http://tracker.newdream.net/issues/1150

Thanks-
sage
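
For reference, here is a rough Python sketch of the byte-range locking pattern that the ping_pong test above exercises: hold a lock on one byte, take a blocking lock on the next byte, then release the previous one, cycling over N bytes. This is an illustration under assumptions, not the ping_pong.c source. The function names, the `iterations` parameter, and the failure counter are made up for the sketch, and the loop structure is an approximation of what the C tool does.

```python
import fcntl
import os
import time


def lock_range(fd, offset, length=1):
    """Take a blocking exclusive POSIX byte-range lock at offset."""
    fcntl.lockf(fd, fcntl.LOCK_EX, length, offset, os.SEEK_SET)


def unlock_range(fd, offset, length=1):
    """Release the byte-range lock at offset."""
    fcntl.lockf(fd, fcntl.LOCK_UN, length, offset, os.SEEK_SET)


def ping_pong(path, num_locks, iterations):
    """Walk a lock around num_locks bytes; return how many lock calls failed.

    Hypothetical helper for illustration. The real tool runs until killed
    and reports locks/sec periodically.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    failures = 0
    try:
        lock_range(fd, 0)  # start out holding byte 0
        i = 0
        start = time.monotonic()
        for _ in range(iterations):
            nxt = (i + 1) % num_locks
            try:
                lock_range(fd, nxt)  # blocks while another node holds it
            except OSError as exc:
                failures += 1
                print(f"lock at {nxt} failed! - {exc.strerror}")
            try:
                unlock_range(fd, i)  # hand the previous byte to the peer
            except OSError as exc:
                print(f"unlock at {i} failed! - {exc.strerror}")
            i = nxt
        elapsed = max(time.monotonic() - start, 1e-9)
        print(f"{iterations / elapsed:.0f} locks/sec")
    finally:
        os.close(fd)  # closing the descriptor drops any remaining locks
    return failures
```

Running something like `ping_pong("/mnt/test.dat", 3, 100000)` on two client nodes at once would approximate the two-node test. Recent Python versions may retry some interrupted system calls internally, so the C tool is likely the better reproducer for the "Interrupted system call" failures.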