Hello Zhou, The performance improvement shows great according to your data. However, the data is not that enough to show the change of common use. Since the amount of 0/1 in a page will cause the vary of time used for compression, I would like you get more benchmark on different core with different amount of 0/1. On 10/22/2015 11:11 AM, "Zhou, Wenjian/???" wrote: > Hello Kumagai, > > I test it, and it works well. The following is the results. > > in virtual machine(memory 2G): > with empty memory: > version | num-threads | time(sec) > ----------------+---------------+------------- > devel | 0 | 12.76 > devel | 1 | 19.29 > devel | 2 | 11.56 > + this patch | 0 | 12.85 > + this patch | 1 | 5.61 > + this patch | 2 | 2.68 > > with full memory: full memory means memory is occupied by 1, without 0? > version | num-threads | time(sec) > ----------------+---------------+------------- > devel | 0 | 51.18 > devel | 1 | 57.82 > devel | 2 | 41.54 > + this patch | 0 | 49.25 > + this patch | 1 | 44.80 > + this patch | 2 | 33.87 > > > in real machine(memory 16G): > with empty memory: > version | num-threads | time(sec) > ----------------+---------------+------------- > devel | 0 | 86.12 > devel | 1 | 222.37 > devel | 8 | 81.50 > devel | 16 | 98.44 > + this patch | 0 | 86.07 > + this patch | 1 | 84.33 > + this patch | 8 | 14.95 > + this patch | 16 | 13.96 > > with full memory: > version | num-threads | time(sec) > ----------------+---------------+------------- > devel | 0 | 540.89 > devel | 1 | 715.25 > devel | 8 | 132.54 > devel | 16 | 112.89 > + this patch | 0 | 542.79 > + this patch | 1 | 538.22 > + this patch | 8 | 108.28 > + this patch | 16 | 107.83 > -- Regards Qiao Nuohan