Re: Performance on new 64bit server compared to my 32bit desktop

Jose Ildefonso Camargo Tolosa wrote:
> Also, nowadays, Intel has better performance than AMD, at least when
> comparing Athlon 64 vs Core2. I'm still saving to get a Phenom II
> system in order to benchmark them and see how it goes (does anyone
> have one of these for testing?).

root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# cat /proc/cpuinfo
processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 4
model name      : AMD Phenom(tm) II X4 940 Processor
stepping        : 2
cpu MHz         : 3000.000
cache size      : 512 KB
physical id     : 0
siblings        : 4
core id         : 0
cpu cores       : 4
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 5
wp              : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips        : 6020.46
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate


STREAM, compiled with -O3:

root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out
-------------------------------------------------------------
STREAM version $Revision: 5.9 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 5031 microseconds.
  (= 5031 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        5056.0434       0.0064       0.0063       0.0064
Scale:       4950.4916       0.0065       0.0065       0.0065
Add:         5322.0173       0.0091       0.0090       0.0091
Triad:       5395.1815       0.0089       0.0089       0.0089
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
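
For anyone checking the table above by hand, here is a minimal sketch of how the Rate and memory figures relate to the array size. It assumes the usual STREAM accounting (Copy and Scale touch two arrays, Add and Triad touch three, and 1 MB = 10^6 bytes for the rate). The helper names are made up for illustration and are not part of STREAM.

```python
# Constants copied from the STREAM banner above.
N = 2_000_000          # "Array size = 2000000"
BYTES_PER_WORD = 8     # "8 bytes per DOUBLE PRECISION word"

# Assumption: arrays read plus arrays written per kernel, as STREAM counts them.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}


def bytes_moved(kernel: str) -> int:
    """Bytes transferred by one pass of the given kernel."""
    return ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * N


def implied_time(kernel: str, rate_mb_s: float) -> float:
    """Seconds per pass implied by a reported rate (MB/s, 1 MB = 1e6 bytes)."""
    return bytes_moved(kernel) / (rate_mb_s * 1e6)


def total_memory_mib() -> float:
    """Footprint of the three arrays in units of 2**20 bytes."""
    return 3 * BYTES_PER_WORD * N / 2**20


if __name__ == "__main__":
    reported = {"Copy": 5056.0434, "Scale": 4950.4916,
                "Add": 5322.0173, "Triad": 5395.1815}
    for kernel, rate in reported.items():
        print(f"{kernel:6s} {implied_time(kernel, rate):.4f} s per pass")
    print(f"arrays: {total_memory_mib():.1f} MB")
```

The implied times should agree with the "Min time" column, since STREAM reports the rate of the best run.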

Two instances in parallel:
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out

-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2984.2741       0.0108       0.0107       0.0108
Scale:       2945.8261       0.0109       0.0109       0.0110
Add:         3282.4631       0.0147       0.0146       0.0149
Triad:       3321.2893       0.0146       0.0145       0.0148
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2981.4898       0.0108       0.0107       0.0108
Scale:       2943.3067       0.0109       0.0109       0.0109
Add:         3283.8552       0.0147       0.0146       0.0149
Triad:       3313.9634       0.0147       0.0145       0.0148
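
When several instances write to the same terminal, the result tables come out back to back as above. A rough sketch of pulling the rates out of such interleaved output follows. The function name and the regular expression are illustrative assumptions, and it only expects the "Function / Rate" row layout shown in this post.

```python
import re
from collections import defaultdict

# Matches result rows such as:
# "Copy:        2984.2741       0.0108       0.0107       0.0108"
ROW = re.compile(r"^(Copy|Scale|Add|Triad):\s+([0-9.]+)\s+[0-9.]+\s+[0-9.]+\s+[0-9.]+\s*$")


def parse_stream_rates(text: str) -> dict:
    """Return {kernel: [rate, ...]} in order of appearance, one rate per instance."""
    rates = defaultdict(list)
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            rates[match.group(1)].append(float(match.group(2)))
    return dict(rates)


# Sample: the two result tables printed by the two parallel instances above.
SAMPLE = """\
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2984.2741       0.0108       0.0107       0.0108
Scale:       2945.8261       0.0109       0.0109       0.0110
Add:         3282.4631       0.0147       0.0146       0.0149
Triad:       3321.2893       0.0146       0.0145       0.0148
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2981.4898       0.0108       0.0107       0.0108
Scale:       2943.3067       0.0109       0.0109       0.0109
Add:         3283.8552       0.0147       0.0146       0.0149
Triad:       3313.9634       0.0147       0.0145       0.0148
"""

if __name__ == "__main__":
    for kernel, values in parse_stream_rates(SAMPLE).items():
        print(kernel, values)
```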


Four instances in parallel:
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out & ./a.out & ./a.out

-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        1567.4880       0.0208       0.0204       0.0210
Scale:       1525.3401       0.0211       0.0210       0.0213
Add:         1739.7735       0.0279       0.0276       0.0282
Triad:       1763.4858       0.0274       0.0272       0.0276
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        1559.0759       0.0208       0.0205       0.0210
Scale:       1536.2520       0.0211       0.0208       0.0212
Add:         1740.4503       0.0279       0.0276       0.0283
Triad:       1758.4951       0.0276       0.0273       0.0279
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        1552.7271       0.0208       0.0206       0.0210
Scale:       1527.5275       0.0211       0.0209       0.0212
Add:         1737.9263       0.0279       0.0276       0.0282
Triad:       1757.3439       0.0276       0.0273       0.0278
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        1515.5912       0.0213       0.0211       0.0214
Scale:       1544.7033       0.0210       0.0207       0.0212
Add:         1754.4495       0.0278       0.0274       0.0281
Triad:       1856.3659       0.0279       0.0259       0.0284
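
To compare the three runs, the per-process rates can be summed into an approximate aggregate bandwidth. This is only a sketch: the instances were started by hand and are not synchronized, so adding their best-case rates assumes they overlapped for the whole measurement, which may overstate the total slightly. The numbers are copied from the tables above; the names are illustrative.

```python
# Per-process rates in MB/s, copied from the STREAM tables in this post.
RATES = {
    1: {"Copy": [5056.0434], "Scale": [4950.4916],
        "Add": [5322.0173], "Triad": [5395.1815]},
    2: {"Copy": [2984.2741, 2981.4898], "Scale": [2945.8261, 2943.3067],
        "Add": [3282.4631, 3283.8552], "Triad": [3321.2893, 3313.9634]},
    4: {"Copy": [1567.4880, 1559.0759, 1552.7271, 1515.5912],
        "Scale": [1525.3401, 1536.2520, 1527.5275, 1544.7033],
        "Add": [1739.7735, 1740.4503, 1737.9263, 1754.4495],
        "Triad": [1763.4858, 1758.4951, 1757.3439, 1856.3659]},
}


def aggregate(processes: int) -> dict:
    """Sum the per-process rates for each kernel (approximate total MB/s)."""
    return {kernel: sum(values) for kernel, values in RATES[processes].items()}


def scaling(processes: int) -> dict:
    """Aggregate rate relative to the single-process run."""
    base = aggregate(1)
    return {kernel: total / base[kernel]
            for kernel, total in aggregate(processes).items()}


if __name__ == "__main__":
    for n in (1, 2, 4):
        totals = aggregate(n)
        row = "  ".join(f"{k} {v:8.1f}" for k, v in totals.items())
        print(f"{n} proc: {row}")
```

Summed this way, Copy goes from about 5056 MB/s with one process to about 5966 MB/s with two and about 6195 MB/s with four, and Triad from about 5395 MB/s to about 6635 MB/s and about 7136 MB/s.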



--
Sent via pgsql-performance mailing list (pgsql-performance@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance
