Jose Ildefonso Camargo Tolosa wrote:
Also, nowadays, Intel has better performance than AMD, at least when
comparing Athlon 64 vs Core2. I'm still saving to get a Phenom II
system in order to benchmark them and see how it goes (does anyone
have one of these for testing?).
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 3000.000
cache size : 512 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 4
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge
mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext
fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good
nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm
extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 6020.46
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate
STREAM compiled with -O3, single instance:
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out
-------------------------------------------------------------
STREAM version $Revision: 5.9 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 5031 microseconds.
(= 5031 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         5056.0434     0.0064     0.0063     0.0064
Scale:        4950.4916     0.0065     0.0065     0.0065
Add:          5322.0173     0.0091     0.0090     0.0091
Triad:        5395.1815     0.0089     0.0089     0.0089
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
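
As a sanity check on the single-instance run above, the memory figure and the rate column can be reproduced from the array size. The sketch below assumes STREAM's usual accounting: three arrays of N doubles, Copy and Scale touching two arrays per pass, Add and Triad touching three, and rate = 1e-6 * bytes / best time. The min times are the rounded values printed above, so the recomputed rates should only be expected to match to within about 1%.

```python
# Sanity-check the single-instance STREAM output (values copied from the run above).
N = 2_000_000   # "Array size" from the STREAM banner
WORD = 8        # bytes per DOUBLE PRECISION word, from the banner

# STREAM works on three arrays of N doubles each.
total_mib = 3 * N * WORD / 2**20

# Arrays read plus written per pass (assumed STREAM accounting).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column as printed, i.e. rounded to four decimals.
MIN_TIME = {"Copy": 0.0063, "Scale": 0.0065, "Add": 0.0090, "Triad": 0.0089}

# "Rate (MB/s)" column as printed.
REPORTED = {"Copy": 5056.0434, "Scale": 4950.4916,
            "Add": 5322.0173, "Triad": 5395.1815}


def approx_rate_mb_s(kernel):
    """Recompute the rate in MB/s (1 MB = 1e6 bytes) from the rounded min time."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * N * WORD
    return 1e-6 * bytes_moved / MIN_TIME[kernel]


if __name__ == "__main__":
    print(f"Total memory: {total_mib:.1f} MB")
    for kernel, reported in REPORTED.items():
        approx = approx_rate_mb_s(kernel)
        print(f"{kernel:6s} reported {reported:10.4f}  recomputed {approx:10.4f}")
```

The small differences between the two columns come from the four-decimal rounding of the printed min times, not from a different formula.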
Two instances in parallel:
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         2984.2741     0.0108     0.0107     0.0108
Scale:        2945.8261     0.0109     0.0109     0.0110
Add:          3282.4631     0.0147     0.0146     0.0149
Triad:        3321.2893     0.0146     0.0145     0.0148
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         2981.4898     0.0108     0.0107     0.0108
Scale:        2943.3067     0.0109     0.0109     0.0109
Add:          3283.8552     0.0147     0.0146     0.0149
Triad:        3313.9634     0.0147     0.0145     0.0148
Four instances in parallel:
root@p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out & ./a.out & ./a.out
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         1567.4880     0.0208     0.0204     0.0210
Scale:        1525.3401     0.0211     0.0210     0.0213
Add:          1739.7735     0.0279     0.0276     0.0282
Triad:        1763.4858     0.0274     0.0272     0.0276
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         1559.0759     0.0208     0.0205     0.0210
Scale:        1536.2520     0.0211     0.0208     0.0212
Add:          1740.4503     0.0279     0.0276     0.0283
Triad:        1758.4951     0.0276     0.0273     0.0279
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         1552.7271     0.0208     0.0206     0.0210
Scale:        1527.5275     0.0211     0.0209     0.0212
Add:          1737.9263     0.0279     0.0276     0.0282
Triad:        1757.3439     0.0276     0.0273     0.0278
-------------------------------------------------------------
Function    Rate (MB/s)   Avg time   Min time   Max time
Copy:         1515.5912     0.0213     0.0211     0.0214
Scale:        1544.7033     0.0210     0.0207     0.0212
Add:          1754.4495     0.0278     0.0274     0.0281
Triad:        1856.3659     0.0279     0.0259     0.0284
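
To compare the three runs, the per-instance rates can be summed into an approximate aggregate bandwidth. The sketch below does only that, using the figures reported above. It rests on one assumption: that the concurrent instances really overlapped for their best iterations. STREAM reports the best of 10 passes per instance, so the sum is an upper-bound style estimate rather than a measured total.

```python
# Per-instance STREAM rates in MB/s, copied from the runs above.
RATES = {
    1: {"Copy": [5056.0434], "Scale": [4950.4916],
        "Add": [5322.0173], "Triad": [5395.1815]},
    2: {"Copy": [2984.2741, 2981.4898], "Scale": [2945.8261, 2943.3067],
        "Add": [3282.4631, 3283.8552], "Triad": [3321.2893, 3313.9634]},
    4: {"Copy": [1567.4880, 1559.0759, 1552.7271, 1515.5912],
        "Scale": [1525.3401, 1536.2520, 1527.5275, 1544.7033],
        "Add": [1739.7735, 1740.4503, 1737.9263, 1754.4495],
        "Triad": [1763.4858, 1758.4951, 1757.3439, 1856.3659]},
}


def aggregate(instances, kernel):
    """Sum of the per-instance rates for one kernel (approximate total MB/s)."""
    return sum(RATES[instances][kernel])


def scaling(instances, kernel):
    """Aggregate rate relative to the single-instance rate."""
    return aggregate(instances, kernel) / aggregate(1, kernel)


if __name__ == "__main__":
    for kernel in ("Copy", "Scale", "Add", "Triad"):
        for n in (1, 2, 4):
            print(f"{kernel:6s} x{n}: {aggregate(n, kernel):10.4f} MB/s "
                  f"({scaling(n, kernel):.2f}x single)")
```

On these numbers the Copy aggregate comes to roughly 5966 MB/s with two instances and 6195 MB/s with four, against 5056 MB/s for one. The last Triad figure in the four-instance run has a noticeably lower min time than its siblings, so its contribution to the sum should be read with some caution.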
--
Sent via pgsql-performance mailing list (pgsql-performance@xxxxxxxxxxxxxx)