Originally posted by BillBroadley
View Post
Update: Looking at your results it seemed to me open64 compilers generate as efficient code as icc 11.1 so I grabed the actual build and ran a comparison here.
PII 955BE 3.2GHz NB 2GHz MEM 2xDDR1333 Unganged CL7
ICC 11.1
Code:
Function Rate (MB/s) Avg time Min time Max time Copy: 13223.4215 0.0026 0.0024 0.0150 Scale: 13261.3109 0.0025 0.0024 0.0090 Add: 13726.5011 0.0036 0.0035 0.0048 Triad: 13788.5482 0.0036 0.0035 0.0050
Code:
Copy: 8859.2560 0.0036 0.0036 0.0037 Scale: 8712.0426 0.0037 0.0037 0.0037 Add: 9541.0925 0.0050 0.0050 0.0051 Triad: 9749.9439 0.0056 0.0049 0.0111
Code:
Copy: 8820.2489 0.0041 0.0036 0.0055 Scale: 8544.5460 0.0041 0.0037 0.0054 Add: 9597.9497 0.0053 0.0050 0.0055 Triad: 9632.8513 0.0059 0.0050 0.0100
ICC 11.1
Code:
Function Rate (MB/s) Avg time Min time Max time Copy: 15116.3113 0.0023 0.0021 0.0089 Scale: 15794.0372 0.0021 0.0020 0.0029 Add: 15185.2913 0.0032 0.0032 0.0035 Triad: 15320.4925 0.0032 0.0031 0.0050
Code:
Function Rate (MB/s) Avg time Min time Max time Copy: 9624.1021 0.0054 0.0033 0.0216 Scale: 9329.0977 0.0035 0.0034 0.0035 Add: 10278.5823 0.0047 0.0047 0.0047 Triad: 10478.1197 0.0046 0.0046 0.0047
Code:
Function Rate (MB/s) Avg time Min time Max time Copy: 9445.3011 0.0034 0.0034 0.0034 Scale: 9297.4320 0.0035 0.0034 0.0035 Add: 10225.8529 0.0047 0.0047 0.0048 Triad: 10405.5505 0.0046 0.0046 0.0047
Leave a comment: