Number of Threads = 4
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 66060288
Offset = 96
The total memory requirement is 1512 MB
You are running each test 100 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 373160 microseconds
(= 373160 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 3039.9033 .3507 .3477 .5598
Scale: 3010.0351 .3518 .3511 .3522
Add: 4378.1162 .3625 .3621 .3631
Triad: 4427.0507 .3585 .3581 .3645
Sum of a is = 0.537150969556339130E+126
Sum of b is = 0.107430193907022657E+126
Sum of c is = 0.143240258538696412E+126
locking to cpu 0
locking to cpu 1
locking to cpu 2
locking to cpu 3