High Performance Computing 1
Superscalar Processor
•Remedies:
–Blocked algorithms,
•do I=1,1000 do j=1,20
–a(I)=…. do i=(j-1)*50,j*50
–                                                  a(i)=....
– contiguous storage
– avoid strides and random/non-deterministic access
•a(ix(i)) = ...
–