High Performance Computing 1
BLAS on an IBM RS6000/590
BLAS 3
BLAS 2
BLAS 1
BLAS 3 (n-by-n matrix matrix multiply) vs
BLAS 2 (n-by-n matrix vector multiply) vs
BLAS 1 (saxpy of
n vectors)
Peak speed = 266 Mflops
Peak