High Performance Computing 1
Matrix-vector
•read x(1:n) into fast memory
•  read y(1:n) into fast memory
•    for i = 1:n
•     read row i of A into fast memory
•     for j = 1:n
•    y(i) = y(i) + A(i,j)*x(j)
•write y(1:n) back to slow memory
•