High Performance Computing 1
Scalability MatMat
•Assume B distributed to all slaves
•Dot product of columns of B with rows of A
•To compute C, require n multiplies, n-1 adds, for each element of A, so total work is n2 *(n+[n-1]) = 2*n3 –n2