High Performance Computing 1
Scalability MatMat
• Assume B is distributed to all slaves
• Each element of C is the dot product of a row of A with a column of B
• To compute C, each element of C requires n multiplies and n-1 adds, and C has $n^2$ elements, so total work is $n^2 (n + [n-1]) = 2n^3 - n^2$
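
The operation count above can be checked with a small serial sketch. This is not a distributed implementation, and it is not taken from the slides: the function name `matmul_count` and the list-of-lists matrix layout are assumptions made for illustration. It forms each element of C as a row-by-column dot product and tallies the multiplies and adds as it goes.

```python
def matmul_count(a, b):
    """Multiply two n x n matrices (lists of lists) and count the operations.

    Hypothetical helper for illustration only.
    Returns (c, mults, adds).
    """
    n = len(a)
    c = [[0] * n for _ in range(n)]
    mults = 0
    adds = 0
    for i in range(n):
        for j in range(n):
            # Dot product of row i of A with column j of B:
            # the first term needs one multiply and no add.
            acc = a[i][0] * b[0][j]
            mults += 1
            # Each remaining term needs one multiply and one add.
            for k in range(1, n):
                acc += a[i][k] * b[k][j]
                mults += 1
                adds += 1
            c[i][j] = acc
    return c, mults, adds


n = 4
a = [[i + j for j in range(n)] for i in range(n)]
ident = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
c, mults, adds = matmul_count(a, ident)
print(mults, adds, mults + adds)  # 64 48 112
```

For n = 4 the total is 112, which matches 2n^3 - n^2 = 128 - 16. If the rows of A were split evenly across p slaves, each slave would perform roughly 1/p of this work, assuming B is already present on every slave as stated above.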