High Performance Computing 1
Communication Model
•Speed = k/tcomm 
•Actual << Theoretical hardware limit advertised
•Consequences
–Send messages in blocks -- avoid small single messages
–Arrange data distributions to get nearest neighbor communications e.g. use ring shift with direct neighbors