High Performance Computing 1
Chip Architecture
•Use multiple Functional Units per processor
–Cray T90 has 2 track vector units; NEC SX4, Fujitsu VPP300 -- 8 track vector units
–superscalar e.g. IBM RS6000 Power2 uses 2 arithmetic units
•Need to provide data to multiple functional unit => fast memory access
•Limiting factors are memory-processor bandwidth
•