Main components are multiple ALUs and FPUs, and data and instruction caches.
Superscalar, since the ALUs and FPUs can operate in parallel, producing more than one result per cycle.
e.g. IBM POWER2: two FPUs and two ALUs that can all operate in parallel, producing up to 4 results per cycle if operands are in registers.
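
The idea of several functional units each producing a result in the same cycle can be illustrated with a toy issue model. This is only a sketch under simplifying assumptions that are not from the notes: every operation is independent, every operand is already in a register, each operation completes in one cycle, and operations issue in program order. The function name cycles_needed and the labels "int" and "fp" are hypothetical, and the model is not a description of how any real processor schedules instructions.

```python
from collections import deque


def cycles_needed(ops, n_alu=2, n_fpu=2):
    """Count cycles to issue a stream of independent ops in program order.

    ops   : sequence of "int" or "fp" labels (hypothetical op kinds)
    n_alu : number of integer units available per cycle (assumed)
    n_fpu : number of floating-point units available per cycle (assumed)

    Each cycle issues ops from the front of the stream until the next op
    needs a unit type that is already fully occupied in this cycle.
    """
    queue = deque(ops)
    cycles = 0
    while queue:
        free = {"int": n_alu, "fp": n_fpu}  # units free at start of cycle
        while queue and free[queue[0]] > 0:
            free[queue.popleft()] -= 1      # issue op to a free unit
        cycles += 1
    return cycles


mixed = ["int", "fp", "int", "fp"] * 2      # 8 independent ops
print(cycles_needed(mixed, n_alu=2, n_fpu=2))  # 4 ops issued per cycle
print(cycles_needed(mixed, n_alu=1, n_fpu=1))  # 2 ops issued per cycle
```

With two units of each kind, the mixed stream of 8 operations issues 4 per cycle, matching the "up to 4 results per cycle" peak in the example above. A stream of only floating-point operations would be limited to 2 per cycle in this model, which is why the peak is an upper bound rather than a typical rate.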