The Floating-Point Units

The G4e’s very straightforward floating-point implementation has a single FPU that takes a minimum of five cycles to finish executing the fastest floating-point instructions. (Some instructions take many more cycles.) The FPU is served by 48 microarchitectural floating-point registers (32 registers for the PPC ISA and 16 additional rename registers). Finally, single- and double-precision floating-point operations take the same amount of time.

The 970’s floating-point implementation is almost exactly like the G4e’s, except there’s twice as much hardware. The 970 has two identical FPUs, each of which can execute the fastest floating-point instructions (like the fadd) in six cycles. As with the G4e, single- and double-precision ...

Get Inside the Machine now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.