IBM Talks Power5 at Hot Chips Conference
Date 20-Aug-2003 17:34:59
Topic: Hardware News
As a few of you might know, IBM has presented its
POWER5 CPU and some of its specs at the Hot Chips Conference.
Read the article over at CBR online for a good in-depth report.
There's also a nice summary of the presentation over at Macrumors:
Hi guys, Exponent reporting here live from the HotChips
conference, and IBM is coming up right now with their
Talking about the simultaneous multithreading
implementation in Power5
On target to ship in 04, running in labs now,
started in 2000
What changed since power4 to get SMT in power5
3 fundamental probs: mem latencies (slow RAM),
branch processing, and execution unit utilization (20-25%
util usually seen in commercial code)
MT is designed to address these three issues
130nm, Cu, SOI
Dual processor core
8 way superscalar
-up to 2 virtual procs per real proc
-24% area growth per core for SMT
-"natural" extension to power4 design
-going beyond 2 threads per core diminishes returns
Review of different kinds of Multithreading:
-Single thread has low utilization
-coarse grain multithreading gives a second task to run
when utilization is low
-Fine grain threading: problems with long latencies when
resources are constrained
-SMT: no problems with constrained resources
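
To make the four flavors above concrete, here is a toy sketch (mine, not from the talk). Each list entry is how many instructions a thread could issue in that cycle, with 0 meaning the thread is stalled. The issue width of 4, the two made-up workloads, and all function names are assumptions for illustration only; this is not how Power5 actually schedules anything.

```python
# Toy model of issue-slot utilization under different threading schemes.
# WIDTH and the two workloads are made-up illustration values.
WIDTH = 4

def single(a, width=WIDTH):
    # Only thread A ever runs.
    return sum(min(width, x) for x in a)

def coarse(a, b, width=WIDTH):
    # Run thread A; use thread B only on cycles where A is stalled.
    return sum(min(width, x if x > 0 else y) for x, y in zip(a, b))

def fine(a, b, width=WIDTH):
    # Strictly alternate between the threads, one cycle each.
    return sum(min(width, x if i % 2 == 0 else y)
               for i, (x, y) in enumerate(zip(a, b)))

def smt(a, b, width=WIDTH):
    # Fill each cycle's slots with instructions from both threads.
    return sum(min(width, x + y) for x, y in zip(a, b))

def utilization(issued, cycles, width=WIDTH):
    return issued / (cycles * width)

thread_a = [1, 0, 2, 0, 1, 1, 0, 3]
thread_b = [2, 1, 0, 1, 0, 2, 1, 1]
cycles = len(thread_a)

results = {
    "single": single(thread_a),
    "coarse": coarse(thread_a, thread_b),
    "fine": fine(thread_a, thread_b),
    "smt": smt(thread_a, thread_b),
}
for name, issued in results.items():
    print(f"{name:6s} {utilization(issued, cycles):.0%}")
```

With these made-up numbers the single thread lands at 25% utilization and SMT at 50%. The model is deliberately crude: work a thread can't issue in one cycle is simply lost rather than carried over, so treat the figures as a picture of the idea, not a prediction.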
(tech stuff follows, coming too fast to type)
How do they do SMT? Add a second Program counter,
then register renaming had to be extended.
Group Completion Counter had been extended
Caches had associativity increased rather than size (more efficient)
Power5 gives dynamic feedback of shared resources,
and the machine takes action if resources are
constrained - net effect is increased parallelism and
There are situations where imbalanced execution
of threads is preferred - power5 gives more execution
slots to threads with higher priority.
(This kind of runs counter to the idea that with SMT you
get a second virtual processor for free due to resource
constraints, but they're IBM, and I'm not....)
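
One way to picture "more execution slots to threads with higher priority" is a weighted hand-out of cycles. The sketch below is a guess at the general idea only: the credit scheme, the function name `allocate_slots`, and the priority values are all hypothetical, and the talk did not say how Power5 actually does it.

```python
def allocate_slots(priorities, cycles):
    """Return how many cycles each thread is granted.

    Hypothetical credit scheme (not IBM's mechanism): every cycle each
    thread earns credit equal to its priority; the thread holding the
    most credit wins the cycle and pays back the sum of all priorities.
    Ties go to the lower-numbered thread.
    """
    total = sum(priorities)
    credit = [0] * len(priorities)
    granted = [0] * len(priorities)
    for _ in range(cycles):
        for i, p in enumerate(priorities):
            credit[i] += p
        winner = max(range(len(priorities)), key=lambda i: credit[i])
        credit[winner] -= total
        granted[winner] += 1
    return granted

print(allocate_slots([3, 1], 8))  # thread 0 has three times the priority
print(allocate_slots([1, 1], 8))  # equal priorities
```

With priorities 3 and 1 over 8 cycles the split comes out 6 to 2, and equal priorities split 4 to 4, so the share of slots simply follows the priority ratio.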
SMT is a mode that doesn't have to be entered into
(I think that's what they're saying)
SMT can make chip-bring-up (i.e. debugging the initial chip when it
comes back from the fab) a pain, so Power5 has goodies to help with this
Dynamic feedback helps things
Chip can dynamically switch between single threading and SMT
40% execution speed improvements for SMT - good return on 24% increase in area.
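
A quick back-of-the-envelope check on that "good return" claim, using only the two figures quoted (40% speedup, 24% area growth). Note the 24% was given per core, so applying it as if it were the whole cost is my simplifying assumption; the function name is made up.

```python
def perf_per_area(speedup_pct, area_growth_pct):
    # Ratio of relative performance to relative area, both vs. a baseline of 1.0.
    return (1 + speedup_pct / 100) / (1 + area_growth_pct / 100)

ratio = perf_per_area(40, 24)
print(f"performance per unit area vs. non-SMT core: {ratio:.3f}x")
```

That works out to roughly 1.13x performance per unit of core area, i.e. about 13% better than the non-SMT core on that measure.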
SMT impacts are pervasive throughout the chip.
AIX, Linux and OS/400 have been booted and running
End of presentation, taking questions
4 threads per real processor doesn't make any sense.
3 threads per real processor might make sense, but
making needed resource changes would be difficult
Didn't see benefit of trace cache.
Speaker "not prepared to comment" on how much power
went up, or what the increase of area due to SMT
End of questions