At Speed Test

Design of the Power Switching Network
Ruixing Yang
15.01.2009
Outline
Power Gating implementation styles

Sleep transistor power network synthesis
Wakeup in-rush current control
Wakeup and sleep latency reduction
The presentation is based on the reference book (M. Keating, et al., Low Power
Methodology Manual for System-on-Chip Design, Springer, 2007. ) chapter 14. All the
contents and figures used here are referenced from the book chapter 14.
Power Gating challenges
g – effective for reducing

Power Gating g the leakage
g ppower
in standby or sleep mode.
However:
I) Overhead
Silicon area taken by the sleep transistors.
Routing resources for permanent and virtual power networks.
Complex power-gating design and implementation processes.
II) Power integrity issues.
IR drop on the sleep transistors
Ground bounce caused by in-rush wake up current.
III) Wakeup latency.
Ring vs. Grid Style
Coarse grain power gating can be implemented in either a ring or a grid style power network.
Ring based switching – place the switches externally to the power gated block effectively
encapsulating the block with a ring of switches.
Grid based switching – the sleep transistors are distributed throughout the power gated region.
Ring Style Sleep Transistor Implementation Grid Style Sleep Transistor

Implementation
Ring vs. Grid Style – cont.
Ring style implementation: Grid style implementation

Advantages:
Ad t Advantages:
Ad t
Has a less complex power plan than the 9 The switches in a grid network drive the
grid because of the separation of the virtual supply for the short distances
permanent power network and the virtual compared with ring-style implementation
power network. The sleep
p p transistors are
not mixed with other logic cells. 9 Requires fewer sleep transistors than the
ring-style impl. To achieve the same IR
Has little negative impact on placement drop target.
and routing in the standard cell area.
9 The permanent power supply is available
Good option for small blocks of logic where across the power-down
power down domain areas.
areas
the voltage drop across the switch
transistors and VVDD mesh can be 9 It provides somewhat better trickle charge
managed. distribution for management of in-rush
current.
Disadvantages:
9 Has less impact on the area of a power
Doesn’t support retention registers. gated block.
Add significant extra area cost compared Disadvantages:
to a grid approach.
9 Has impact on standard cell routing and
physical
h i l synthesis.
th i
9 Complexity is added to power routing.
More grid style impl. – Row and Column Grids

1.
1 Column
C l based
b d switching
it hi (fig.
(fi upper right),
i ht) employs
l
columns of switch cells spaced evenly across the
switched design.
Advantage: Each power switch only has to provide
power to a small segment
p g of the standard cell row
thereby minimizing any potential voltage drop.
Disadvantage: Impact the placement optimization,
limiting the flexibility of the standard cell placer.
2
2. Row based switching (fig.
(fig bottom right)
right).
Advantage: Optimal solution for distributed switching
since the potential impact on the placement engine is
limited.
Disadvantage: Impact routing resources in lower layer
metal, which can be avoided by column based
approach.
Selection of the implementation style

The best choice of the impl. depens on:
The design being implemented
The library being used and the type of switches available.
The technology being targeted and its specific leakage characteristics.
The performance and power goals for the design.
The use of the legacy or highly optimized IP.
Hybrid Style Implementation

The grid style is implemented at the top-level and ring style is applied to
certain power-gated hard macros and/or power domain blocks which
have no retention cells.
Advantage: Take use of the both implementation styles’ advantages.
Disadvantages:
g more complex power planning.g
Recommendations – Ring vs. Grid Style
1. For the design which implements retention cells, select grid style.
2. If no retention cells, check the area budget and the need for permanent
power supply
p pp y in the ppower-down areas for always-on
y buffers.
3. For the design which has power-gated hard macros, or blocks without
retention logic, select hybrid style.
4. For grid-style, use wide straps in permanent power network to reduce
IR drop.
drop
Header vs. Footer Switch
Header Switch: use a high VT pMOS transistor to control VDD.

Footer
oote Switch:
S tc use a high g VT nMOS
OS transistor
a s s o too co
control
o VSS.
SS
The selection decision is based on area cost, IR drop constraints, and system architectural issues.
1. Switch Efficiency Consideration
Definition: Switch Efficiency = ratio of drain current in the ON and OFF states (Ion/Ioff)
Total Leakage in the switch fabric is mainly determined by the switch efficiency.
90nm High VT pMOS Switch Efficiency 90nm high VT nMOS Switch Efficiency
at Normal Body Bias at Normal Body Bias
Header vs. Footer Switch – cont.
2. Area Efficiency Consideration and L/W Choice
The area efficiency depends on the size (L*W) and layout implementation of the sleep
t
transistors.
i t
Optimal L is determined by the switch efficiency and can be obtained from the switch
efficiency curve.
The switch efficiency decreases with the increase of W in pMOS transistors, therefore the
small W is preferred.
p
Figure shows us:
Ion linearly increases with W.
Ion/W becomes constant at
given L and Vbb -> the area
efficiency is determined by
the layout implementation of
the sleep transistors.
3. Body Bias Considerations
Applying reverse body bias on the sleep transistor can increase the switch
efficiency and reduce leakage significantly.
Cost for the reverse body bias in the header switch is significantly smaller than
in the footer switch.
Reason:
N-well of the pMOS transistor is readily available for bias tapping in the
standard CMOS p process. It can be tapped
pp to its own body y bias supply
pp y as long
g
as N-well of the sleep transistor has enough space from the surrounding
standard cells’ N-wells.
nMOS transistor does not have a well in the standard CMOS process. It is
necessaryy to create wells for nMOS sleepp transistors to allow separate
p body
y
bias. Æ higher chip fabrication cost and design complexity & more process
variations.
Conclusion: pMOS header is preferable in reverse body bias application.
4. System Level Design Consideration
In SoC designs, blocks usually communicate in the active-high interface
protocols referencing common ground (VSS) as logic “0”. In header switch
implementation, all signal nets in power-gated blocks are settled at Vss which is
convenient from system design perspective.
Header switch avoids p potential signal
g integrity
g y issues and header switch allows
a simple design of a pull-down transistor to isolate power-gated blocks and
clamp output signals at logic “0”.
5. Recommendations – Header vs. Footer
Area efficiencyy is main concern: nMOS,, which produces
p higher
g switch efficiency
y
and smaller transistor size. W should be chosen as large as possible for a given
cell height.
System level design and IP integration: header.
Header is more commonly used than footer in power-gating design currently.
Choice of sleep p transistor can be limited by
y the availability
y of the low-leakage
g
transistor in a given technology.
Minimum standby leakage is main concern: W should be chosen based on high
switch efficiency and hence low leakage.
W is obtained based on the investigation of area and leakage trade-off.
Rail vs. Strap VDD Supply
Sleep transistors get power supply from the permanent power network (VDD) and deliver it
to the virtual power network (VVDD). Two ways to distribute Vdd to the sleep transistors –
Rail
R il vs. St
Strap VDD supply.
l
1. Parallel Rail VDD Distribution
A VDD rail is added to a cell row in parallel with VVDD rail. The sleep transistor gets its
permanent power supply by connecting to VDD rails.
Advantages:
Permanent power supply rail is reachable throughout the design.
No restriction on the placement of cells which require connections to permanent power
supply.
Disadvantages:
Th implementation
The i l t ti takes
t k att least
l t one trace
t off routing
ti resources iin every row iin VDD railil
layer.
Incurs layer conflict with conventional standard library cells which use the metal 1 layer for
cell internal routing.
2. Power Strap VDD Distribution

Permanent power network is built in one or two top metal layers. The sleep transistors are
placed
l d under
d ththe straps
t off th
the coarse-grain
i network
t k and
d gett th
their
i VDD supply
l th
through
h via
i
pillars.
Advantages:
Allows the use of a normal standard cell library in a power-gating design.
Disadvantages:
Permanent power network no longer covers the design area.
- Place the cells which need permanent power supply (PPS) under the PPS network
(placement constraint)
- Power-routing the cells which need PPS (complicates the power-routing nets)
3. Recommendations for supply Distribution
If no available standard cell library which provides extra VDD rail, select power strap VDD.
If impact on routing resources is the main concern, select power strap VDD.
If th
there are a significant
i ifi t number
b off retention
t ti registers
i t in
i a design
d i andd power integrity
i t it iin
power-routing are the main concern, select parallel distribution.
A Sleep Transistor Example
Double row 90nm header switch cell.
60 small pMOS transistors of 0.55um
0 55um widthwidth.
6-row transistor array.
Normal body bias.
VSS is in the middle of the two rows
A pair of inverters that drive the sleep
t
transistors
i t is
i iimplemented
l t d iin th
the cellll ffor
area efficiency.
Wakeup Current and Latency Control
Methods
In power gating design, thousands of sleep transistors waking up simultaneous -> a very
large current in charging the design to a full power-on state -> IR drop -> functional error /
short
h t tterm VDD collapse
ll -> state
t t iin retention
t ti registers
i t andd memoriesi corrupted.
t d
Possible solution: control in-rush current by separating the chip power supply to many rows
and the power is turned on row by row. Disadvantage: crowbar currents -> IR drop. Not
practical in power gating design industry.
1. g Daisy
Single y Chain Sleepp Transistor Distribution
Turn on the sleep transistors gradually by configuring the sleep transistors in a daisy chain
style.
Advantages: simple design. Disadvantages: the short delay of the buffers in the chain
usually turns on the sleep transistors too quickly -> larger than acceptable in-rush current
during wakeup.
2. Dual Daisy Chain Sleep Transistor Distribution
U weak
Use k ttransistors
i t tto trickle
t i kl charge
h th
the d
design
i tto preventt llarge iin-rush
h current.
t
When the design is trickle charged close to VDD, large transistors of the optimal drive
strength are turned on.
Methods
The transistors are split into two chains: a weak transistor chain and main transistor chain.
Size of the weak trickle is defined by the user-defined in-rush current limit and maximum
permissible turn-on
turn on delay time
time.
Size of the sleep transistors in the main chain is optimized by the methods described for the
performance and leakage goals.
Trickle sleep transistors are to control wakeup rush current and reduce wakeup latency.
The main chain transistor design is based on meeting IR drop target and reducing sleep
transistor area.
Methods
3. Parallel Short Chain Distribution of the Main Sleep Transistor
Wakeup Latency = trickle charge time + turn on time of main chain
Reduce main chain turn time to reduce wakeup latency.
Single daisy chain -> longest time to charge up & small peak charge current.
Parallel array -> smallest delay & largest peak current
Compromise: Parallel short chain – sleep transistors are connected as a number of short daisy
chains
h i connected t d iin a parallel
ll l manner. Th
The short
h td daisy
i chains
h i are tturned
d on simultaneously
i lt l
when the main chain is turned on. -> The delay is shortened and peak current is controlled.
4. Main Chain Turn-on Control

When weak and main chain design g are fixed,, it is needed to determine the threshold to turn on
the main chain. Lower threshold -> turn on early & higher peak current.
5. Buffer Delay Based Main Chain Turn-on Control

Control the time to trickle charge the design to the required threshold. In real power-gating
design trickle charge is controlled by the buffer chain which turns on the weak transistors in
design,
sequence.
Summary
Power gating design style Ring vs. Grid

Implementation of Ring
Ring, Grid
Row vs. Column Grid
Hybrid Style
Header vs. Footer Switch
Switch
S it h efficiency
ffi i
Area efficiency
Body bias
System level design
R il vs. St
Rail Strap VDD supply
l
Parallel Rail vs. Power Strap
Wakeup Current and Latency Control Methods
Single Daisy Chain
Dual Daisy Chain
Parallel Short Chain Distribution of the Main Sleep Transistors
Main Chain Turn-on Control

At Speed Test

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

At Speed Test

Uploaded by

Copyright:

Available Formats

Design of the Power Switching Network

Power Gating implementation styles

g – effective for reducing

Ring Style Sleep Transistor Implementation Grid Style Sleep Transistor

Ring style implementation: Grid style implementation

More grid style impl. – Row and Column Grids

Selection of the implementation style

Hybrid Style Implementation

Recommendations – Ring vs. Grid Style

Header Switch: use a high VT pMOS transistor to control VDD.

2. Power Strap VDD Distribution

3. Recommendations for supply Distribution

4. Main Chain Turn-on Control

5. Buffer Delay Based Main Chain Turn-on Control

Power gating design style Ring vs. Grid

You might also like