Professional Documents
Culture Documents
the p-type and the n-type transistors of a gate are the 2.9
X n X
n 2.65
q
2-input NOR
P P
3-input NOR
p
18
timal size varies linearly with the size of the load driven
by the gate. The power optimal size varies inversely 8
n-transistor. SPICE simulations were done using 1.2 Figure 3: Power optimal sizes for NOR gates compared
technology and varying the fanout load and input tran- to that of inverter
sition time for both rising and falling transitions. The
results verify the above observations [2]. 2.3 Power-delay product
Let us consider the circuit of gure 1 again. The delay
Table 1: Power optimal sizes and corresponding power of the gate g1 is aected by increasing the size of the
savings (SPICE) transistor. It is apparent from equation 5 that the delay
of the circuit decreases with increase in the transistor
Fan-out Pow-opt size Power reduction size. The power-delay product with transistor sizing is
p,n from min-sized given by equation 12 which, when simplied with respect
2 4 , 3 1.39% to W1 is of the form shown in equation 13.
5 8 , 4 15.57% 2(
Pn W )2 C
10 14 , 6 35.35% PD = P1 + kW1 + k W 0
W1 + k2tin
i =2 i L
20 24 , 10 57.82% 1
(12)
We sized both the p-transistor and the n-transistor to- PD = A + B W1 + W + W 2C D (13)
gether to obtain the overall power optimal conguration 1
for the circuit. Table 1 shows the results of sizing both 1
the gate.
2.2 Extension to general CMOS gates
Power-delay product
based delay model for series connected MOSFETS [13, Figure 4: Comparison of our power-delay model with
14, 3]. The problem is commonly framed as that of nd- SPICE
ing the equivalent width of a single inverter that would We computed the coecient in equation 13 using four
result in the same delay as the chain of MOSFETs. The sample points from SPICE simulation results for an in-
authors of [13] also considered the power consumption verter with a load of ve uniformly sized inverters. The
of the gate. In simple terms the equivalent width of N - predicted power-delay curve using our model is plot-
series connected transistors for the same delay as that of ted against the actual power-delay curve obtained from
a single inverter with width W is approximately N W . SPICE simulation in gure 4. The accuracy of our model
Using this equivalent inverter model in the derivation of with respect to SPICE results is apparent from the two
equation 8 we get the following expression for the power- curves.
optimal size of a gate with N series connected MOSFETs The right hand side of equation 13 is a convex function
in terms of the power optimal size for an inverter with of W1 and attains a global minimumfor certain W1 . The
the same load: value of W1 for which this function attains a minimum is
WN = NW1 (11) the power-delay optimal size of the transistor for a given
30 30
p-transistor power
n-transistor delay
power-delay
25 25
20 20
15 15
A B C
10 10
5 5
pow_opt pow_delay_opt
0 0
1 2 3 4 5 6 7 8 9 10 2 4 6 8 10 12 14
size of load (# of inverters) transistor size
Figure 5: Variation of power-delay optimal size with load Figure 7: Sample curves for power, delay and power-
delay
load. The value of the power-delay optimal size of a
transistor is much larger than the value of power optimal However, if the transistor is in region C, then its yield
size for the same load. The variation of the power-delay in speed is less than the increase in power consumption.
optimal sizes for the p and n-transistor with various load Hence, among the transistors along the critical paths,
sizes, as obtained from SPICE simulations is shown in any transistor in region B should be given preference
gure 5. The power-delay optimal sizes also vary linearly over a transistor in region C while sizing. On the other
with load. Figure 6 shows the variation of the power- hand, if we need to reduce the size of a transistor in re-
delay optimal size with dierent input transition times. gion B, we should choose the one with the slowest rate
of decrease in the power-delay product, because that is
lying on the least benet curve.
When the demand on the speed is very aggressive, the
28
input-r/f=1nS
22
20
delay curve tells the tradeo between the power and de-
lay while sizing the transistor. Given two transistors in
18
region C, the transistor with a smaller power-delay slope
16
should be given preference for sizing as compared to a
transistor with a steeper power-delay slope.
3 Power-delay driven transistor
14
12
10
8
sizing
Based on the above observations, we suggest the follow-
1 2 3 4 5 6 7 8 9 10
size of load (# of inverters)
Figure 6: Variation of power-delay optimal size with in- ing transistor sizing heuristic for power minimization of
a circuit under a given delay constraint.
put transition time (p-transistor)
The relation among the curves for delay(equation 5),
3.1 Power-optimal initial sizing
We propose that all the transistors in a circuit should be
power(equation 7) and power-delay(equation 13) are rep- sized to their power optimal sizes given by equations 9
resented graphically in gure 7. When the size of the and 10. This will give the minimum power congura-
transistor is below the power optimal size, the power tion for the layout with respect to transistor sizing, and
and delay both decrease with an increase in the transis- any further sizing of transistors (enlarging or reducing)
tor size. As a result the power-delay product decreases would only increase the power consumption of the cir-
very fast (region A of gure 7). Beyond the power opti- cuit. This strategy may be applied as the initial transis-
mal size the power consumption starts to increase with tor sizing heuristic; the transistors in a critical path may
the increase in transistor size while the delay of the gate be sized further for speed.
still keeps decreasing resulting in a slower decrease in the While a transistor in a gate is being sized to power op-
power-delay product (region B), and reaches the power- timal size, the increase in its size is re
ected in the load
delay optimal size. Beyond this size the power-delay seen by the gate in the previous stage. The power opti-
product starts increasing (region C) because the rate of mal size of the gates in the previous stage are determined
increase in power becomes more than the rate of decrease by the sizing at the current stage. Thus, the power opti-
in delay. mal sizing should be done in a breadth-rst(BF) traver-
Let us analyze the implication of the three regions sal order, starting with the gates generating the primary
in gure 7 with regards to a transistor sizing heuristic. outputs. A simple algorithm for power optimal sizing is
When the transistors are in region A, we are in a win- given in gure 8.
win situation in terms of power and delay. Therefore, all
transistors should be sized to at least up to the power 3.2 Power optimization under delay con-
optimal size. In region B, the reduction in delay is more
than the increase in power. Therefore, we can protably
straint
Our optimization method starts with a power minimal
size a transistor in region B. The steeper the curve in re- layout conguration and attempts to proceed along a
gion B, the more prot we get by sizing the transistor up. power optimal path to meet the required delay based
Algorithm power optimal initial sizing() Algorithm power-delay-opt()
compute load(); begin
todo list = [gates driving the primary outputs]; perform power optimal initial sizing();
while (todo list <> ) do perform delay analysis and nd critical path;
g = remove head(todo list); while 9 crit path delay > target +
g.p size = psizefunc(g,load); for each transistor 2 crit path
g.n size = nsizefunc(g,load); if (size < pd optimal size)
ist = g.fan in; size=pd optimal size();
for (f 2
ist) do end-for;
compute delay along crit path;
mark visited(f,g); if crit path delay > target +
update load(f,g); /* increase transistor size */
if (all gates in f.fan out are visited) then while crit path delay > target +
todo list = todo list + [f] ; nd transistor with minimum
end (for loop) power-delay slope;
end (while loop) increase size by one unit;
end-while;
else if crit path delay < target -
Figure 8: The breadth-rst algorithm for power optimal /* decrease transistor size */
sizing while crit path delay < target -
nd transistor with minimum
on the power-delay characterization in equation 12. The power-delay slope;
basic algorithm is given in gure 9. decrease size by one unit;
The initial transistor sizing is performed using the end-while;
power optimal transistor sizing algorithm given in g- end-if;
ure 8. If the power minimal layout satises the required recompute delays and nd crit path;
delay, we have produced a layout with minimum power end-while;
consumption and the algorithm returns. The algorithm, end;
after detecting the critical paths using a technique simi- Figure 9: Main algorithm for transistor sizing to opti-
lar to PERT [12], size the transistors on the critical path mize power under a delay constraint
to their power-delay optimal sizes (pd optimal size()),
from the gates driving primary outputs towards the pri- module generator, Per
ex[8], which uses transistor siz-
mary inputs, re
ecting the change in the load capaci- ing and input reordering to generate high performance
tance due to increase in transistor sizes. If a transistor layouts. We selected a few arithmetic circuits to test our
on the critical path is already sized beyond the power- power optimal transistor sizing heuristic. The primary
delay optimal size (because it belongs to another critical outputs of the circuit is assumed to have only one extra
path), it is left unchanged. The delay obtained in this fan-out gate in the next module, similarly, the primary
conguration may be smaller than the requirement. If inputs are assumed to drive only one gate in the cir-
that is the case, then we have to reduce the size for some cuit. Therefore, the primary inputs driving more than
of the transistors along the path to reduce the power dis- one gate are derived from buers which are subjected to
sipation. The transistors which have the slowest slope power optimal sizing.
on the power-delay curve are chosen for this purpose { The layouts were rst generated with all the tran-
the reason being that in this way we reduce the size of sistors maintained at their minimum width (3) and
the transistors that were least protable in power-delay the average power consumption for 100 random uniform
product. input patterns were computed using SPICE. Next, all
If the delay of the power-delay optimal layout is still the transistors in the high fan-out gates were sized to
more than the requirement, we have to size some of the their corresponding power optimal sizes using our heuris-
critical path transistors further. The transistors with tic. The same inputs were used to compute the average
the least slope on the power-delay curve are considered power consumption of the power optimal layout. Table 2
for further sizing. Once again, the reason being that
the transistors with steeper power-delay slope consume
relatively more power for the same amount of delay re- Table 2: The eect of power optimal transistor sizing on
duction. the test circuits
Algorithm power optimal initial sizing() requires lin-
ear time on the number of transistors. The outer while Circuit # trans- % gates Max tran. Power saving
loop in power-delay-opt() is executed only once for each name istors sized width compared to
critical path. Each iteration of the rst inner while loop (p,n) min-wid
increases the size of one transistor by one unit while each SDA3 320 23 14 , 7 15.3%
iteration of the second inner while loop decreases the HALU 280 15 9 , 4 5%
size by one unit. Computing the delays and critical path CLA4 244 19 10 , 5 5.2%
requires linear time on the number of transistors. Con-
sidering the worst case situation, the algorithm requires
O(N T ) time, where N is the number of transistors presents the power saving obtained for each of the cir-
in the circuit and T is the size of the largest transis- cuits. Note that SDA3, which has high percent of high-
tor. In other words, the running time of the algorithm fanout gates (about 20%) produced a signicant saving
is proportional to the total active area of the layout. (more than 15%) in overall power consumption. HALU
4 Preliminary results and CLA4, with fewer high fan-out gates represented a
smaller saving.
We implemented the power optimal transistor sizing Next, we considered the power optimization subjected
heuristic based on equations 9 and 10 to perform ini- to delay constraint. Work is in progress for developing
tial transistor sizing for our in-house performance driven an automatic layout tool for our heuristic. Presently,
we have used the power-delay optimal sizes from SPICE [3] H. Y. Chen and S. Dutta. A Timing Model for Static
simulations at various loads to manually size the tran- CMOS Gates. In Proceedings of ICCAD, pages 72{
sistors in the critical path. We used an 8-bit and a 75, 1989.
16-bit ripple carry adder (RCA) for this experiment.
First, we generated the layouts for the circuits using [4] H. Y. Chen and S. M. Kang. iCOACH : A Circuit
only minimum-sized transistors and computed the power Optimization Aid for CMOS High-Performance Cir-
consumption and the critical path delay. Then we sized cuits. In Proceedings of ICCAD, pages 372{375, Nov
all the transistors along the critical path to their corre- 1988.
sponding power-delay optimal sizes by hand. The power
consumption and critical path delay of the circuit were [5] J. P. Fishburn and A. E. Dunlop. TILOS: A Posyn-
computed using SPICE simulations. imial Programming Approach to Transistor Sizing.
In Proceedings of ICCAD, pages 326{328, Nov 1985.
Table 3: Power optimization subject to delay constraint [6] N. Hedenstierna and K. O. Jeppson. CMOS Circuit
method avg. power critical power-delay Speed and Buer Optimization. IEEE Trans. on
used (mW) delay(nS) product CAD, CAD-6(2):270{281, Mar 1987.
8-bit RCA
min-sized 0.9596 15.385 14.769 [7] B. Hoppe, G. Neuendorf, and D. Schmitt-
pow-dly opt 1.0045 10.831 10.879 Landsiedel. Optimization of High-Speed CMOS
16-bit RCA Logic Circuits with Analytical Models for Signal
min-sized 1.4435 32.106 46.345 Delay, Chip Area and Dynamic Power Dissipation.
pow-dly opt 1.5673 24.421 38.275 IEEE Trans. on CAD, 9(3):236{246, Mar 1990.
The results of the experiment are shown in table 3. [8] S. Kim, R. M. Owens, and M. J. Irwin. Experi-
The second column presents the average power consump- ments with a Performance driven layout generator.
tion for 100 random input patterns (30nS cycle time for In Proceedings of DAC, pages 687{690, June 1992.
RCA-8 and 40nS for RCA-16) and the third column rep-
resents the worst case delay through the circuit. The [9] U. Ko and P. T. Balsara. Short-Circuit Power
power-delay optimal sizing resulted in a much smaller Driven Gate Sizing Technique for Reducing Power
power-delay product for both the circuits as compared Dissipation. IEEE Trans. on VLSI Systems. to ap-
to the minimum-sized gates. This initial experiment il- pear.
lustrates that the overall power-delay product of a circuit
can be minimized by minimizing the power-delay prod- [10] C. M. Lee and H. Soukup. An Algorithm for CMOS
uct of the individual gates using our heuristics. Thus, timing and area optimization. IEEE Journal of
using our power-delay optimization based transistor siz- Solid-State Circuits, SC-19(5):781{787, Oct 1984.
ing heuristic it is possible to meet the delay requirement
with minimal power consumption. [11] D. Marple. Transistor Size Optimization in the Tai-
5 Conclusions lor Layout System. In Proceedings of DAC, pages
43{48, 1989.
We formulated the problem of transistor sizing for min- [12] J. J. Moder and C. R. Phillips. Project Management
imizing the power consumption of a circuit under delay
constraint. Based on accurate analysis of short circuit with CPM and PERT. NY: Van Norstand, 1970.
power consumption and gate delay in a sub-circuit con-
taining drivers and loads together, we obtained power [13] A. Nabavi-Lishi and N. C. Rumin. Inverter Models
optimal transistor sizes and hence the minimum power of CMOS Gates for Supply Current and Delay Eval-
conguration for a layout with respect to transistor siz- uation. IEEE Trans. on CAD, 13(10):1271{1279,
ing. The analysis of power-delay product of a gate in Oct 1994.
combination with the load devices led us to a new ap-
proach for computing the transistor sizes to minimize the [14] T. Sakurai and A. R. Newton. Delay Analysis of
power consumption of a circuit meeting the given delay Series-Connected MOSFET Circuits. IEEE Journal
requirement. The preliminary results from real circuits of Solid-State Circuits, 26(2):122{131, Feb 1991.
are encouraging. The authors are presently developing
a module generator to perform transistor sizing for min- [15] S. S. Sapatnekar, V. B. Rao, P. M. Vaidya, and
imizing power consumption of a circuit subject to delay S. M. Kang. An Exact Solution to the Transis-
constraint automatically. tor Sizing Problem for CMOS Circuits Using Con-
vex Optimization. IEEE Transactions on CAD,
References 12(11):1621{1634, Nov 1993.
[16] J. M. Shyu, A. Sangiovanni-Vincentelli, J. P. Fish-
[1] M. Borah, M. J. Irwin, and R. M. Owens. Minimiz- burn, and A. E. Dunlop. Optimization-Based Tran-
ing Power Consumption of Static CMOS Circuits by sistor Sizing. IEEE Journal of Solid State Circuits,
Transistor Sizing and Input Reordering. In Proceed- 23(2):400{409, April 1988.
ings of International Conference on VLSI Design,
Jan 1995. [17] H. J. M. Veendrick. Short-circuit dissipation of
[2] M. Borah, R. M. Owens, and M. J. Irwin. Tran- static CMOS circuitry and its impact on the de-
sign of buer circuits. IEEE Journal of Solid-State
sistor Sizing in Static CMOS Circuits for Minimiz- Circuits, SC-19:468{483, Aug 1984.
ing Power. Technical Report CSE-94-065, Dept. of
CSE, The Pennnsylvania State University, Decem-
ber 1994.