VLSI Design SoC CH 6
VLSI Design SoC CH 6
VLSI Design SoC CH 6
In Out
Combinational Combinational
In Logic Out Logic
Circuit Circuit
State
Combinational Sequential
3
A static CMOS gate is a combination of two
networks, called the pull-up network (PUN) and the
pull-down network (PDN) (Figure next ). The figure
shows a generic N input logic gate where all inputs are
distributed to both the pull-up and pull-down networks.
6
Complementary logic gate as a
combination of a PUN (pull-up network)
and a PDN (pull-down network).
The PDN is constructed using NMOS devices, while
PMOS transistors are used in the PUN. The primary
reason for this choice is that NMOS transistors produce
“strong zeros,” and PMOS devices generate “strong
ones”.
A set of construction rules can be derived to construct logic functions
A B
X Y Y= X if AANDB =A+ B
X B Y= Xif AORB= AB
Y
10
Using De Morgan’s theorems,
F D A .( B C )
Deriving the pull-up network
hierarchically by identifying sub-
nets
Example 6.2
Noise Margins are input-pattern dependent.
For the above example, a glitch on only one
of the two inputs has a larger chance of
creating a false transition at the output than
when the glitch would occur on both inputs
simultaneously. Therefore, the former
condition has a lower low noise margin.
The propagation delay depends upon the input
patterns
The output of this network is high, if and only if both inputs A and B
are low. The worst-case pull-down transition happens when only one
of the NMOS devices turns on (i.e., if either A or B is high). Assume
that the goal is to size the NOR gate such that it has approximately
the same delay as an inverter with the following device sizes:
NMOS 0.5µm/0.25µm and PMOS 1.5µm/0.25µm.
Since the pull-down path in the worst case is a
single device, the NMOS devices (M1 and M2) can
have the same device widths as the NMOS device in
the inverter. For the output to be pulled high, both
devices must be turned on. Since the resistances add,
the devices must be made two times larger compared to
the PMOS in the inverter (i.e., M3 and M4 must have a
size of 3µm/0.25µm). Since PMOS devices have a
lower mobility relative to NMOS devices, stacking
devices in series must be avoided as much as
possible. A NAND implementation is clearly
preferred over a NOR implementation for
implementing generic logic.
Rp Rp Rp
2 A B 2 4 B
Rn Rp Cint
CL 4
2 A
B
Rn Rn Rn CL
2 Cint
1
A A B 1
26
Transistor Sizing a Complex CMOS Gate
B 8
A 4
C 8
D 4
OUT = D + A • (B + C)
A 2
D 1
B 2 C 2
27
Ignoring the internal node capacitances, while
analysing propagation delay for a first order analysis is
a reasonable assumption. However, in more complex
logic gates that have large fan-in, the internal node
capacitances can become significant. Consider a 4-input
NAND gate as shown in figure below
The internal capacitances consist of the junction
capacitance of the transistors, as well as the gate-to-source
and gate-to-drain capacitances. The latter are turned into
capacitances to ground using the Miller equivalence.
1. Transistor Sizing
Not all inputs of a gate arrive at the same time (due, for
instance, to the propagation delays of
the preceding logical gates). An input signal to a gate is
called critical if it is the last signal of
all inputs to assume a stable value. The path through the
logic which determines the ultimate
speed of the structure is called the critical path.
Fast Complex Gates:
Design Technique 2
• Transistor ordering
charged 01
In3 1 M3 CL In1 M3 CLcharged
36
Putting the critical-path transistors closer to the output of
the gate can result in a speedup.
This is demonstrated in figure above. Signal In 1 is
assumed to be a critical signal. Suppose further that In 2
and In3 are high and that In1 undergoes a 0->1 transition.
Assume also that CL is initially charged high. In case (a),
no path to GND exists until M1 is turned on, which is
unfortunately the last event to happen. The delay between
the arrival of In1 and the output is therefore determined
by the time it takes to discharge CL, C1 and C2. In the
second case,
C1 and C2 are already discharged when In1 changes. Only
CL still has to be discharged, resulting in a smaller delay.
4. Logic Restructuring
We size the 2-input NAND and NOR such that their equivalent
resistances equal the resistance of the inverter
gNAND = 4/3
gNOR = 5/3
Example 6.5
The total delay of a path through a combinational
logic block can now be expressed as
51
The dynamic power dissipation is given by
α0->1 CL V2dd f, where α0->1 is the switching activity which
has two components: a static component that is only a
function of the topology of the logic network, and a
dynamic one that results from the timing behavior of the
circuit—the latter factor is also called glitching.
Initially, all the outputs are 1 since one of the inputs was 0. For this
particular transition (i.e. ‘1’ at the other input simultaneously), all the
odd bits must transition to 0 while the even bits remain at the value of 1.
However, due to the finite propagation delay, the higher order even
outputs start to discharge and the voltage drops. When the correct input
ripples through the network, the output goes high. The glitch on the
even bits causes extra power dissipation beyond what is required to
strictly implement the logic function.
Glitching in a chain of NAND
Gates
Reducing Switching Activity
1. Logic Restructuring
Chain implementation will have an overall lower switching
activity than the tree implementation for random inputs.
However the tree topology will have lower (no) glitching
activity since the signal paths are balanced to all the gates.
Below are two alternate implementations of
F= A.B.C.D
Assume that all primary inputs (A,B,C,D) are
uncorrelated and uniformly distributed (i.e.,
p1 (a,b,c,d)= 0.5)
2. Input ordering
The nominal high output voltage (VOH) for this gate is VDD
since the pull-down devices are turned off when the output is
pulled high (assuming that VOL is below VTn). On the other
hand, the nominal low output voltage is not 0 V since there is
a fight between the devices in the PDN and the grounded
PMOS load device. This results in reduced noise margins and
more importantly static power dissipation.
C = 2.5V C = 2.5 V
M2
A = 2.5 V A = 2.5 V B
Mn
B
CL M1
1) Charge Leakage
If the pull-down network is off, the output should ideally remain
at the precharged state of VDD during the evaluation phase.
However, this charge gradually leaks away due to leakage
currents, eventually resulting in a malfunctioning of the gate.
Sources of leakage are reverse biased
diodes and sub-threshold leakage of
the devices M1 and Mp
2
Out1
Voltage
1
Clk
0
In Out2
-1
0 2 Time, ns 4 6
109
Issues in Dynamic Design 4: Clock Feedthrough
110
Clock Feedthrough
In3 In &
0.5 Clk
In4 Out
Clk -0.5
0 0.5 Time, ns 1
Clock feedthrough
112
Straightforward cascading of
dynamic gates to create more
complex structures does not work.
Cascading Dynamic Gates
The function of transistors Mf1 and Mf2 is to keep the circuit static when the clock
is high for extended periods of time
(bleeder)
Multiple Output Domino for
reduced Area
Compound Domino Logic