(Report) Design of A Low-Power Asynchronous SAR ADC in 45 NM CMOS Technology
(Report) Design of A Low-Power Asynchronous SAR ADC in 45 NM CMOS Technology
(Report) Design of A Low-Power Asynchronous SAR ADC in 45 NM CMOS Technology
in 45 nm CMOS Technology
Muhammad Aldacher
San Jose State University
________________________________________
________________________________________
Design of a Low-Power Asynchronous SAR ADC
in 45 nm CMOS Technology
Muhammad Aldacher
San Jose State University
1
1. Abstract
This project proposes a 1V 8-bit asynchronous successive approximation register (SAR) analog-
to-digital converter (ADC) implemented in 45nm CMOS technology. The asynchronous SAR ADC system
consists of an internal-clock generator, a bootstrapped sample-and-hold switch, a capacitive digital-to-
analog converter (DAC), a dynamic comparator, and a SAR logic. Taking a 64-point FFT on the output of
the SAR ADC and with an input-signal of 1.2V differentially, the maximum ENOB achieved at 20 MHz
sampling frequency is 7.35 bits, with an SNR of 46 dB & a total power consumption of 49.124 uW. The
figure-of-merit achieved, based on Walden’s calculation, is 15 fJ/conv. This SAR ADC system can be used
in systems that mainly require low power with medium resolution and medium speed like in computing-in-
memory cores for AI applications and in sensors for biomedical applications.
2. Introduction
2.1 Motivation
The SAR ADC architecture has become an attractive solution in recent low-power applications
since it offers a good compromise between medium resolution and speed and low power consumption [1].
Typical SAR ADCs utilize the binary search algorithm in a feedback system to produce a sample of data
after a series of sequential iterations, resulting in a relatively lower sample rate compared to pipeline and
flash ADCs, as shown in Fig.1. But since SAR ADCs don’t require power-hungry analog circuits like
Opamps, neither do they use a large number of comparators per ADC, the total power of the building blocks
of the SAR ADCs can scale down well with technology. The power efficiency of SAR ADCs is useful for
systems that require the use of a large number of ADCs and operate at speeds around a few KHz or a few
MHz, like in computing-in-memory cores (CIMs) used in artificial intelligence systems [2] and biomedical
signal processing units [3].
2
In terms of control schemes, there are two types of SAR ADC architectures; the synchronous SAR
and the asynchronous SAR. Unlike the synchronous architecture, the process of the asynchronous SAR
ADC is controlled by internally generated clock signals, providing the same sample rate as the typical
synchronous SAR ADC while using a slower external sampling clock signal. In synchronous SAR ADCs,
at least N clock cycles are needed per conversion, while in asynchronous SAR ADCs, the whole conversion
process occurs within 1 cycle of the external clock. The idea is to use internal conditions to cause the
sequential binary search process to start and finish within 1 clock cycle. By removing the need for an
oversampled clock signal, the dynamic power and the complexity of the logic circuits can be further reduced
[4]. Another advantage for the asynchronous architecture is that it is possible to further decrease the
conversion time at a given resolution, since each comparison depends on the internal conditions of the
circuits in the system, rather than the period of the sampling clock [5].
At the beginning of any conversion operation, the input signal VIN is sampled to the bottom plates
of the capacitors using the S switches, while the upper plate (node VX) is connected to VCM. The total charge
stored on the capacitors is equal to VIN multiplied by the sum of the capacitors. The voltage and the charge
at the node VX are shown in equations 1 and 2.
3
During sampling phase: 𝑉𝑋 = 𝑉𝐶𝑀 = 0 (1)
𝑄𝑋 = −𝑉𝐼𝑁 . 8𝐶 (2)
After the sample phase, the top plate of the capacitors is disconnected from VCM, while the bottom
plate is disconnected from VIN and is changed according to the digital code coming from the SAR logic. In
the first comparison, the MSB is set to 1 making the DAC code 100 initially. Through charge distribution
and voltage division, VX changes as shown in equation 4 and compared to VCM (0 in this example). This
is equivalent to VIN being compared to ½ VREF. If VIN > ½ VREF (VX < 0), the comparator outputs a 1,
otherwise, the comparator gives a 0.
4𝐶
Comparison 1 (DAC=100): 𝛥𝑉 = 4𝐶+ 2𝐶+ 𝐶+ 𝐶
(3)
1
𝑉𝑋 = −𝑉𝐼𝑁 + 𝛥𝑉 = −𝑉𝐼𝑁 + 𝑉𝑅𝐸𝐹 (4)
2
In the next comparison, the MSB will take the value of the comparator’s output D2 and the MSB-1
bit is set to 1. VIN now is compared in the either the upper half of VREF if D2 = 1 or the lower half of VREF if
D2 =0. After this comparison, the MSB-1 bit takes the value of the comparator D1 and MSB-2 bit is set to
1. After N=3 comparisons, the SAR ADC outputs D2D1D0 as the final binary code that translates to the
voltage level VDAC given in equation 9. Fig.3. shows the change in the node voltages of the capacitors in
each stage of the conversion process. The binary search process is shown in Fig.4. The binary tree in Fig.5.
shows the all the possible outcomes of this 3-bit SAR system. The stages of this binary tree can be extended
to N stages for an N-bit SAR ADC.
(𝐷2 . 4𝐶)+ 2𝐶
Comparison 2 (DAC=D210): 𝑉𝐷𝐴𝐶 = 4𝐶+ 2𝐶+ 𝐶+ 𝐶
(5)
𝐷2 1
𝑉𝑋 = −𝑉𝐼𝑁 + 𝑉𝐷𝐴𝐶 = −𝑉𝐼𝑁 + 2
𝑉𝑅𝐸𝐹 + 4
𝑉𝑅𝐸𝐹 (6)
4
(a)
(e)
5
Fig.5. Binary tree showing all the possible outputs of a 3-bit SAR ADC
6
𝑉𝑅𝑒𝑓
𝑉𝑅𝑒𝑓𝑛 = 𝑉𝐶𝑀 − 2
(10)
𝑉𝑅𝑒𝑓
𝑉𝑅𝑒𝑓𝑝 = 𝑉𝐶𝑀 + 2
(11)
7
Fig.8. Timing diagram of an asynchronous SAR ADC system
3. Design
3.1 Internal Clock Generator
The internal clock generator is the block responsible for providing the rest of the system with the
clock signals to operate. This block produces 2 output signals, “clk_sample” for the sample phase and
“clk_sar” for the comparison phases, as explained before. There are 3 inputs to this block, “clk_ext” which
is the system’s external clock, “Ready” which is produced by the comparator once the comparison is done
to tell the system that it is ready for the next iteration, and “EOC” which is produced by the SAR logic
block once all the N bits are generated. The rising edge of the external clock is only used to prompt the
beginning of the conversion cycle, then the rest of the inputs and the outputs control the rest of the process
till the “EOC” is generated, concluding the conversion.
8
The circuit implementation of the internal clock generator is shown in Fig. 10. Part A is the part
that creates the “clk_sample” signal. The delay chosen in this section is enough for the S/H circuit to settle
when charging the DAC capacitors during the sampling phase. The XOR gate creates a pulse at each
transition of the external clock and the AND gate only passes the pulse when “clk_ext” is high, as shown
in Fig. 11. Part B is responsible for toggling “clk_SAR” after the sampling phase is done. The 2-to-1
multiplexers are used to prevent “clk_SAR” from toggling and to stay 0 during the sampling phase
(clk_sample = 1) or after the conversion is done (EOC = 1). The D input of the flipflop is kept high, so that
the Q output would initially be high, consequently causing “clk_SAR” to go high once the sampling phase
ends (clk_sample becomes 0). Part C controls the toggling of “clk_SAR” through the set and reset inputs
of the flipflop, while ensuring that set and reset are never high at the same time (invalid state). The
“clk_SAR” going high triggers the reset signal, which pulls “clk_SAR” low after a certain delay, and the
“Ready” going high triggers the set signal, which pulls “clk_SAR” high after a certain delay. The flipflop
used is a true-single phase clock (TSPC) flipflop, which is discussed in section 3.5, and the rest of the digital
gates are CMOS gates, as shown in Fig. 12. The implementation of the delay cell is shown in Fig. 13.
9
(a) (b)
(c)
(d)
Fig.12. Digital gates used in the internal clock generator block
(a) NAND, (b) Inverter, (c) 2-to-1 Mux, (d) XOR
10
Fig.13. Delay cell circuit implementation
11
1 1
𝑅𝑂𝑁 = = (12)
𝜇𝑛 .𝐶𝑂𝑋 .𝑊⁄𝐿 .(𝑉𝐺𝑆 − 𝑉𝑇𝐻 ) 𝜇𝑛 .𝐶𝑂𝑋 .𝑊⁄𝐿 .((𝑉𝐷𝐷 −𝑉𝐼𝑁 ) − 𝑉𝑇𝐻 )
The bootstrapped switch help mitigate some of these non-idealities by keeping the VGS of the switch
fixed during the sampling phase. In the basic illustration in Fig. 16 (a), Φ2 switches are ON during the hold
phase, turning the main switch MNSW completely off by connecting its gate to ground and charging the
capacitor COFFSET to a voltage of VC = VDD – VSS [8]. During the sampling phase, Φ1 switches are ON while
Φ2 switches are OFF, causing the capacitor to act as a constant battery fixing VGS to be equal to VC. Fig. 16
(b) shows how the gate voltage Vg and the source voltage Vs of MNSW are bootstrapped allowing them to
change together. Fixing VGS suppresses the distortion due to dependence of RON and QCH on the input signal.
Also, the width W of MNSW does not need to be large to further reduce RON’s effect, which in turn helps
reducing clock feedthrough as well since the parasitic caps are smaller with smaller W.
(a) (b)
Fig.16. (a) Basic bootstrapped switch circuit, (b) Vg waveform while bootstrapped to Vs (= Vi)
12
The proposed bootstrapped circuit is shown in Fig. 17. The MOSFET devices Mn1, Mp1, Mn2,
and Mp2 act as switches that are used to charge the offset capacitor during the hold phase (clk = 0) and to
fix the VGS of the main switch MnSW during the sample/track phase (clk = 1). The large VGS ensures that
MnSW is completely ON and much greater than the threshold voltage VTH while the source voltage of
MnSW is changing. The gate nodes of MnSW and Mp2 are tied because during sampling, Vg and Mp2’s
source are usually greater than VDD. For Mp2 to be completely off, its gate must be tied to the highest
voltage in the circuit, hence tied to Vg. The bulks of the PMOS switches Mp1 and Mp2 are connected to
the highest voltage of the circuit which is the positive node of COFFSET. During the hold phase, Mn4 connects
Vg to the lowest voltage in the system to ensure complete shut-off of MnSW. Mn3 is added to prevent Mn4
from having a very high VGD when Mn4 is OFF. Since Vg ≥ VDD and the gate voltage of Mn3 is VDD (less
than Vg), then Mn3 won’t act as a strong pass device and the drain of Mn4 will be limited to a maximum
of VDD-VTH. Regarding the value of COFFSET, the bigger it is, the better it can maintain a constant Vc during
the sampling phase, but at the expense of larger area. With MpSW added in parallel to MnSW, the clock
feedthrough effect is cancelled-out, since they both operate with 2 opposing clocks (clk & clk_b).
13
to a dummy capacitor. The differential capacitive DAC employs 2 of such array, as shown in Fig. 18 [9].
Other than the switches used to sample the inputs, the switches used in the DAC are CMOS transmission
gates. To switch between the sampled input and then to the DAC code selection, a switch network is used
as shown in Fig. 19. To make the transmission gates less resistive (smaller RON), the widths of the NMOS
and PMOS transistors are increased, but this is at the expense of larger parasitic capacitances. For the DAC
capacitors, increasing the unit capacitor’s value decreases the thermal noise effect but also decreases the
speed of the system. In the proposed DAC, an additional common-mode switch is added between VXP and
VXN for a better and faster connection to each other and to VCM during the sampling phase.
14
(a) (b)
Fig.20. Schematics of the switches used in the DAC (a) Switch network, (b) Transmission gate
15
up towards VDD. Since the back-to-back inverters form a positive feedback loop, regeneration occurs pulling
out– to ground and out+ to VDD, as shown in Fig. 23 (b). Since the outputs of the strong-arm latch are set
to VDD during the reset phase, the RS latch is added to keep the generated decision for the SAR logic to
pickup it up during the reset phase. This is done by turning OFF M5 and M6. The back-to-back inverters
of the RS latch are weak enough for M5 and M6 to change their stored values when the new outputs are
regenerated by the strong-arm latch. Finally, the XOR gate produces the ready signal by generating a high
only when the regeneration process makes the output nodes of the strong-arm different (one of them is 1
and the other is 0), signaling that the comparison process is finished.
(a) (b)
Fig.21. Types of Dynamic Comparator (a) Strong-Arm Comparator, (b) Two-Stage Comparator
16
(a) (b)
Fig.23. Proposed Strong-Arm Comparator (a) Schematics, (b) Operating phases [13]
17
it outputs a 1 for its corresponding bit and also triggers the clock for the previous FF so that the previous
bit would take the value of the comparator’s decision. At the end of the end of the conversion process, the
code register holds the final DAC code and an EOC signals the output stage to take that code as the result
of the ADC conversion.
Fig. 26 shows the schematics of the proposed SAR logic. The state table of the SAR logic block
showing the whole 8-bit conversion process is presented in Table 1. The clock driving this block is
“clk_SAR” coming from the internal clock generator, and the reset pin of this block is connected to
“clk_sample” so that the outputs of all the FFs are reset during the sampling phase. For the 8-bit system,
there are 8 main FFs in each of the sequencer and the code registers, in addition to 1 extra FF at beginning
of the sequencer register to load the hot code during the sampling phase, and 2 extra FFs at the end of the
registers to generate the EOC signal along with the final output code. The output of the comparator is
connected to the D input of the main FFs of the code register. Since the capacitive DAC operates
differentially, both the complementary outputs of the FFs are sent to the DAC.
18
Table 1: SAR Logic States Table
All the FFs in the SAR logic block are implemented using the true single-phase clock (TSPC)
circuit shown in Fig. 27. TSPC FFs provide higher speeds, lower power consumption, and smaller area
compared to the CMOS FFs [15]. Transistors M9, M11, and M12 are added to the conventional TSPC FF
to add asynchronous set and reset capabilities. Inverters are added as an output stage to the TSPC FF to
give strong rail-to-rail outputs.
19
3.6.1 Output Register
The output register’s job is to receive the final DAC code at the rising edge of the EOC signal and
releases it at the falling edge of EOC which happens at the rising edge of the next external clock cycle. The
FFs used in this block are CMOS FFs at shown in Fig. 29. CMOS FFs are used in this block instead of
TSPC FFs to ensure rail-to-rail node voltages within the circuit since the EOC signal is a low-frequency
signal.
20
3.6.2 Output DAC
The ideal output DAC used for evaluation is shown in Fig. 30. This block consists of 8 binary-
weighted ideal voltage-controlled voltage sources (VCVS). They are stacked on top of each other to be
summed together to represent the analog output given by equation 15. The final VCVS at the right is to
correct the normalized analog output by multiplying by the right gain equal to the peak-to-peak swing of
the input signal and shift the minimum level to VRefn of the system.
1
𝑉𝑂𝑈𝑇 = 28 . (𝐷7 . 27 + 𝐷6 . 26 + 𝐷5 . 25 + 𝐷4 . 24 + 𝐷3 . 23 + 𝐷2 . 22 + 𝐷1 . 21 + 𝐷0 . 20 ) (15)
21
4.1 Ideal Models
To gain an understanding on the SAR ADC system, both the synchronous and the asynchronous
SAR ADCs were modelled using the VerilogA blocks and the ideal components shown in appendix A. Fig.
31 shows the testbench for the 8-bit synchronous SAR ADC. The stimuli used were a clock of 100 MHz
7
and an input of 64
𝑥 10 MHz. The output produced is of 10 MS/sec as shown in Fig. 32, since each 10
cycles of the clock produces 1 sample. Fig. 33 shows a zoomed view into 1 conversion cycle that contains
1 cycle for sampling, 8 cycles for bit comparisons, and 1 cycle to produce the final code.
22
Fig.33. 1 conversion cycle for the synchronous SAR ADC
The testbench is then modified to test the 8-bit asynchronous SAR ADC functionality as shown in
7
Fig. 34. The stimuli used were an external clock of 10 MHz and an input of 64 𝑥 10 MHz. Again, the output
produced is of 10 MS/sec as shown in Fig. 35, and Fig. 36 shows the behavior within 1 conversion cycle
that occurs in 1 external clock cycle. Since ideal components can change instantaneously, suitable ideal
delay cells must be added inside the internal clock generator block and at the output of the comparator.
23
Fig.35. Transient Waveforms of the ideal asynchronous SAR ADC testbench
24
4.2.1 Bootstrapped Switch
To test the performance of the bootstrapped circuit, the output node is connected to all the
capacitors in the DAC array as shown in Fig. 37. The sampling pulse is dictated by the time the output node
needs to fully track the input node during the tracking phase (clk = 1). In the asynchronous SAR ADC, this
pulse is generated by the delay cells in part A of the internal clock generator block explained in section 3.1
(Fig. 10). The smaller this pulse gets, the more challenging the design of the bootstrapped switch becomes
for the given output capacitance load.
For the asynchronous SAR ADC to operate with a sampling pulse of 3.125 ns, the testbench
parameters are determined as follows:
1
𝑓𝐶𝐿𝐾 = 𝑓𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 = = 160 𝑀𝐻𝑧
2 𝑥 3.125 𝑛𝑠
𝐹𝑜𝑟 𝐹𝐹𝑇 𝑜𝑓 𝑁 = 64, 𝑎𝑛𝑑 𝑡𝑜 ℎ𝑎𝑣𝑒 1 𝑖𝑛𝑝𝑢𝑡 𝑐𝑦𝑐𝑙𝑒 𝑒𝑣𝑒𝑟𝑦 7 𝑠𝑎𝑚𝑝𝑙𝑒 𝑐𝑦𝑐𝑙𝑒𝑠, 𝑡ℎ𝑒𝑛:
7
𝑓𝐼𝑁 = 𝑥 160 = 17.5 𝑀𝐻𝑧
64
1
𝑇𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 = = 6.25 𝑛𝑠
𝑓𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔
𝑇ℎ𝑒𝑛, 𝑡ℎ𝑒 𝑠𝑖𝑚𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑡𝑖𝑚𝑒 𝑡𝑜 𝑐𝑎𝑝𝑡𝑢𝑟𝑒 64 𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑐𝑦𝑐𝑙𝑒𝑠 𝑜𝑟 7 𝑖𝑛𝑝𝑢𝑡 𝑐𝑦𝑐𝑙𝑒𝑠:
𝑇𝑠𝑖𝑚𝑢𝑙𝑎𝑡𝑖𝑜𝑛 = 64 𝑥 6.25 𝑛𝑠 = 400 𝑛𝑠
𝐴𝑛𝑑 𝐴𝑑𝑑𝑖𝑛𝑔 9𝑛𝑠 𝑎𝑠 𝑎𝑛 𝑖𝑛𝑡𝑖𝑎𝑙𝑖𝑧𝑎𝑡𝑖𝑜𝑛 𝑡𝑖𝑚𝑒, 𝑡ℎ𝑒𝑛 𝑡ℎ𝑒 𝑚𝑒𝑎𝑠𝑢𝑟𝑒𝑚𝑒𝑛𝑡 𝑡𝑖𝑚𝑒 𝑡𝑎𝑘𝑒𝑛 𝑖𝑠:
𝑇𝑚𝑒𝑎𝑠𝑢𝑟𝑒𝑚𝑒𝑛𝑡 : 9𝑛𝑠 → 409𝑛𝑠
25
The waveforms of the bootstrapped switch’s testbench in Fig. 39 and Fig. 40 show how the output
“vinn_sampled” tracks the input when clk =1 and holds the value when clk = 0. Using the setup in Fig. 41,
the generated output spectrum in Fig. 42 gives an ENOB of 10.02 bits and SNR of 62.08 dB.
26
Fig.41. Spectrum measurement setup for the bootstrapped circuit’s output signal
27
and (b) show the waveforms with faster settling and slower setting times respectively. The delay from
“clk_SAR” to the reset input of the FF in part C of the internal clock generator in Fig. 10 is chosen to be
longer than the settling time of the DAC. The proposed system is designed for a 2fF unit capacitance in the
DAC and a settling time < 1ns.
(a)
28
(b)
Fig.44. Transient waveforms of the capacitive DAC testbench
(a) with faster settling (b) with slower settling
4.2.3 Comparator
The resolution of the ADC is determined by the minimum input that the comparator can detect and
resolve. The comparator must be able to resolve a differential input voltage of 1 LSB or less. For an 8-bit
1 0.6
ADC and a single-ended input full-scale range of 0.6V, the ½ LSB is equal to 2 . 28 = 1.17 𝑚𝑉. The
resolution of the comparator is tested through the overdrive recovery test using the testbench in Fig. 45. In
the overdrive test, the comparator is tested by changing the input signal between different levels and seeing
if the comparator can recognize these changes within 1 clock cycle from the change. Fig. 46 and 47 show
the overdrive test waveforms at different inputs. The inputs change during the comparator’s reset phase
(S_bar = 0) and the comparator starts the comparison at the rising edge of its clock. From the overdrive test,
the minimum differential input that can be resolved by the designed comparator is 0.6 mV with a clock of
300 MHz.
29
Fig.45. Testbench for the dynamic comparator
30
Fig.47. Overdrive recovery test (Vin = 0.6mV and frequency = 300MHz)
31
Fig.49. Transient waveforms of the asynchronous SAR ADC testbench
Fig.50. Waveforms of the asynchronous SAR ADC testbench during 1 conversion cycle
Fig. 50 is a zoomed snapshot of the waveforms during 1 conversion cycle. At the rising edge of
“clk_external”, a pulse is generated at “clk_sample” in which the sampling of the inputs occurs. Once
“clk_sample” goes low, “clk_SAR” goes high causing the code from the SAR logic block to redistribute
the charges in the capacitive DAC, changing the input nodes to the comparator “vgp” and “vgn”. At the
falling edge of “clk_SAR”, the DAC would have settled already, and the comparator is activated
32
(regeneration phase). When the outputs of the strongarm latch “latch+” and “latch–” give opposite rail-to-
rail output levels (a strong high and a strong low), the “ready” signal goes high which triggers “clk_SAR”
to go up high after a certain delay. After 8 cycles of “clk_SAR”, the EOC signal or “register_clk” signals
the output register to output the result of the conversion process, which is the final digital code from the
SAR logic block. To measure the performance, the digital output is converted to its analog equivalence and
the FFT is taken for this analog output.
To produce the frequency response of the output of the system, the measurement setup in Fig. 51
is used. The stimuli and the measurement parameters are determined from the external clock frequency of
the system as follows:
𝑓𝐶𝐿𝐾 = 20 𝑀𝐻𝑧
𝐹𝑜𝑟 𝐹𝐹𝑇 𝑜𝑓 𝑁 = 64, 𝑎𝑛𝑑 𝑡𝑜 ℎ𝑎𝑣𝑒 1 𝑖𝑛𝑝𝑢𝑡 𝑐𝑦𝑐𝑙𝑒 𝑒𝑣𝑒𝑟𝑦 7 𝑠𝑎𝑚𝑝𝑙𝑒 𝑐𝑦𝑐𝑙𝑒𝑠, 𝑡ℎ𝑒𝑛:
7
𝑓𝐼𝑁 = 𝑥 20 = 2.1875 𝑀𝐻𝑧
64
1
𝑇𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 = = 50 𝑛𝑠
𝑓𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔
𝑇ℎ𝑒𝑛, 𝑡ℎ𝑒 𝑠𝑖𝑚𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑡𝑖𝑚𝑒 𝑡𝑜 𝑐𝑎𝑝𝑡𝑢𝑟𝑒 64 𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑐𝑦𝑐𝑙𝑒𝑠 𝑜𝑟 7 𝑖𝑛𝑝𝑢𝑡 𝑐𝑦𝑐𝑙𝑒𝑠:
𝑇𝑠𝑖𝑚𝑢𝑙𝑎𝑡𝑖𝑜𝑛 = 64 𝑥 50 𝑛𝑠 = 3200 𝑛𝑠
𝐴𝑛𝑑 𝐴𝑑𝑑𝑖𝑛𝑔 150 𝑛𝑠 𝑎𝑠 𝑎𝑛 𝑖𝑛𝑡𝑖𝑎𝑙𝑖𝑧𝑎𝑡𝑖𝑜𝑛 𝑡𝑖𝑚𝑒, 𝑡ℎ𝑒𝑛 𝑡ℎ𝑒 𝑚𝑒𝑎𝑠𝑢𝑟𝑒𝑚𝑒𝑛𝑡 𝑡𝑖𝑚𝑒 𝑡𝑎𝑘𝑒𝑛 𝑖𝑠:
𝑇𝑚𝑒𝑎𝑠𝑢𝑟𝑒𝑚𝑒𝑛𝑡 : 150𝑛𝑠 → 3350𝑛𝑠
Fig.51. Spectrum measurement setup for the asynchronous SAR ADC output signal
The spectrum of the final output of the asynchronous SAR ADC testbench, from which the key
dynamic ADC performance metrics are measured, is shown in Fig. 52. The signal-to-noise-and-distortion
33
ratio (SINAD) is calculated from the signal-to-noise ratio (SNR) and the total harmonic distortion (THD)
as in equation (16). Since the THD is neglected in this setup, the SINAD and SNR are considered the same.
The theoretical SNR for an N-bit ADC driven by a full-scale sine wave and that only considers the
quantization noise is given by equation (17). Using the equation of SNR, the effective number of bits
(ENOB) can be obtained from equation (19). The ENOB represents the actual resolution of the ADC after
taking the noise and the distortion of the system into consideration. As a measure of the strength of the
desired signal compared to the worst spur in the output spectrum, the spurious-free dynamic range (SFDR)
is also found [16].
𝑃𝑆𝑖𝑔𝑛𝑎𝑙
𝑆𝑁𝑅 (𝑖𝑛 𝑑𝐵𝑠) = 10 log ( 𝑃 ) = 6.02 𝑁 + 1.76 (17)
𝑛𝑜𝑖𝑠𝑒
𝑃𝑆𝑖𝑔𝑛𝑎𝑙
𝑇𝐻𝐷 (𝑖𝑛 𝑑𝐵𝑠) = 10 log ( ) (18)
𝑃𝑑𝑖𝑠𝑡𝑜𝑟𝑡𝑖𝑜𝑛
𝑆𝐼𝑁𝐴𝐷 − 1.76
𝐸𝑁𝑂𝐵 (𝑖𝑛 𝐵𝑖𝑡𝑠) = 6.02
(19)
These key metrics and parameters of the system are displayed in Table 2. The figure-of-merit
(FOM) of the system is defined by equation (20). The aim of the design is to decrease the FOM, which
mainly involves decreasing the total power consumption and improving the ENOB. The power is improved
by using circuits that consume no static power like the strongarm latch and the TSPC flipflops, while the
34
ENOB is improved by using techniques like bootstrapped switching and highly sensitive strongarm latch
design. The achieved FOM from the proposed SAR ADC is 15 fJ/conversion-step which is comparable to
earlier asynchronous SAR ADC designs, as shown in Ref [4].
𝑃𝑡𝑜𝑡𝑎𝑙
𝐹𝑂𝑀 = 2𝐸𝑁𝑂𝐵 .𝑓𝑠
(20)
Table 3 and Fig. 53 show how the total power is divided among the different blocks. Most of the
power consumption come from the digital blocks, with the delay cells being the biggest contributor to the
high power. The power of the digital circuits usually scales down well with supply, since the dynamic
power is directly proportional to VDD2 as shown in equation (21). This allows the proposed asynchronous
SAR ADC to have the potential to offer more power reduction in future design work.
35
Fig.53. Power consumption percentage per block
5. Conclusion
A low-power 8-bit asynchronous SAR ADC, with a clock input of 20MHz and designed using
CMOS 45nm technology at 1 V, is presented. The design uses an internal clock generator that controls the
conversion process by generating clock signals for the rest of the circuit. Bootstrapped switches are used to
sample the differential input signal and to provide high ENOB. A binary-weighted charge-redistribution
capacitive DAC provides the input levels to be compared by the comparator while consuming no static
power. The comparison is done by a low-power strongarm-based comparator with a resolution of 0.6 mV.
The SAR logic block that generates the digital code uses TSPC flipflops to further reduce power
consumption. The measured ENOB of the system is 7.35 bits while consuming a total power of 49.12 uW,
resulting in an FOM of 15 fJ/conversion-step.
Improvements can be further applied to the design to get better FOM. As seen in the results section,
most of the power consumption comes from the digital blocks, so power reduction is possible by lowering
the supply voltage for these blocks or by using other logic topologies, like dynamic CMOS logic. Also,
other capacitive DAC topologies offer lower switching energy through split-capacitor arrangements and
monotonic switching methods, as discussed in Ref [9]. Changing the switching scheme in the DAC will
require a re-design of the SAR logic block as well. Other topologies for the internal clock generator can
also help in saving power, as discussed in Ref [6]. The proposed design shows that the asynchronous SAR
ADC architecture can provide an energy and FOM efficient ADC without the use of an oversampled clock.
36
Appendix A: Verilog-A codes and Ideal models
In this section, the VerilogA codes used in the ideal models for the synchronous and the
asynchronous SAR ADCs are presented.
module VerilogA_SampleAndHold(clk,vin,vmin,vout);
input clk,vin,vmin;
output vout;
electrical vout,vin,vmin,clk;
real v;
analog begin
module VerilogA_ClockedComparator(dout,vref,vin,clk);
input vin,vref,clk;
output dout;
electrical dout,vref,vin,clk;
real d_result;
analog begin
37
else begin
d_result = 0;
end
end
module
VerilogA_SAR_logic_Synchronous(in_comp,clk,d0,d1,d2,d3,d4,d5,d6,d7,regclk,sampl
eclk,vdd,vss);
inout vdd,vss;
input in_comp, clk;
output d0,d1,d2,d3,d4,d5,d6,d7;
output sampleclk,regclk;
electrical in_comp,clk,d0,d1,d2,d3,d4,d5,d6,d7,regclk,sampleclk,vdd,vss;
real d_0,d_1,d_2,d_3,d_4,d_5,d_6,d_7,sample_en,sar_counter,reg_out;
analog begin
// Initial State
@(initial_step) begin
sample_en = 1;
d_7 = 0;
d_6 = 0;
d_5 = 0;
d_4 = 0;
d_3 = 0;
d_2 = 0;
d_1 = 0;
d_0 = 0;
sar_counter = 9; //10-bit counter
end
// Operation: Comparison
// At Rising Edge of clk, the bits are updated for comparison.
// At Falling Edge of clk, the Comparator produces the new bit
(Regeneration)
@(cross(V(clk) - clk_threshold, +1))
begin
if(sar_counter == 9) begin
sample_en = 1; //1st count is for sampling
reg_out = 0;
d_7 = 0; d_6 = 0; d_5 = 0; d_4 = 0; d_3 = 0; d_2 = 0; d_1 = 0;
d_0 = 0;
38
sar_counter = sar_counter - 1; end
// Producing the neg edge of the pulse to sample & to get the register output
@(cross(V(clk) - clk_threshold, -1))
begin
if(sample_en == 1) begin
sample_en = 0; end
if(sar_counter == 9) begin
reg_out = 1; end
end
39
end
endmodule
module VerilogA_Register_8bit
(clk,in0,in1,in2,in3,in4,in5,in6,in7,out0,out1,out2,out3,out4,out5,out6,out7,
vdd,vss);
inout vdd,vss;
input clk,in0,in1,in2,in3,in4,in5,in6,in7;
output out0,out1,out2,out3,out4,out5,out6,out7;
electrical clk,in0,in1,in2,in3,in4,in5,in6,in7,out0,out1,out2,out3,out4,out5,
out6,out7,vdd,vss;
real d_0,d_1,d_2,d_3,d_4,d_5,d_6,d_7;
analog begin
@(cross(V(clk) - clk_threshold, +1))
begin
d_7 = V(in7);
d_6 = V(in6);
d_5 = V(in5);
d_4 = V(in4);
d_3 = V(in3);
d_2 = V(in2);
d_1 = V(in1);
d_0 = V(in0);
end
module VerilogA_DAC_8bit(d0,d1,d2,d3,d4,d5,d6,d7,vout,vdd,vss,vmin,vmax);
40
inout vdd,vss;
input d0,d1,d2,d3,d4,d5,d6,d7;
input vmin, vmax;
output vout;
electrical vout,vdd,vss,d0,d1,d2,d3,d4,d5,d6,d7,vmin,vmax;
real result,d_0,d_1,d_2,d_3,d_4,d_5,d_6,d_7;
analog begin
d_7 = V(d7)*128;
d_6 = V(d6)*64;
d_5 = V(d5)*32;
d_4 = V(d4)*16;
d_3 = V(d3)*8;
d_2 = V(d2)*4;
d_1 = V(d1)*2;
d_0 = V(d0)*1;
result = ((d_7+d_6+d_5+d_4+d_3+d_2+d_1+d_0) *
((V(vmax)-V(vmin))/(256))) + V(vmin) ;
41
A.2.2 SAR Logic:
module VerilogA_DFlipFlop(D,clk,Q,Qb,set,reset,vdd,vss);
inout vdd,vss;
input D,clk,set,reset;
output Q,Qb;
electrical D,clk,Q,Qb,set,reset,vdd,vss;
real d_0,d_0b;
analog begin
@(cross(V(set) - vtrans, +1)) begin
d_0 = 1;
d_0b = 0; end
@(cross(V(reset) - vtrans, +1)) begin
d_0 = 0;
d_0b = 1; end
else begin
d_0 = 0; d_0b = 1; end
//end
end
42
A.2.4 Inverter Gate:
// VerilogA for SAR_VerilogA_Asynchronous, VerilogA_inv, veriloga
`include "constants.vams"
`include "disciplines.vams"
module VerilogA_inv(A,X);
input A;
output X;
electrical A,X;
real X_out;
analog begin
module VerilogA_XOR(A,B,X);
input A,B;
output X;
electrical A,B,X;
real X_out;
analog begin
else begin
X_out = 1; end
43
A.2.6 AND Gate:
// VerilogA for SAR_VerilogA, VerilogA_AND, veriloga
`include "constants.vams"
`include "disciplines.vams"
module VerilogA_AND(A,B,X);
input A,B;
output X;
electrical A,B,X;
real X_out;
analog begin
module VerilogA_2to1Mux(in0,in1,select,out,vdd,vss);
inout vdd,vss;
input in0,in1;
output out;
electrical in0,in1,select,out,vdd,vss;
real result;
analog begin
end
endmodule
44
A.2.8 Delay Cell (1ns):
// VerilogA for SAR_VerilogA_Asynchronous, VerilogA_1ns_delaycell, veriloga
`include "constants.vams"
`include "disciplines.vams"
module VerilogA_1ns_delaycell(inp,outp,vdd,vss);
inout vdd,vss;
input inp;
output outp;
electrical inp,outp,vdd,vss;
real d_out;
analog begin
@(cross(V(inp) - vtrans, +1))
begin
d_out = 1;
end
endmodule
45
References
[1] J. Fredenburg and M. P. Flynn, "ADC trends and impact on SAR ADC architecture and analysis,"
2015 IEEE Custom Integrated Circuits Conference (CICC), San Jose, CA, 2015, pp. 1-8.
[2] S. Zhang, K. Huang and H. Shen, "A Robust 8-Bit Non-Volatile Computing-in-Memory Core for
Low-Power Parallel MAC Operations," IEEE Transactions on Circuits and Systems I: Regular
Papers, 2020.
[3] W. Mao, Y. Li, C. Heng, and Y. Lian, "A Low Power 12-bit 1-kS/s SAR ADC for Biomedical Signal
Processing," IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 66, no. 2, pp. 477-
488, Feb. 2019.
[4] P. J. A. Harpe et al., "A 26 uW 8 bit 10 MS/s Asynchronous SAR ADC for Low Energy Radios,"
IEEE Journal of Solid-State Circuits, vol. 46, no. 7, pp. 1585-1595, July 2011.
[5] C. Huang, J. Lin, Y. Shyu and S. Chang, "A Systematic Design Methodology of Asynchronous SAR
ADCs," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 24, no. 5, pp. 1835-
1848, May 2016, doi: 10.1109/TVLSI.2015.2494063.
[6] O. Kardonik, "A study of SAR ADC and implementation of 10-bit asynchronous design," M.S.
Thesis, Department of Electrical and Computer Engineering, University of Texas at Austin, 2013.
[7] B. Razavi, "Data Conversion System Design," IEEE Press, 1995.
[8] M. Dessouky and A. Kaiser, "Very low-voltage digital-audio Δ∑ modulator with 88-dB dynamic
range using local switch bootstrapping," IEEE Journal of Solid-State Circuits, vol. 36, no. 3, pp. 349-
355, March 2001.
[9] C. Liu, S. Chang, G. Huang, and Y. Lin, “A 10-bit 50-MS/s SAR ADC with a Monotonic Capacitor
Switching Procedure," in IEEE Journal of Solid-State Circuits, vol. 45, no. 4, pp. 731-740, April
2010.
[10] B. Razavi, "The StrongARM Latch [A Circuit for All Seasons]," IEEE Solid-State Circuits Magazine,
vol. 7, pp. 12-17, 2015.
[11] M. Aldacher, M. Nasrollahpour and S. Hamedi-hagh, "A low-power, high-resolution, 1 GHz
differential comparator with low-offset and low-kickback," 2017 24th IEEE International Conference
on Electronics, Circuits and Systems (ICECS), Batumi, 2017, pp. 310-313.
[12] M. Al-Qadasi, A. Alshehri, A. S. Almansouri, T. Al-Attar, and H. Fariborzi, "A High Speed Dynamic
StrongARM Latch Comparator," 2018 IEEE 61st International Midwest Symposium on Circuits and
Systems (MWSCAS), Windsor, ON, Canada, 2018, pp. 540-541.
[13] M. M. Ayesh, S. Ibrahim, and M. M. Aboudina, "Design and analysis of a low-power high-speed
charge-steering based StrongARM comparator," 2016 28th International Conference on
Microelectronics (ICM), Giza, Egypt, 2016, pp.209-212.
46
[14] T.O. Anderson. Optimum control logic for successive approximation analog-to-digital converters.
Communications Systems Research Section, JPL Technical Report 32-1526, Vol XIII.
[15] J. Yuan and C. Svensson, "New single-clock CMOS latches and flipflops with improved speed and
power savings," in IEEE Journal of Solid-State Circuits, vol. 32, no. 1, pp. 62-69, Jan. 1997.
[16] W. Kester, "Understand SINAD, ENOB, SNR, THD, THD + N, and SFDR so you don’t get lost in
the noise floor," Analog Devices, Norwood, MA, USA, Tech. Rep. MT-003, 2009.
47