An Automatic Circuit Design Framework For Level Shifter Circuits
An Automatic Circuit Design Framework For Level Shifter Circuits
An Automatic Circuit Design Framework For Level Shifter Circuits
Abstract—Although design automation is a key enabler of to realize its true potential. Various approaches have been
modern large-scale digital systems, automating the transistor- reported for automatic circuit topology generation. For digital
level circuit design process still remains a challenge. Some recent logic gates, the Boolean expression factoring method that gen-
works suggest that deep learning algorithms could be adopted
to find optimal transistor dimensions in relatively small circuitry erates series-parallel (SP) associations of transistors for the
such as analog amplifiers. However, those approaches are not given function was suggested [3]. Possani et al. [4] proposed
capable of exploring different circuit structures to meet the given an improved graph-based method that creates a logic gate by
design constraints. In this work, we propose an automatic circuit introducing nonseries-parallel (NSP) arrangements into the SP
design framework that can generate practical circuit structures structure, thus reducing the number of transistors. But these
from scratch as well as optimize the size of each transistor, con-
sidering performance and reliability. We employ the framework methods regard transistors as ideal switches and, hence, are
to design level shifter circuits, and the experimental results show only applicable to designing digital logic gates based on static
that the framework produces novel level shifter circuit topolo- operations.
gies and the automatically optimized designs achieve 2.8×–5.3× There have also been several circuit topology synthesis
lower power-delay product (PDP) than prior arts designed by approaches aimed at more general integrated circuits. The
human experts.
library-based methods [5], [6] select one of the predefined
Index Terms—Circuit design automation, deep learning, circuit structures (e.g., a two-stage amplifier) in the library
evolutionary algorithm, level shifter, reinforcement learning (RL). based on the desired operating characteristics. However, one
must construct a library containing all possible circuit struc-
I. I NTRODUCTION tures in advance, which is a time-consuming process that also
ITH increasing hardware design complexity and vari- necessitates a considerable amount of human effort. Building-
W ability of the fabrication process, design automation
has been widely adopted in a large portion of the IC design
block-based methods [7]–[9] take a similar approach, but rely
on a library of smaller building blocks, such as a current mirror
process. For instance, various electronic design automation and a differential input pair. They employ various algorithms
(EDA) tools are now available for designing digital blocks to search for the best topology, such as the multiobjective
and System-on-Chips (SoCs). The EDA tools can generate a evolutionary algorithm [7], framework for explorative analog
large block composed of millions of logic gates very efficiently topology synthesis method (FEATS) [8], and graph-grammar-
using a standard cell library[1]. However, when it comes to based topology generation (GGTG) [9]. Since the library- or
designing integrated circuits, design automation remains a building-block-based approaches have relatively limited search
challenge. Most digital and analog circuits are still carefully space, they are suitable for the fast generation of integrated cir-
designed by human experts due to high design complexity and cuits using a well-established topology. However, the search
reliability concerns [2]. space is constrained within the predefined set of circuit struc-
Integrated circuit design automation can be decomposed tures or building blocks and, hence, they are less adaptive to
into two design problems: 1) circuit topology selection and changes in design parameters or fabrication process. In addi-
2) transistor size optimization. It is crucial to choose a proper tion, there is little possibility that they could generate a novel
circuit topology in the first place since the topology mainly topology that has not been studied yet.
sets the limit on the performance and reliability a circuit can On the other hand, the transistor-based methods [10]–[13]
achieve. We also need to optimize the size of each transistor do not rely on predefined components for topology genera-
tion; instead, they progressively construct a circuit by adding
Manuscript received 25 June 2021; revised 10 December 2021; accepted or removing a transistor in the topology. For instance, the
10 February 2022. Date of publication 1 March 2022; date of current version
22 November 2022. This work was supported in part by the National Research circuit-constructing robot (CC-BOT) [11] starts with a single
Foundation of Korea under Grant NRF-2020R1A4A4079177 and Grant NRF- node and conditionally adds a transistor following an evolu-
2019R1C1C1004927, and in part by the IC Design Education Center. This tionary algorithm. An active bot moves to a newly created node
article was recommended by Associate Editor P. Li. (Corresponding author:
Dongsuk Jeon.) and continues adding transistors from there. The algorithm
The authors are with the Graduate School of Convergence Science in [13] represents transistors and passive devices as a 3-node
and Technology, the Research Institute for Convergence Science, and the graph (hypergraph) and an edge, respectively. In each gener-
Inter-University Semiconductor Research Center, Seoul National University,
Seoul 08826, South Korea (e-mail: djeon1@snu.ac.kr). ation, it removes and adds multiple hypergraphs and edges,
Digital Object Identifier 10.1109/TCAD.2022.3155444 also following an evolutionary algorithm. The transistor-based
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/
5170 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 41, NO. 12, DECEMBER 2022
two connection genes: VDD , VSS , and an internal node con- that produce a desired output. Once the algorithm finds prop-
nected by one P-channel MOSFET & N-channel MOSFET erly working circuit topologies, the scores related to the hard
pair [Fig. 2(a)]. The gate of each transistor is randomly con- constraints saturate and do not affect the fitness. The remain-
nected to the node except VDD and VSS . In each generation, der of the evolution process further modifies the topology to
evolution begins by calculating the fitness of the species in improve circuit performance.
the current population. The algorithm converts each offspring The topology generator employs various mutation functions
into a netlist and runs a SPICE simulation to calculate the so that it can cover a wide range of circuit topologies. Note
offspring’s fitness based on the observed functionality and that the nodes without any connection (i.e., floating nodes)
performance. Then, the fitness of the offsprings included in the can be generated as a result of mutation. Hence, we label the
species is averaged to obtain the species fitness. The number nodes with one or more connections as active nodes, and only
of offspring that can reproduce from each species to the next active nodes are selected for mutation. The types of mutations
generation is determined in proportion to the species fitness. are discussed as follows.
During circuit topology generation, simulations are performed 1) Add Connection: This mutation randomly chooses two
only at the typical (TT) corner to scan large search spaces and active nodes and connects them by adding a new edge. Since
find the best candidates quickly. an edge corresponds to a transistor in the actual circuit, it links
Before reproducing a new population, the algorithm the gate of the new edge to one of the existing active nodes
observes whether the fitness of the best offspring in each by updating the gate property.
species has been improved or not in the last few generations. 2) Add Node: This inserts a new node in one of the edges.
If the fitness of a species does not improve any further for a In other words, a single transistor is replaced with two stacked
certain number of generations, then the species is considered transistors. The gates of the stacked transistors are connected
stagnant, and offsprings of that species are removed from the to the same node to which the gate of the original transistor
population. The evolution process is independently performed was connected. This process is often used when designing
for each species. First, the offspring with the highest fitness a circuit to increase output resistance or minimize leakage
in each species is automatically included in the population of current.
the next generation. Next, a set of offspring with the highest 3) Add P-Channel MOSFET and N-Channel MOSFET:
fitness within each species is selected as a parent pool. Two Pair A P-channel MOSFET and N-channel MOSFET pair
offspring are randomly chosen from the pool and compared makes a new connection between VDD and VSS . If a sin-
with each other, where the winner evolves through mutation gle P-channel or N-channel MOSFET transistor is placed
and joins the population of the next generation. between VDD and VSS , this will be just a current leaking path.
The fitness function represents the performance and relia- Therefore, we place transistors as a pair of P-channel and
bility of a circuit as a single value. We consider two types of N-channel MOSFET devices when making a new connection
design constraints for fitness calculation: 1) hard constraints between the supply rails.
and 2) soft constraints. The hard constraints are the set of 4) Change Gate: The gate of a transistor is connected to a
design constraints that a circuit must satisfy (e.g., rail-to-rail different active node except for VDD and VSS nodes.
output swing for level shifters), whereas the soft constraints 5) Remove Connection: This mutation randomly removes
indicate the design quality (e.g., power consumption and con- one of the connections, which allows for removing transis-
version delay of level shifters). The fitness of an offspring at tors from the current topology. This prevents the circuit from
the xth generation is calculated as continuously becoming larger.
⎧ ⎫ 6) Change Size: The size of the connection genes repre-
⎨ ⎬ sents the relative size (strength) of a transistor. Since our goal
fitx = αi f qi,x + f qi,x αj f qj,x (1)
⎩ ⎭ is to quickly go through a variety of circuit topologies and
i∈H i∈H j∈S
find promising candidates, we define each transistor’s strength
where fitx is the calculated fitness of an offspring, qi,x is the in only three steps: 1) strong; 2) medium; and 3) weak.
observed performance of the circuit in SPICE simulations cor- During mutation, the transistor size randomly changes in each
responding to the ith constraint, f (qi,x ) is the score function connection gene independently.
for each constraint, αi is the weight of the ith constraint, and 7) Change Output Port: This mutation changes the location
H and S represent the sets of hard and soft constraints, respec- of the output port. One of the active nodes is selected as an
tively. This is similar to the reward function used in RL for output.
circuit optimization in [19], but our approach has two distinct In the original NEAT algorithm, each mutation function is
differences: 1) we use log(qi,x ) instead of qi,x for the scores randomly selected in each mutation. Hence, multiple types of
that have a large dynamic range and 2) the contribution of soft mutations may be performed simultaneously. However, this
constraints in the fitness is regulated by the scores related to may result in an excessive amount of change in a circuit. For
the hard constraints, instead of using a hyper-parameter manu- instance, removing a transistor from the circuit and changing
ally tuned for a specific type of circuit. In early generations, it the gate connection of another transistor would produce a cir-
is highly likely that most offspring would fail to function prop- cuit with entirely different characteristics. Hence, we limit the
erly. The scores related to the hard constraints would be very mutation process to select only one of the add, remove or gate
low, making the fitness largely dictated by the hard constraints. change mutations (mutations 1 through 5 above). In addition,
Hence, the algorithm focuses on finding feasible topologies other minor mutations (mutations 6 and 7) are independently
HONG et al.: AUTOMATIC CIRCUIT DESIGN FRAMEWORK FOR LEVEL SHIFTER CIRCUITS 5173
introduced with a certain probability. Let PaddNode , PaddCon , in continuous or high-dimensional action spaces. An agent
PaddPair , PchangeGate , and PrmCon denote the probability of collects and saves a sample into a replay memory. Then, a
mutations 1 through 5 above. Then, the mutation process minibatch is randomly selected from the replay memory to
follows the equation below: train the network. While PPO also has an actor–critic struc-
ture suitable for training in continuous or high-dimensional
PaddNode + PaddCon + PaddPair + PchangeGate + PrmCon = 1 (2) action space, it does not have a replay memory. Instead, N
During topology exploration, we do not want the algorithm agents collect samples in parallel during an episode which
to keep adding transistors indefinitely. Otherwise, the number consists of T time steps, and a minibatch is constructed using
of transistors in a circuit may explode, and the resulting cir- the collected samples and used for training the algorithm.
cuit would be far from what we desire. For instance, an ideal Then, all the samples are discarded. DDPG exhibits slower
analog amplifier or level shifter circuit typically has tens of convergence during training since it only uses one agent con-
transistors at most. Therefore, we balance the expected num- trary to PPO, but has the advantage of being able to reuse the
ber of removed and added transistors in each mutation by samples stored in the replay memory. PPO trains the model
enforcing the relationship as follows: more quickly by using multiple agents, but it only uses the
samples collected in the current episode for training, which
2PaddNode + PaddCon + 2PaddPair − PrmCon = 0 (3) reduces sample efficiency. In circuit optimization, samples
are obtained by running time-consuming SPICE simulations.
since adding a node (a net in the circuit) adds two transistors,
Therefore, it is crucial to maximize sample efficiency (i.e.,
whereas adding or removing a connection adds or removes a
reduce the number of samples required for algorithm con-
single transistor in the circuit.
vergence) to speed up the circuit optimization process. To
After a new generation is obtained by mutating all the off-
resolve this issue, we adopt distributed distributional DDPG
spring of the current generation, the newly generated offspring
(D4PG) [28] algorithm in the circuit optimizer. D4PG sup-
are grouped again into a set of species. Each offspring is com-
ports both multiagent training and sample reuse by using a
pared to the base models of existing species. If the number
replay memory. Unlike DDPG and PPO which express future
of differences in the connection genes is below the thresh-
rewards as a single scalar value, D4PG expresses rewards as
old for one or more existing species, then the offspring joins
a probability distribution. It models the inherent uncertainty
the closest species. Otherwise, the offspring constitutes a new
imposed by function approximation in a continuous environ-
species and becomes its base model. After the grouping pro-
ment, resulting in better gradients and improving the training
cess is done, the population undergoes another iteration of the
performance compared to DDPG. It also shows more stable
mutation process to obtain the next generation. This process
performance when multiple agents are used [28].
continues until it reaches the maximum number of generations
In the RL algorithms using actor–critic structure, two differ-
defined by the user.
ent neural networks are typically employed: an actor network
The topology generator selects candidate topologies both
and a critic network. The actor network takes a state vector
during and at the end of the evolution. When a stagnant species
as an input and produces an action vector, whereas the critic
is removed during evolution, the offspring with the best fitness
network takes state and action vectors as inputs and predicts
in that species is selected and added to the candidate list if
the reward value an agent is expected to receive as a result
it meets all the given design constraints. When the algorithm
of the current and future actions. The RL algorithm trains
finishes the last iteration, the same operation is performed on
those neural networks on the observed samples. As the com-
all the remaining species. Note that there may exist floating
plexity of the neural network increases with the dimension of
nodes and floating paths as a result of mutation. Before adding
input vectors, it is important to minimize the dimension of
an offspring to the candidate list, the topology generator finds
the input vector for faster optimization. Since the action vec-
and removes the floating nodes and paths.
tor represents relative size changes of all the transistors in the
circuit, its dimension is fixed. Hence, we aim to optimize the
B. Circuit Optimizer critic network by reducing the dimension of the state vector.
The topology generator is aimed at quickly finding promis- Specifically, we use the simulated circuit performance (e.g.,
ing circuit topologies. Hence, each transistor is only roughly power consumption and delay) and area as a state, instead of
sized during exploration (e.g., strong, medium, or weak). This feeding each transistor’s size or other characteristics (e.g., Vth ,
accelerates the search process by significantly limiting the Vsat , and μ0 ) as did in prior works [19]–[21]. Therefore, the
search space, but the size of each transistor must be further dimension of the state vector is independent of the number of
tuned for optimal performance. For this purpose, we employ an transistors in the topology and the optimization process can be
additional circuit optimizer as the second stage in the proposed efficiently accelerated when the target circuit topology consists
circuit design framework. of many transistors.
The circuit optimizer adopts a RL algorithm to optimize The actor network creates an action based on the state
candidate integrated circuits. Various RL algorithms have been obtained by SPICE simulations. An action represents a rel-
used for circuit optimization. L2DC [19] and GCN-RL [21] are ative change in the size (width, length, and multiplier) of each
based on deep deterministic policy gradient (DDPG) [26], and transistor. If the target topology has N transistors in total, the
AutoCkt [20] adopts proximal policy optimization (PPO) [27]. dimension of the action vector would be 3N. Each dimension
DDPG has an actor–critic structure and generally works well of the action vector has a value in [−1, 1]. Then, the amount
5174 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 41, NO. 12, DECEMBER 2022
TABLE I
network. However, RNN is typically hard to train due to the E XPERIMENTAL S ETUP FOR L EVEL S HIFTER D ESIGN
vanishing and exploding gradient problems [29]. Also, the
state is composed of the observed values (e.g., gm and Vth )
of each transistor, and the order is determined by the sig-
nal path of the circuit, necessitating manual examination of
the circuit topology. Instead, we use an MLP as the actor as
did in AutoCkt [20], where the specifications of topology are
combined into a state vector in an arbitrary order. Also, we
initialize the weights of the MLP following the method in [30].
While the circuit optimizer primarily focuses on maximizing
circuit performance, it is also very important to guarantee that
the circuit properly operates under process variations. Contrary
to prior works on circuit optimization [19]–[21], we run SPICE to digital circuits [22]. On the other hand, the internal opera-
simulations at five different process corners (TT, FF, SS, FS, tion is similar to that of analog circuits such as amplifiers. In
and SF). The optimizer constantly observes if the circuit meets experiments, we adopt the framework to design level shifter
the hard constraints at all corners during optimization. On the circuits in a 180-nm CMOS process, and the resulting circuits
contrary, the scores related to the soft constraints are only mea- are compared to prior designs reported in the literature.
sured at TT corner. This allows the circuit to exhibit maximum A level shifter circuit converts a low-voltage (VDDL ) digi-
performance at the corner of most concern while still guaran- tal signal to a high-voltage (VDDH ) signal. Level shifters must
teeing proper functionality in the worst cases. Note that the generate a rail-to-rail swing between the ground and VDDH at
Monte-Carlo analysis better captures the robustness of a circuit the output. Therefore, we use output signal swing as a hard
under process variation. However, since the size of each tran- constraint in the framework. Because level shifters are typi-
sistor continues to change during optimization, adopting the cally expected to operate with high conversion speed and low
Monte-Carlo analysis will require a large number of SPICE power consumption with minimal footprint [31], we use the
simulations in each time step, incurring a large time overhead. delay, total power (Ptotal ), static power (Pstatic ), and area as
On the contrary, the corner analysis requires only a few sim- soft constraints. The circuit area is calculated as the number
ulations for each design point and, hence, is more suitable for of transistors in the topology generator, whereas the circuit
fast optimization. optimizer uses the total active area.
Table I shows the score functions used in each step. The
scores related to the soft constraints are calculated as -log(qi,x)
III. E XPERIMENT R ESULT except for the area in the topology generator, whereas the score
In the previous section, we presented an unified circuit for the hard constraint (output swing) is calculated as the swing
design framework that automatically generates appropriate cir- observed in simulation divided by VDDH . In topology gener-
cuit topologies and further optimizes each design through ation, the area is calculated as the number of transistors in
finding an optimal size of each transistor. In this section, we the circuit. When the number of transistors exceeds a thresh-
experimentally verify the proposed circuit design framework. old (b in Table I), the score is divided by a slope which is a
By employing the framework to design level shifter circuits, hyperparamter. In the circuit optimizer, we use the worst val-
we demonstrate that the topology generator produces novel ues across all process corners when calculating the score for
level shifter topologies, and the circuit optimizer successfully the hard constraint. Soft constraint scores are obtained at the
improves the design. Finally, the resulting level shifter designs TT corner.
are fabricated and compared against prior arts designed by
human experts. All experiments are conducted on a work- B. Topology Generation
station running CentOS 7.4 with two Intel E5-2687W v4
The topology generator runs seven SPICE simulations in
processors, 128-GB DRAM, and an Nvidia GTX Titan X
parallel only at the TT corner for fast topology search. The
GPU. The topology generator only uses the processors whereas
input inverter of level-shifter is implemented using low thresh-
the circuit optimizer uses both the processors and GPU.
old voltage (Vth ) devices, whereas the other transistors are
standard Vth devices. We use a minimum-sized transistor with
A. Level Shifter Design 180-nm channel length and 220-nm channel width as a weak
We choose a level shifter circuit as a test vehicle for our device. Medium and strong devices have 2× and 4× larger
framework since it is an active research area where new circuit channel width, respectively. The initial population size is set
topologies are continuously developed. There are many differ- to 450, and the population evolves for 600 generations, which
ent topologies, and an optimal topology varies with the design takes approximately 5 h 30 min. In addition, we experiment
constraints [24]. Therefore, the effectiveness of our framework with varying the soft constraint weights in the fitness func-
that is capable of finding optimal circuit topologies could be tion to observe how the topology generator performs under
verified more clearly. In addition, level shifter circuits share different design constraints. The generated circuit topologies
common properties both with digital and analog circuits. For are displayed in Fig. 3. Specifically, three cases are tested: 1)
instance, level shifters operate on a rail-to-rail input signal all the constraints have the same weight [C1 in Fig. 3]; 2)
and produce a rail-to-rail output in a higher voltage, similar only the weight of static power is lowered (C2 and C3); and
5176 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 41, NO. 12, DECEMBER 2022
Fig. 3. Level shifter circuit topologies generated by topology generator (a) C1. (b) C2. (c) C3. (d) C4. (e) C5.
TABLE II
R ESULTS OF T OPOLOGY G ENERATION C. Circuit Optimization
In experiments, we use an MLP with three hidden layers
and 200 nodes in each layer as the actor network. The critic
network has the same structure but has two hidden layers.
First, we evaluate each of the proposed RL optimization tech-
niques using WCMLS circuit, which is widely adopted in level
shifter designs [22]–[24]. Experiments are performed using a
total of seven actors, where one of them is used to estimate
the performance of the optimization algorithm in real time
(evaluation actor). Thirty thousand SPICE simulations are run
across all the actors except the evaluation actor, which takes
2 h 20 min. Since the RL algorithm has some degree of ran-
domness, we test each configuration on three independent runs
to observe its reliability. Fig. 5 summarizes the experimen-
tal results. Fig. 5(a) shows that conventional D4PG fails to
converge in two of the three runs. However, when the mul-
tiupdate technique with U = 10 is applied, the algorithm
successfully finds a correct optimization direction and properly
biases transistors in the circuit after about 7000 SPICE sim-
(a) (b) ulations [Fig. 5(c)]. Fig. 5(e) shows the optimization results
when the episode early stopping method is also employed.
Fig. 4. Experimental results of topology generation. (a) Fitness. (b) Number Initially, an episode stops only after four steps, and the episode
of species. length increases by two after every five episodes in each
exploration agent until it reaches the maximum length of
20. This method reduces the number of SPICE simulations
3) the weight of delay is increased while the weights of static required to capture the bias points from 7000 to 5000, sug-
and total powers are decreased (C4 and C5). Table II sum- gesting that this technique accelerates RL training in the early
marizes the performance and fitness of the generated circuits stage. Note that the algorithm shows more fluctuation during
(C1–C5). Simulation results show that the circuits generated optimization when the early stopping method is adopted. We
with lower static power weight (C2 and C3) exhibit higher suspect that the conventional approach is exposed to more
static power than C1. In addition, the circuits optimized for “bad” samples, which are far from the initial nearly opti-
delay (C4 and C5) achieve lower delay than C1 at the expense mized design from the topology generator, in the early stages
of power consumption increases. of training. Those samples exhibit very low rewards as they
We also perform three independent runs of circuit topology do not meet the hard constraints. As a result, the actor is
generation to estimate algorithm stability. Fig. 4 shows the trained to be more conservative, and once the design enters
fitness and the number of species as the evolution proceeds. the near-optimal region where the hard constraints are sat-
The best fitness, which is the fitness of the best circuit in isfied, the algorithm tends to stay near that point only with
the population, rapidly increases in the first 7–9 generations, fine tuning to avoid a large drop in the reward value. On the
and then gradually improves through fine tuning of the circuit contrary, the episode early stopping method allows the design
topology. Note that the value of fitness is not capped at a fixed to enter the near-optimal region quickly, significantly reduc-
value. While the fitness of a circuit can have an arbitrary value, ing the number of bad samples during initial training. When
the generated circuits exhibit fitness values less than 30 in our the design approaches an optimal point during optimization,
experiments. The number of species is nearly constant during the algorithm now searches for better design points more
evolution except in the first few generations, suggesting that aggressively. In other words, the algorithm is less reluctant
stagnant species are replaced with a similar number of new to depart from the local optima, which helps find a global
species. optimum.
HONG et al.: AUTOMATIC CIRCUIT DESIGN FRAMEWORK FOR LEVEL SHIFTER CIRCUITS 5177
(c) (d)
(a) (b)
Fig. 7. Trends of output swing ratio when the circuit is optimized (a) at
TT corner only and (b) at all process corners. Considering process corners
significantly improves reliability.
TABLE III
E XPERIMENTS W ITH D IFFERENT W EIGHTS IN C IRCUIT O PTIMIZATION
(a)
(d) (e)
Fig. 10. Baseline level shifter designs from prior work. (a) B1 [22].
(b) B2 [23]. (c) B3 [34]. (d) B4 [35]. (e) B5 [36].
(b) (c)
Fig. 9. (a) Delay measurement circuit, (b) test chip layout, and (c) test chip (a) (b)
micrography.
Fig. 11. AND gates generated by topology generator.
TABLE V
M EASUREMENT R ESULTS OF G ENERATED C IRCUITS
[31] S. R. Hosseini, M. Saberi, and R. Lotfi, “A high-speed and power- Sunghoon Kim (Graduate Student Member, IEEE)
efficient voltage level shifter for dual-supply applications,” IEEE Trans. received the B.S. degree in electrical engineering
VLSI Syst., vol. 25, no. 3, pp. 1154–1158, Mar. 2017. from Seoul National University, Seoul, South Korea,
[32] R. Lotfi, M. Saberi, S. R. Hosseini, A. R. Ahmadi-Mehr, and in 2018, where he is currently the Ph.D. degree with
R. B. Staszewski, “Energy-efficient wide-range voltage level shifters the Graduate School of Convergence Science and
reaching 4.2 fJ/transition,” IEEE Solid-State Circuits Lett., vol. 1, no. 2, Technology.
pp. 34–37, Feb. 2018. His current research interests include radiation
[33] S. Kabirpour and M. Jalali, “A low-power and high-speed voltage level hardening circuit, mixed-signal machine learning
shifter based on a regulated cross-coupled pull-up network,” IEEE Trans. hardware, and computation in memory.
Circuits Syst. II, Exp. Briefs, vol. 66, no. 6, pp. 909–913, Jun. 2019.
[34] M. Lanuzza, P. Corsonello, and S. Perri, “Fast and wide range voltage
conversion in multisupply voltage designs,” IEEE Trans. Very Large
Scale Integr. (VLSI) Syst., vol. 23, no. 2, pp. 388–391, Feb. 2015.
[35] Y. Osaki, T. Hirose, N. Kuroki, and M. Numa, “A low-power level shifter
with logic error correction for extremely low-voltage digital CMOS
LSIs,” IEEE J. Solid-State Circuits, vol. 47, no. 7, pp. 1776–1783,
Jul. 2012.
[36] S. R. Hosseini, M. Saberi, and R. Lotfi, “A low-power subthreshold to
above-threshold voltage level shifter,” IEEE Trans. Circuits Syst. II, Exp.
Briefs, vol. 61, no. 10, pp. 753–757, Oct. 2014.
[37] B. A. Chappell et al., “Fast CMOS ECL receivers with 100-mV worst-
case sensitivity,” IEEE J. Solid-State Circuits, vol. 23, no. 1, pp. 59–67,
Feb. 1988.
[38] C. J. A. Gomez, H. Klimach, E. Fabris, and O. E. Mattia, “High PSRR
nano-watt MOS-only threshold voltage monitor circuit,” in Proc. IEEE
Symp. Integr. Circuits Syst. Design (SBCCI), 2015, pp. 1–6.
[39] A. Shankar, J. Silva-Martínez, and E. Sánchez-Sinencio, “A low voltage
operational transconductance amplifier using common mode feedforward
for high frequency switched capacitor circuits,” in Proc. IEEE Int. Symp.
Circuits Syst., vol. 1, 2001, pp. 643–646.