Improve Clock Tree Efficiency For Low Power Clock Tree Design
Improve Clock Tree Efficiency For Low Power Clock Tree Design
Improve Clock Tree Efficiency For Low Power Clock Tree Design
Abstract—Low power design is critical in today’s chip carefully with limited clock buffers, while clock skew
design. Clock tree takes much of chip power. “Clock may be not so critical sometimes. (* all “buffers” here
tree cost” is introduced to help design low power are similar for inverters)
clock tree. Five methods are proposed to reduce Based on this point, a new clock metric – “clock
“clock tree cost” and improve clock tree efficiency. tree cost (CTC)” is proposed to measure clock tree
They include clock sink depth check, redundant scan efficiency. CTC is defined as average clock buffer
mux check, redundant clock gating cell check, number per 100 leaf pins for one clock. The lower for
CCOPT (Clock Concurrent Optimization) and simple CTC, the higher for clock tree efficiency.
clock tree, and low threshold voltage tree. By these
ways, clock tree efficiency is improved and clock tree CTC = (buffer_numbers ÷ leaf_pin_numbers)× 100
power is reduced.
For a good and heathy clock tree, CTC is low,
Keywords—Low power, clock tree, efficiency which means clock tree can be built with a small number
of clock buffers. Then clock power is low due to less
switching power on clock buffers. On the other hand, if
1. Introduction CTC is high, reasons should be found if there is clock
Low power design is more and more important in tree specification issue or clock structure shortcoming.
today’s SoC design. Low power means longer battery
life, more reliability, less package cost, and there’s huge 3. Improve Clock Tree Efficiency For Low Power
demand for portable devices and IoT applications. Five ways are proposed to reduce “clock tree cost”
Much of chip power is occupied by clock tree, and improve clock tree efficiency for low power. Test
which always has high toggle rate. To reduce clock tree chips are based on CMOS 90nm ARM core MCUs.
power, one common method is clock gating. Clock is
gated when clock is not used or all clock leaf cells are in 3.1 Sink depth check
hold state. But even then, clock tree power is still high. In chip design, especially for MCU design, kinds of
Low power clock design is studied widely for years. different peripherals are connected to bus. This makes
Some are for new clock gating methods, such as low efficiency for clock tree synthesis. In ideal clock
data-driven gating cell to improve clock gating structure, clock sink pin depth is same (Fig 1). This
efficiency [1]. Some are for new clock gating cell design, makes a very high clock tree efficiency. But in real chip
which reduces power of gating cell itself [2]. Some are (Fig 1), different sink pin depth reduces clock tree
for clock gating cell split, merge and distribution efficiency.
algorithm [3] [4]. And some are for register placement
and merge methodology [5].
In this paper, clock tree efficiency is proposed
when doing low power clock tree design. The rest of
paper is as follows. Section 2 introduces “clock tree
cost” to measure clock tree efficiency. Section 3 is main
part, which presents five ways to improve clock tree
efficiency and power. Section 4 gives a summary.