Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1450095.1450125acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
research-article

Multiple sleep mode leakage control for cache peripheral circuits in embedded processors

Published: 19 October 2008 Publication History

Abstract

This paper proposes a combination of circuit and architectural techniques to maximize leakage power reduction in embedded processor on-chip caches. It targets cache peripheral circuits, which according to recent studies account for a considerable amount of cache leakage. At circuit level, we propose a novel design with multiple sleep modes for cache peripherals. Each mode represents a trade-off between leakage reduction and wakeup delay. Architectural control is proposed to decide "when and how" to use these different low-leakage modes using cache miss information to guide its action. This control is based on simple state machines that do not impact area or power consumption and can thus be used even in the resource constrained processors. Experimental results indicate that proposed techniques can keep the L1 cache peripherals in one of the low-power modes for more than 85% of total execution time, on average. This translates to an average leakage power reduction of 50% for 65nm technology. The DL1 cache energy-delay product is reduced, on average, by 20%.

References

[1]
D. Nicolaescu et al,. Fast Speculative Address Generation and Way Caching for Reducing L1 Data Cache Energy. Proc. IEEE ICCD, 2006.
[2]
H. Homayoun and A. Veidenbaum, ZZ-HVS: Zig-Zag Horizontal and Vertical Sleep Transistor Sharing to Reduce Leakage Power in On-Chip SRAM Peripheral Circuits. Proc. IEEE ICCD, 2008.
[3]
K.-S. Min et al., "Zigzag super cut-off CMOS (ZSCCMOS) block activation with self-adaptive voltage level controller: an alternative to clock-gating scheme in leakage dominant era," ISSCC 2003.
[4]
M. Horiguchi et al., "Switched-source-impedence CMOS circuit for low-standby subthreshold current giga-scale LSI's," Symp. VLSI circuits Dig. Tech. Papers, pp. 47--48, 1993.
[5]
Y. Meng, T. Sherwood, and R. Kastner. On the limits of leakage power reduction in caches. In HPCA-11, 2005.
[6]
W. Zhang and J. S. Hu. Compiler-directed instruction cache leakage optimization. In Proc. In MICRO-35, 2002.
[7]
J. M. Rabaey et al., Digital integrated circuits: a design perspective, Prentice Hall, Second. Edition, 2003.
[8]
Y. Takeyama et al,. A Low Leakage SRAM Macro with Replica Cell Biasing Scheme. IEEE Journal Of Solid- State Circuits, Vol. 41, No. 4, April 2006.
[9]
A Sub-lW to 2W Low-Power IA Processor for Mobile Internet Devices and Ultra-Mobile PCs in 45nm Hi-K Metal Gate CMOS, in ISSCC-2008.
[10]
J. C. Park, V. J. Mooney III. Sleepy stack leakage reduction. IEEE Trans. VLSI Syst. 14(11): 1250--1263 (2006).
[11]
SimpleScalar4 tutorial, http://www.simplescalar.com/tutorial.html.
[12]
K. Nii et al., A 90-nm low-power 32 KByte embedded SRAM with gate leakage suppression circuit for mobile applications, IEEE J. Solid-State Circuits, vol. 39, Apr. 2004.
[13]
M.D. Powell et al,. Gated Vdd: A circuit technique to reduce leakage in deep-submicron cache memories. in Proc. IEEE ISLPED, 2000 .
[14]
A. Agarawal et al., DRG-Cache: A data retention gated-ground cache for low Power, DAC 2002.
[15]
K. Agarwal, H. Deogun, D. Sylvester, K. Nowka. Power gating with multiple sleep modes. In ISQED 2006.
[16]
K. Nii, et al. A low power SRAM using auto-backgate-controlled MT-CMOS. In ISLPED, 1998, pp. 293--298.
[17]
M. Mamidipaka, K. S. Khouri, N. Dutt and M. S. Abadir, Analytical models for leakage power estimation of memory array structures. CODES+ISSS 2004.
[18]
B S. Amrutur et al,. Speed and power scaling of SRAMs, IEEE Journal of Solid State Circuits. Feb 2000, vol. 35.
[19]
S. Kaxiras et al,. Cache decay: exploiting generational behavior to reduce cache leakage power. IEEE-ISCA, 2001.
[20]
B.S. Amrutur, et al., A replica technique for wordline and sense control in low-power SRAM's, IEEE Journal of Solid-State Circuits, vol. 33, No. 8, Aug.2000.
[21]
K. Flautner et al,. Drowsy caches: simple techniques for reducing leakage power. IEEE ISCA, 2002.
[22]
. Thoziyoor, N. Muralimanohar, J. H. Ahn, and N P. Jouppi "CACTI 5.1 Technical Report" HP Laboratories, Palo Alto, April 2, 2008.
[23]
M. Powell et al,. Gated-Vdd: A Circuit Technique to Reduce Leakage in Deep-Submicron Cache Memories. IEEE-ISLPED 2000.
[24]
H. Homayoun and A. Veidenbaum, Reducing Leakage Power in Peripheral Circuit of L2 Caches, In Proc. IEEE Intl. Conference on Computer Design (ICCD 2007), Lake Tahoe, Oct. 2007.
[25]
"ARM11 MPCore Processor Revision: r1p0" Technical Reference Manual, infocenter.arm.com/ help/ topic/ com.arm.doc.ddi0360e/DDI0360E_arm11_mpcore_r1p0_trm.pdf
[26]
MiBench Version 1.0. http://www.eecs.umich.edu/mibench/.
[27]
J. Montanaro, et al., "A 160-MHz, 32-b, 0.5-W CMOS RISC microprocessor", IJSSC, November 1996.
[28]
J. C. Ku, S. Ozdemir, G. Memik and Y. Ismail,"Power Density Minimization for Highly-Associative Caches in Embedded Processors", GLSVLSI 2006.
[29]
S. Rusu et al,. A 65-nm Dual-Core Multithreaded Xeon® Processor With 16-MB L3 Cache, IEEE Journal Of Solid-State Circuits, VOL. 42, 2007.
[30]
S. Chun and K. Skadron, "On-Demand Solution to Minimize I-Cache Leakage Energy with Maintaining Performance", Transaction on Computers, 2008.
[31]
C. H. Kim et al,. A forward body-biased low-leakage SRAM cache: device, circuit and architecture considerations. IEEE Trans. on VLSI Systems, vol. 13, 2005 .
[32]
J. Kao, S. Narendra, and A. Chandrakasan, "MTCMOS hierarchical sizing based on mutual exclusive discharge patterns," DAC, June 1998.

Cited By

View all
  • (2020)Runtime Performance Optimization of 3-D Microprocessors in Dark SiliconIEEE Transactions on Computers10.1109/TC.2020.3015711(1-1)Online publication date: 2020
  • (2018)Enhancing Power, Performance, and Energy Efficiency in Chip Multiprocessors Exploiting Inverse Thermal DependenceIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2017.278080026:4(778-791)Online publication date: Apr-2018
  • (2016)A New Optimal Algorithm for Energy Saving in Embedded System With Multiple Sleep ModesIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2015.241482724:2(706-719)Online publication date: Feb-2016
  • Show More Cited By

Index Terms

  1. Multiple sleep mode leakage control for cache peripheral circuits in embedded processors

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CASES '08: Proceedings of the 2008 international conference on Compilers, architectures and synthesis for embedded systems
      October 2008
      274 pages
      ISBN:9781605584690
      DOI:10.1145/1450095
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 October 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. cache
      2. embedded processor
      3. leakage power
      4. multiple sleep mode
      5. peripheral circuits

      Qualifiers

      • Research-article

      Conference

      ESWEEK 08
      ESWEEK 08: Fourth Embedded Systems Week
      October 19 - 24, 2008
      GA, Atlanta, USA

      Acceptance Rates

      Overall Acceptance Rate 52 of 230 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 07 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)Runtime Performance Optimization of 3-D Microprocessors in Dark SiliconIEEE Transactions on Computers10.1109/TC.2020.3015711(1-1)Online publication date: 2020
      • (2018)Enhancing Power, Performance, and Energy Efficiency in Chip Multiprocessors Exploiting Inverse Thermal DependenceIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2017.278080026:4(778-791)Online publication date: Apr-2018
      • (2016)A New Optimal Algorithm for Energy Saving in Embedded System With Multiple Sleep ModesIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2015.241482724:2(706-719)Online publication date: Feb-2016
      • (2015)Runtime Thermal Management for 3-D Chip-Multiprocessors With Hybrid SRAM/MRAM L2 CacheIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2014.231179823:3(520-533)Online publication date: Mar-2015
      • (2014)Runtime 3-D stacked cache data management for energy minimization of 3-D chip-multiprocessorsFifteenth International Symposium on Quality Electronic Design10.1109/ISQED.2014.6783325(197-204)Online publication date: Mar-2014
      • (2014)Temperature-aware runtime power management for chip-multiprocessors with 3-D stacked cacheFifteenth International Symposium on Quality Electronic Design10.1109/ISQED.2014.6783320(163-170)Online publication date: Mar-2014
      • (2013)Elastic pagingACM SIGMETRICS Performance Evaluation Review10.1145/2494232.247978141:1(349-350)Online publication date: 17-Jun-2013
      • (2013)Elastic pagingProceedings of the ACM SIGMETRICS/international conference on Measurement and modeling of computer systems10.1145/2465529.2479781(349-350)Online publication date: 17-Jun-2013
      • (2013)Runtime 3-D stacked cache management for chip-multiprocessorsInternational Symposium on Quality Electronic Design (ISQED)10.1109/ISQED.2013.6523592(68-72)Online publication date: Mar-2013
      • (2012)Variation trained drowsy cache (VTD-Cache)IEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2011.210652320:4(630-642)Online publication date: 1-Apr-2012
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media