Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/581630.581632acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
Article

A case for dynamic pipeline scaling

Published: 08 October 2002 Publication History

Abstract

Energy consumption can be reduced by scaling down frequency when peak performance is not needed. A lower frequency permits slower circuits, and hence a lower supply voltage. Energy reduction comes from voltage reduction, a technique called Dynamic Voltage Scaling (DVS).This paper makes the case that the useful frequency range of DVS is limited because there is a lower bound on voltage. Lowering frequency permits voltage reduction until the lowest voltage is reached. Beyond that point, lowering frequency further does not save energy because voltage is constant.However, there is still opportunity for energy reduction outside the influence of DVS. If frequency is lowered enough, pairs of pipeline stages can be merged to form a shallower pipeline. The shallow pipeline has better instructions-per-cycle (IPC) than the deep pipeline. Since energy also depends on IPC, energy is reduced for a given frequency. Accordingly, we propose Dynamic Pipeline Scaling (DPS). A DPS-enabled deep pipeline can merge adjacent pairs of stages by making the intermediate latches transparent and disabling corresponding feedback paths. Thus, a DPS-enabled pipeline has a deep mode for higher frequencies within the influence of DVS, and a shallow mode for lower frequencies. Shallow mode extends the frequency range for which energy reduction is possible. For frequencies outside the influence of DVS, a DPS-enabled deep pipeline consumes from 23% to 40% less energy than a rigid deep pipeline.

References

[1]
D. H. Albonesi. Dynamic IPC/Clock Rate Optimization. 25th Intel Symp. on Computer Architecture, June 1998.]]
[2]
D. H. Albonesi. Selective Cache Ways: On-Demand Cache Resource Allocation. 32nd Intel Symp. on Microarchitecture, Nov. 1999.]]
[3]
Alpha 21264 Microprocessor Hardware Reference Manual http://ftp.digital.com/pub/Digital/info/semiconductor/literature/21264hrm.pdf]]
[4]
R. Balasubramonian, D. H. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory Hierarchy Reconfiguration for Energy and Performance in General-Purpose Processor Architectures. 33rd Intel Symp. on Microarchitecture, Dec. 2000.]]
[5]
D. Brooks, V. Tiwari, and M. Martonosi. Wattch: A Framework for Architectural-Level Power Analysis and Optimizations. 27th Intel Symp. on Computer Architecture, June 2000.]]
[6]
M. D. Brown, J. Stark, and Y. N. Patt. Select-Free Instruction Scheduling Logic. 34th Intel Symp. on Microarchitecture, Dec. 2001.]]
[7]
K.Bult. Analog Design in Deep Sub-Micron CMOS. 26th European Solid State Circuits Conference, Sep. 2000.]]
[8]
T.D. Burd and R.W. Brodersen. Energy Efficient CMOS Microprocessor Design. Proceedings of the Annual HICSS Conference, Jan. 1995.]]
[9]
D. Burger, T. M. Austin, and S. Bennet. Evaluating Future Microprocessors: The Simplescalar Tool Set. Tech. Rep. CS-TR-96-1308, Univ. Of Wisc.-Madison, July 1996.]]
[10]
A. Buyuktosunoglu, S. Schuster, D. Brooks, P. Bose, P. Cook, and D. H. Albonesi. An Adaptive Issue Queue for Reduced Power at High Performance. Workshop on Power-Aware Computer Systems, Nov. 2000.]]
[11]
A. Buyuktosunoglu, S. Schuster, D. Brooks, P. Bose, P. Cook, and D.H. Albonesi. A Circuit Level Implementation of an Adaptive Issue Queue for Power-Aware Microprocessors. 11th Great Lakes Symposium on VLSI, Mar. 2001.]]
[12]
Crusoe™ Processor Model TM5800 Product Brief http://www.transmeta.com/pdf/specifications/ productbrief_tm5800_05jul01.pdf]]
[13]
Crusoe™ Processor Model TM5400 Product Brief http://www.transmeta.com/technology/specifications/tm5400.html]]
[14]
J. L. Gonzalez, X. Aragones, F. Moll, and A. Rubio. Scaling Trends for Delta Noise in Sub-Micron CMOS Technologies. Design of Integrated Circuits and Systems, Sevilla, Spain, Nov.1996.]]
[15]
J. L. Gonzalez and A. Rubio. Delta-I Noise Scaling in Sub-Micron CMOS Technologies. Workshop on Signal Propagation on Interconnects, Travemunde, Germany, May 1997.]]
[16]
K. Govil, E Chan, and H. Wasserman. Comparing Algorithms for Dynamic Speed-Setting of a Low-Power CPU. 1st Intel Conf. on Mobile Computing and Networking, Nov. 1995.]]
[17]
D. Grunwald, P. Levis, C Morrey III, M. Neufeld, and K. Farkas. Policies for Dynamic Clock Scheduling. Symp. on Operating Systems Design and Implementation, Oct. 2000.]]
[18]
R. Ho, K. W. Mai, and M. A. Horowitz. The Future of Wires. Proceedings of the IEEE, 89(4):490--504, April 2001.]]
[19]
I. Hong, M. Potkonjak, and M. Srivastava. On-line Scheduling of Hard Real-Time Tasks on Variable Voltage Processors. Intel Conf. on Computer-Aided Design, Nov. 1998.]]
[20]
M. S. Hrishikesh, N. Jouppi, K. I. Farkas, D. Burger, S. W. Keckler, and P. Shvakumar. The Optimum Logic Depth Per Pipeline Stage. 29th Intel Symp. on Computer Architecture, May 2002.]]
[21]
D. Kaeli and P. Emma. Branch History Table Prediction of Moving Target Branches due to Subroutine Returns. 18th Intel Symp. on Computer Architecture, May 1991.]]
[22]
Y. Lee and C. Krishna. Voltage Clock Scaling for Low Energy Consumption in Real-Time Embedded Systems. 6th Intel Conf. on Real-Time Computing Systems and Applications, Dec. 1999.]]
[23]
J. Lorch and A. J. Smith. Improving Dynamic Voltage Scaling Algorithms with PACE. Proceedings of the ACM SIGMETRICS 2001 Conf., June 2001.]]
[24]
S. Manne, A. Klauser, and D. Grunwald. Pipeline Gating: Speculation Control for Energy Reduction. 25th Intel Symp. on Computer Architecture, June 1998.]]
[25]
S. McFarling. Combining Branch Predictors. Technical Report WRL-TN-36, Compaq-WRL, June 1993.]]
[26]
T. Pering, T. Burd, and R. Brodersen. The Simulation of Dynamic Voltage Scaling Algorithms. Symposium on Low Power Electronics, Oct. 2000.]]
[27]
M. Weiser, B. Welch, A. Demers, and S. Shenker. Scheduling for Reduced CPU Energy. 1st Symp. on Operating Systems Design and Implementation, Nov. 1994.]]

Cited By

View all
  • (2021)Extending Performance-Energy Trade-offs Via Dynamic Core ScalingIEEE Transactions on Computers10.1109/TC.2020.302930670:11(1875-1886)Online publication date: 1-Nov-2021
  • (2016)Maximizing Energy Efficiency in NTC by Variation-Aware Microprocessor Pipeline OptimizationProceedings of the 2016 International Symposium on Low Power Electronics and Design10.1145/2934583.2934635(272-277)Online publication date: 8-Aug-2016
  • (2016)AnyCore: A synthesizable RTL model for exploring and fabricating adaptive superscalar cores2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS.2016.7482096(214-224)Online publication date: Apr-2016
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CASES '02: Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems
October 2002
324 pages
ISBN:1581135750
DOI:10.1145/581630
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 October 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clock gating
  2. configurable pipeline
  3. dynamic voltage scaling
  4. fetch gating
  5. power and energy management
  6. shallow and deep pipelines
  7. variable-depth pipeline

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 52 of 230 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Extending Performance-Energy Trade-offs Via Dynamic Core ScalingIEEE Transactions on Computers10.1109/TC.2020.302930670:11(1875-1886)Online publication date: 1-Nov-2021
  • (2016)Maximizing Energy Efficiency in NTC by Variation-Aware Microprocessor Pipeline OptimizationProceedings of the 2016 International Symposium on Low Power Electronics and Design10.1145/2934583.2934635(272-277)Online publication date: 8-Aug-2016
  • (2016)AnyCore: A synthesizable RTL model for exploring and fabricating adaptive superscalar cores2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS.2016.7482096(214-224)Online publication date: Apr-2016
  • (2015)An energy-delay product study on chip multi-processors for variable stage pipeliningHuman-centric Computing and Information Sciences10.1186/s13673-015-0046-x5:1Online publication date: 21-Sep-2015
  • (2014)A comparative simulation study on the power---performance of multi-core architectureThe Journal of Supercomputing10.1007/s11227-014-1263-170:1(465-487)Online publication date: 1-Oct-2014
  • (2013)Energy Optimization using Fine-Grain Variable Stages Pipeline Processor ChipInternational Journal of Networking and Computing10.15803/ijnc.3.2_1923:2(192-204)Online publication date: 2013
  • (2013)Design and evaluation of fine‐grain‐mode transition method based on dynamic memory access analysing for variable stages pipeline processorIET Computers & Digital Techniques10.1049/iet-cdt.2012.00677:1(41-47)Online publication date: Jan-2013
  • (2013)Low Power Compiler Optimization for Pipelining ScalingAdvances in Intelligent Systems and Applications - Volume 210.1007/978-3-642-35473-1_63(637-646)Online publication date: 2013
  • (2012)Design and evaluation of variable stages pipeline processor with low-energy techniquesIET Computers & Digital Techniques10.1049/iet-cdt.2011.00276:1(43)Online publication date: 2012
  • (2011)A fine-grained runtime power/performance optimization method for processors with adaptive pipeline depthJournal of Computer Science and Technology10.5555/1991856.199186626:2(292-301)Online publication date: 1-Mar-2011
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media