DOI:10.1145/1391469.1391693
research-article

Temperature management in multiprocessor SoCs using online learning

Published: 08 June 2008

Abstract

In deep submicron circuits, thermal hot spots and high temperature gradients increase cooling costs and degrade reliability and performance. In this paper, we propose a low-cost temperature management strategy for multicore systems that reduces the adverse effects of hot spots and temperature variations. Our technique uses online learning to select, from a given set of expert policies, the best policy for the current workload characteristics. Compared with the best-performing expert, we achieve average decreases of 20% and 60% in the frequency of hot spots and thermal cycles, respectively, and reduce the spatial gradients to below 5%.
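The abstract does not spell out the learning rule, so the following is only a minimal sketch of one common way to choose among expert policies online: a multiplicative-weights update. It is not presented as the paper's algorithm. The names `select_policy`, `update_weights`, `run`, the learning rate `eta`, and the per-interval loss values are all hypothetical, and the sketch assumes that a loss in [0, 1] can be observed for every expert at each scheduling interval (for example, some normalized measure of thermal violations).

```python
import math
import random


def select_policy(weights, rng):
    """Sample an expert index with probability proportional to its weight."""
    total = sum(weights)
    r = rng.random() * total
    acc = 0.0
    for i, w in enumerate(weights):
        acc += w
        if r < acc:
            return i
    return len(weights) - 1  # guard against floating-point round-off


def update_weights(weights, losses, eta=0.5):
    """Multiplicative-weights step: scale each weight by exp(-eta * loss),
    then renormalize so the weights sum to 1."""
    scaled = [w * math.exp(-eta * l) for w, l in zip(weights, losses)]
    total = sum(scaled)
    return [w / total for w in scaled]


def run(loss_trace, eta=0.5, seed=0):
    """Run the selector over a trace of per-interval loss vectors.

    loss_trace[t][i] is the (hypothetical) loss of expert i in interval t.
    Returns the final weights and the expert chosen in each interval.
    """
    rng = random.Random(seed)
    n = len(loss_trace[0])
    weights = [1.0 / n] * n  # start with no preference among experts
    chosen = []
    for losses in loss_trace:
        chosen.append(select_policy(weights, rng))  # act with current weights
        weights = update_weights(weights, losses, eta)  # then learn from losses
    return weights, chosen


if __name__ == "__main__":
    # Hypothetical trace: expert 1 consistently incurs the lowest loss.
    trace = [[0.9, 0.1, 0.9]] * 20
    final_weights, picks = run(trace)
    print(final_weights, picks)
```

Under these assumptions, experts that repeatedly incur low loss accumulate weight and are selected more often, while the selector can still shift toward a different expert if the workload, and hence the losses, change.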


Published In

DAC '08: Proceedings of the 45th annual Design Automation Conference
June 2008
993 pages
ISBN:9781605581156
DOI:10.1145/1391469
  • General Chair: Limor Fix
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. multiprocessor
  2. online learning
  3. thermal management

Qualifiers

  • Research-article

Funding Sources

  • University of California MICRO

Conference

DAC '08

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Cited By

  • (2023) Dependable DNN Accelerator for Safety-Critical Systems: A Review on the Aging Perspective. IEEE Access 11 (89803-89834). DOI:10.1109/ACCESS.2023.3300376. Online publication date: 2023
  • (2023) A Survey of Machine Learning for Network-on-Chips. Journal of Parallel and Distributed Computing (104778). DOI:10.1016/j.jpdc.2023.104778. Online publication date: Nov-2023
  • (2021) A Survey of Thermal Management in Cloud Data Centre: Techniques and Open Issues. Wireless Personal Communications. DOI:10.1007/s11277-020-08039-x. Online publication date: 15-Jan-2021
  • (2020) DS3: A System-Level Domain-Specific System-on-Chip Simulation Framework. IEEE Transactions on Computers (1-1). DOI:10.1109/TC.2020.2986963. Online publication date: 2020
  • (2020) Thermo-Mechanical Coupling Induced Performance Degradation in Storage Systems. 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) (152-161). DOI:10.1109/CCGrid49817.2020.00-78. Online publication date: May-2020
  • (2019) A Survey of Prediction and Classification Techniques in Multicore Processor Systems. IEEE Transactions on Parallel and Distributed Systems 30:5 (1184-1200). DOI:10.1109/TPDS.2018.2878699. Online publication date: 1-May-2019
  • (2019) Temperature-aware core management in MPSoCs: modelling and evaluation using MRMs. IET Computers & Digital Techniques. DOI:10.1049/iet-cdt.2018.5131. Online publication date: 20-Aug-2019
  • (2018) Reliable power and time-constraints-aware predictive management of heterogeneous exascale systems. Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (187-194). DOI:10.1145/3229631.3239368. Online publication date: 15-Jul-2018
  • (2018) Sustainability-Oriented Evaluation and Optimization for MPSoC Task Allocation and Scheduling under Thermal and Energy Variations. IEEE Transactions on Sustainable Computing 3:2 (84-97). DOI:10.1109/TSUSC.2017.2723500. Online publication date: 1-Apr-2018
  • (2018) TheSPoT: Thermal Stress-Aware Power and Temperature Management for Multiprocessor Systems-on-Chip. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37:8 (1532-1545). DOI:10.1109/TCAD.2017.2768417. Online publication date: Aug-2018
