Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Power Limitations and Dark Silicon Challenge the Future of Multicore

Published: 01 August 2012 Publication History

Abstract

Since 2004, processor designers have increased core counts to exploit Moore’s Law scaling, rather than focusing on single-core performance. The failure of Dennard scaling, to which the shift to multicore parts is partially a response, may soon limit multicore scaling just as single-core scaling has been curtailed. This paper models multicore scaling limits by combining device scaling, single-core scaling, and multicore scaling to measure the speedup potential for a set of parallel workloads for the next five technology generations. For device scaling, we use both the ITRS projections and a set of more conservative device scaling parameters. To model single-core scaling, we combine measurements from over 150 processors to derive Pareto-optimal frontiers for area/performance and power/performance. Finally, to model multicore scaling, we build a detailed performance model of upper-bound performance and lower-bound core power. The multicore designs we study include single-threaded CPU-like and massively threaded GPU-like multicore chip organizations with symmetric, asymmetric, dynamic, and composed topologies. The study shows that regardless of chip organization and topology, multicore scaling is power limited to a degree not widely appreciated by the computing community. Even at 22 nm (just one year from now), 21% of a fixed-size chip must be powered off, and at 8 nm, this number grows to more than 50%. Through 2024, only 7.9× average speedup is possible across commonly used parallel workloads for the topologies we study, leaving a nearly 24-fold gap from a target of doubled performance per generation.

References

[1]
Amdahl, G. M. 1967. Validity of the single processor approach to achieving large-scale computing capabilities. In Proceedings of the AFIPS Conference.
[2]
Azizi, O., Mahesri, A., Lee, B. C., Patel, S. J., and Horowitz, M. 2010. Energy-performance tradeoffs in processor architecture and circuit design: A marginal cost analysis. In Proceedings of the International Symposium on Computer Architecture (ISCA).
[3]
Bakhoda, A., Yuan, G. L., Fung, W. W. L., Wong, H., and Aamodt, T. M. 2009. Analyzing CUDA workloads using a detailed GPU simulator. In Proceedings of the 2009 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).
[4]
Bhadauria, M., Weaver, V., and McKee, S. 2009. Understanding PARSEC performance on contemporary CMPs. In Proceedings of the IEEE International Symposium on Workload Characterization (IISWC).
[5]
Bienia, C., Kumar, S., Singh, J. P., and Li, K. 2008. The PARSEC benchmark suite: Characterization and architectural implications. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT).
[6]
Borkar, S. 2010. The exascale challenge. In Proceedings of the International Symposium on VLSI Design, Automation and Test (VLSI-DAT).
[7]
Chakraborty, K. 2008. Over-provisioned multicore systems. Ph.D. thesis, University of Wisconsin-Madison.
[8]
Cho, S. and Melhem, R. 2008. Corollaries to Amdahl’s law for energy. Comput. Arch. Lett. 7, 1.
[9]
Chung, E. S., Milder, P. A., Hoe, J. C., and Mai, K. 2010. Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPUs? In Proceedings of MICRO.
[10]
Dennard, R. H., Gaensslen, F. H., Rideout, V. L., Bassous, E., and LeBlanc, A. R. 1974. Design of ion-implanted mosfet’s with very small physical dimensions. IEEE J. Solid-State Circ. 9.
[11]
Dennard, R. H., Cai, J., and Kumar, A. 2007. A perspective on today’s scaling challenges and possible future directions. Solid-State Electron. 5, 4, 518--525.
[12]
Esmaeilzadeh, H., Cao, T., Yang, X., Blackburn, S. M., and McKinley, K. S. 2011. Looking back on the language and hardware revolutions: Measured power, performance, and scaling. In Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’11). ACM.
[13]
Guz, Z., Bolotin, E., Keidar, I., Kolodny, A., Mendelson, A., and Weiser, U. C. 2009. Many-core vs. many-thread machines: Stay away from the valley. IEEE Comput. Arch. Lett. 8.
[14]
Hardavellas, N., Ferdman, M., Falsafi, B., and Ailamaki, A. 2011. Toward dark silicon in servers. IEEE Micro 31, 4.
[15]
Hempstead, M., Wei, G.-Y., and Brooks, D. 2009. Navigo: An early-stage model to study power-contrained architectures and specialization. In Proceedings of MoBS.
[16]
Hill, M. D. and Marty, M. R. 2008. Amdahl’s law in the multicore era. Computer 41, 7.
[17]
Horowitz, M., Alon, E., Patil, D., Naffziger, S., Kumar, R., and Bernstein, K. 2005. Scaling, power, and the future of CMOS. In Proceedings of the 2005 International Electron Devices Meeting (IEDM).
[18]
Ipek, E., Kirman, M., Kirman, N., and Martinez, J. F. 2007. Core fusion: Accommodating software diversity in chip multiprocessors. In Proceedings of the International Symposium on Computer Architecture (ISCA).
[19]
ITRS. 2011. International technology roadmap for semiconductors, 2010 update.
[20]
Kim, C., Sethumadhavan, S., Govindan, M. S., Ranganathan, N., Gulati, D., Burger, D., and Keckler, S. W. 2007. Composable lightweight processors. In Proceedings of MICRO.
[21]
Lee, J.-G., Jung, E., and Shin, W. 2009. An asymptotic performance/energy analysis and optimization of multi-core architectures. In Proceedings of the International Conference on Distributed Computing and Networking (ICDCN).
[22]
Lee, V. W., Kim, E., Chhugani, J., Deisher, M., Kim, D., Nguyen, A. D., Satish, N., Smelyanskiy, M., Chennupaty, S., Hammarlund, P., Singhal, R., and Dubey, P. 2010. Debunking the 100X GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU. In Proceedings of the 37th Annual International Symposium on Computer Architecture (ISCA’10). ACM, New York, 451--460.
[23]
Loh, G. 2008. The cost of uncore in throughput-oriented many-core processors. In Proceedings of the Workshop on Architectures and Languages for Troughput Applications (ALTA).
[24]
Moore, G. E. 1965. Cramming more components onto integrated circuits. Electronics 38, 8.
[25]
Nose, K. and Sakurai, T. 2000. Optimization of VDD and VTH for low-power and high speed applications. In Proceedings of the Asia and South Pacific Design Automation Conference (ASP-DAC).
[26]
Pollack, F. 1999. New microarchitecture challenges in the coming generations of CMOS process technologies. In Proceedings of MICRO.
[27]
SPEC. 2011. Standard performance evaluation corporation.
[28]
Suleman, A. M., Mutlu, O., Qureshi, M. K., and Patt, Y. N. 2009. Accelerating critical section execution with asymmetric multi-core architectures. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS).
[29]
Venkatesh, G., Sampson, J., Goulding, N., Garcia, S., Bryksin, V., Lugo-Martinez, J., Swanson, S., and Taylor, M. B. 2010. Conservation cores: Reducing the energy of mature computations. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS).
[30]
Woo, D. H. and Lee, H.-H. S. 2008. Extending Amdahl’s law for energy-efficient computing in the many-core era. Computer 41, 12.

Cited By

View all
  • (2024)An innovative manifold microchannel heat sink for thermal management of high-power chips with multiple hotspotsNumerical Heat Transfer, Part A: Applications10.1080/10407782.2024.2377695(1-22)Online publication date: 14-Jul-2024
  • (2024)RETRACTED: An improved hybrid network‐on‐chip with flexible topology and frugal routingThe Journal of Engineering10.1049/tje2.123952024:6Online publication date: 11-Jun-2024
  • (2022)Energy Efficient Computing Systems: Architectures, Abstractions and Modeling to Techniques and StandardsACM Computing Surveys10.1145/351109454:11s(1-37)Online publication date: 9-Sep-2022
  • Show More Cited By

Index Terms

  1. Power Limitations and Dark Silicon Challenge the Future of Multicore

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Computer Systems
      ACM Transactions on Computer Systems  Volume 30, Issue 3
      August 2012
      97 pages
      ISSN:0734-2071
      EISSN:1557-7333
      DOI:10.1145/2324876
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 August 2012
      Accepted: 01 May 2012
      Received: 01 March 2012
      Published in TOCS Volume 30, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Dark silicon
      2. modeling
      3. multicore
      4. power
      5. technology scaling

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)31
      • Downloads (Last 6 weeks)5
      Reflects downloads up to 02 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)An innovative manifold microchannel heat sink for thermal management of high-power chips with multiple hotspotsNumerical Heat Transfer, Part A: Applications10.1080/10407782.2024.2377695(1-22)Online publication date: 14-Jul-2024
      • (2024)RETRACTED: An improved hybrid network‐on‐chip with flexible topology and frugal routingThe Journal of Engineering10.1049/tje2.123952024:6Online publication date: 11-Jun-2024
      • (2022)Energy Efficient Computing Systems: Architectures, Abstractions and Modeling to Techniques and StandardsACM Computing Surveys10.1145/351109454:11s(1-37)Online publication date: 9-Sep-2022
      • (2021)Performance Analysis of Impinging Chip-Attached Micro Pin Fin Direct Liquid Cooling Package for Hotspot Targeted Applications2021 20th IEEE Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (iTherm)10.1109/ITherm51669.2021.9503295(220-228)Online publication date: 1-Jun-2021
      • (2020)Cross-Layer Reliability, Energy Efficiency, and Performance Optimization of Near-Threshold Data PathsJournal of Low Power Electronics and Applications10.3390/jlpea1004004210:4(42)Online publication date: 3-Dec-2020
      • (2020)Offloading strategies for Stencil kernels on the KNC Xeon Phi architectureInternational Journal of High Performance Computing Applications10.1177/109434201773835234:2(199-207)Online publication date: 17-Jun-2020
      • (2020)3D-ReGACM Journal on Emerging Technologies in Computing Systems10.1145/337569916:2(1-24)Online publication date: 30-Jan-2020
      • (2020)Improving Performance and Energy Consumption in Embedded Systems via Binary Acceleration: A SurveyACM Computing Surveys10.1145/336976453:1(1-36)Online publication date: 6-Feb-2020
      • (2020)Temperature-Driven Gate Geometry Effects in Nanoscale Cryogenic MOSFETsIEEE Electron Device Letters10.1109/LED.2020.298428041:5(661-664)Online publication date: May-2020
      • (2020)Compiler-Directed Parallelism Scaling Framework for Performance Constrained Energy OptimizationIEEE Access10.1109/ACCESS.2019.29615688(1733-1754)Online publication date: 2020
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media