Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3409390.3409399acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Exploiting Dynamism in HPC Applications to Optimize Energy-Efficiency

Published: 17 August 2020 Publication History

Abstract

The growing need for computational performance is resulting in an increase in the energy consumption of HPC systems, which is a major challenge to reach Exascale computing. To overcome this challenge, we developed a tuning plugin that targets applications that exhibit dynamically changing characteristics between iterations of the time loop as well as change in the control flow within the time loop itself. To analyze the inter-loop dynamism, we propose features to characterize the behaviour of loops for clustering via DBSCAN and spectral clustering. To save tuning time and costs, we implemented a random search strategy with a Gaussian probability distribution model to test a large number of system configurations in a single application run. The goal is to select the best configurations of the CPU and uncore frequencies for groups of similarly behaving loops, as well as individual instances of regions called within these loops based on their unique computational characteristics. During production runs, the configurations are dynamically switched for different code regions. The results of our experiments for two highly dynamic real-world applications highlight the effectiveness of our methodology in optimizing energy-efficiency.

References

[1]
[n.d.]. Highly Accurate Finite Element Simulation for Sheet Metal Forming. http://gns-mbh.com/en/produkte/indeed/.
[2]
[n.d.]. Performance Application Programming Interface. http://icl.cs.utk.edu/papi/.
[3]
Bilge Acun, Kavitha Chandrasekar, and Laxmikant Kalé. 2019. Fine-Grained Energy Efficiency Using Per-Core DVFS with an Adaptive Runtime System. 1–8.
[4]
Axel Auweter, Arndt Bode, Matthias Brehm, Luigi Brochard, Nicolay Hammer, Herbert Huber, Raj Panda, Francois Thomas, and Torsten Wilde. 2014. A Case Study of Energy Aware Scheduling on SuperMUC. In Supercomputing, Julian Martin Kunkel, Thomas Ludwig, and Hans Werner Meuer(Eds.). Springer International Publishing, 394–409.
[5]
Francis R. Bach and Michael I. Jordan. 2006. Learning Spectral Clustering, With Application To Speech Separation. J. Mach. Learn. Res. 7 (Dec. 2006), 1963–2001.
[6]
Prasanna Balaprakash, Jack Dongarra, Todd Gamblin, Mary Hall, Jeffrey K. Hollingsworth, Boyana Norris, and Richard W. Vuduc. 2018. Autotuning in High-Performance Computing Applications. Proc. IEEE 106, 11 (Nov 2018), 2068–2083.
[7]
Alexander Breuer and Michael Bader. 2012. Teaching Parallel Programming Models on a Shallow-Water Code. 2012 11th International Symposium on Parallel and Distributed Computing (2012), 301–308.
[8]
J. Chang, K.B. Nakshatrala, M.G. Knepley, and L. Johnsson. [n.d.]. A performance spectrum for parallel computational frameworks that solve PDEs. Concurrency and Computation: Practice and Experience 30, 11([n. d.]), e4401.
[9]
Ryan Cochran, Can Hankendi, Ayse Coskun, and Sherief Reda. 2011. Identifying the Optimal Energy-Efficient Operating Points of Parallel Workloads. In Proceedings of the International Conference on Computer-Aided Design (San Jose, California) (ICCAD ’11). IEEE Press, 608–615.
[10]
Jonathan Eastep, Steve Sylvester, Christopher Cantalupo, Brad Geltz, Federico Ardanaz, Asma Al-Rawi, Kelly Livingston, Fuat Keceli, Matthias Maiterth, and Siddhartha Jana. 2017. Global Extensible Open Power Manager: A Vehicle for HPC Community Collaboration on Co-Designed Energy Management Solutions. 394–412.
[11]
Ester, Martin and Kriegel, Hans-Peter and Sander, Jörg and Xu, Xiaowei. 1996. A Density-based Algorithm for Discovering Clusters a Density-based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (Portland, Oregon) (KDD’96). AAAI Press, 226–231. http://dl.acm.org/citation.cfm?id=3001460.3001507
[12]
Vincent W. Freeh and David K. Lowenthal. 2005. Using Multiple Energy Gears in MPI Programs on a Power-Scalable Cluster. In Proceedings of the Tenth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Chicago, IL, USA) (PPoPP ’05). Association for Computing Machinery, 164–173.
[13]
Michael Gerndt, Eduardo César, and Siegfried Benkner (Eds.). 2015. Automatic Tuning of HPC Applications - The Periscope Tuning Framework. Shaker Verlag, Aachen. ISBN: 978-3-8440-3517-9.
[14]
D. Hackenberg, R. Schöne, T. Ilsche, D. Molka, J. Schuchart, and R. Geyer. 2015. An Energy Efficiency Feature Survey of the Intel Haswell Processor. In Parallel and Distributed Processing Symposium Workshop.
[15]
Parham Haririan. 2020. DVFS and Its Architectural Simulation Models for Improving Energy Efficiency of Complex Embedded Systems in Early Design Phase. Computers 9(2020).
[16]
Yoshihiko Hotta, Mitsuhisa Sato, Hideaki Kimura, Satoshi Matsuoka, Taisuke Boku, and Daisuke Takahashi. 2006. Profile-Based Optimization of Power Performance by Using Dynamic Voltage Scaling on a PC Cluster. In Proceedings of the 20th International Conference on Parallel and Distributed Processing (Rhodes Island, Greece) (IPDPS’06). IEEE Computer Society, 298.
[17]
A. Knüpfer, C. Rössel, D. an Mey, S. Biersdorff, K. Diethelm, D. Eschweiler, M. Geimer, M. Gerndt, D. Lorenz, A. D. Malony, W. E. Nagel, Y. Oleynik, P. Philippen, P. Saviankou, D. Schmidl, S. S. Shende, R. Tschüter, M. Wagner, B. Wesarg, and F. Wolf. 2012. Score-P: A Joint Performance Measurement Run-time Infrastructure for Periscope, Scalasca, TAU, and Vampir. In Tools for High Performance Computing 2011, H. Brunst, M. Müller, W. E. Nagel, and M. M. Resch (Eds.). Springer, 79–91.
[18]
Madhura Kumaraswamy, Anamika Chowdhury, and Michael Gerndt. 2017. Design-Time Analysis for the READEX Tool Suite. In Parallel Computing is Everywhere, Proceedings of the International Conference on Parallel Computing, ParCo 2017, 12-15 September 2017, Bologna, Italy. 307–316.
[19]
M. Kumaraswamy and M. Gerndt. 2018. Leveraging Inter-Phase Application Dynamism for Energy-Efficiency Auto-tuning. In PDPTA’18: The 24th International Conference on Parallel and Distributed Processing Techniques and Applications. 132–138. https://csce.ucmss.com/cr/books/2018/ConferenceReport?ConferenceKey=PDP.
[20]
Ulrike Luxburg. 2007. A Tutorial on Spectral Clustering. Statistics and Computing 17, 4 (Dec. 2007), 395–416.
[21]
Aniruddha Marathe, Peter E. Bailey, David K. Lowenthal, Barry Rountree, Martin Schulz, and Bronis R. de Supinski. 2015. A Run-Time System for Power-Constrained HPC Applications. In High Performance Computing, Julian M. Kunkel and Thomas Ludwig (Eds.). 394–408.
[22]
Oliver Meister. 2016. Sierpinski Curves for Parallel Adaptive Mesh Refinement in Finite Element and Finite Volume Methods. Dissertation. Technische Universität München, München.
[23]
Ao Mo-Hellenbrand. 2019. Resource-Aware and Elastic Parallel Software Development for Distributed-Memory HPC Systems. Dissertation. Technische Universität München, München.
[24]
Andrew Y. Ng, Michael I. Jordan, and Yair Weiss. 2001. On Spectral Clustering: Analysis and an Algorithm. In Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic (Vancouver, British Columbia, Canada) (NIPS’01). MIT Press, 849–856.
[25]
Y. Oleynik, M. Gerndt, J. Schuchart, P. G. Kjeldsberg, and W. E. Nagel. 2015. Run-Time Exploitation of Application Dynamism for Energy-Efficient Exascale Computing (READEX). In Computational Science and Engineering (CSE), 2015 IEEE 18th International Conference on, C. Plessl, D. El Baz, G. Cong, J. M. P. Cardoso, L. Veiga, and T. Rauber (Eds.). IEEE, 347–350.
[26]
Barry Rountree, David K. Lowenthal, Bronis R. de Supinski, Martin Schulz, Vincent W. Freeh, and Tyler Bletsch. 2009. Adagio: Making DVS Practical for Complex HPC Applications. In Proceedings of the 23rd International Conference on Supercomputing(ICS ’09). Association for Computing Machinery, 460–469.
[27]
Jörg Sander, Martin Ester, Hans-Peter Kriegel, and Xiaowei Xu. 1998. Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications. Data Min. Knowl. Discov. 2, 2 (June 1998), 169–194.
[28]
Robert Schöne and Daniel Molka. 2014. Integrating Performance Analysis and Energy Efficiency Optimizations in a Unified Environment. Comput. Sci. 29, 3–4 (Aug. 2014), 231–239.
[29]
Mohammed Sourouri, Espen Birger Raknes, Nico Reissmann, Johannes Langguth, Daniel Hackenberg, Robert Schöne, and Per Gunnar Kjeldsberg. 2017. Towards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core Architectures. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Denver, Colorado) (SC ’17). Association for Computing Machinery.

Cited By

View all
  • (2023)Imitation Learning-Based Performance-Power Trade-Off Uncore Frequency Scaling Policy for Multicore SystemSensors10.3390/s2303144923:3(1449)Online publication date: 28-Jan-2023
  • (2022)PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS53621.2022.00126(1272-1282)Online publication date: May-2022
  • (2022)Optimizing Energy Efficiency of Node.js Applications with CPU DVFS Awareness2022 IEEE 13th International Green and Sustainable Computing Conference (IGSC)10.1109/IGSC55832.2022.9969367(1-8)Online publication date: 24-Oct-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP Workshops '20: Workshop Proceedings of the 49th International Conference on Parallel Processing
August 2020
186 pages
ISBN:9781450388689
DOI:10.1145/3409390
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Dynamic Voltage and Frequency Scaling
  2. Uncore Frequency Scaling
  3. autotuning
  4. energy-efficiency
  5. search space optimization

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

ICPP Workshops '20
ICPP Workshops '20: Workshops
August 17 - 20, 2020
AB, Edmonton, Canada

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)6
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Imitation Learning-Based Performance-Power Trade-Off Uncore Frequency Scaling Policy for Multicore SystemSensors10.3390/s2303144923:3(1449)Online publication date: 28-Jan-2023
  • (2022)PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS53621.2022.00126(1272-1282)Online publication date: May-2022
  • (2022)Optimizing Energy Efficiency of Node.js Applications with CPU DVFS Awareness2022 IEEE 13th International Green and Sustainable Computing Conference (IGSC)10.1109/IGSC55832.2022.9969367(1-8)Online publication date: 24-Oct-2022

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media