Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3472456.3472516acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application

Published: 05 October 2021 Publication History

Abstract

Heterogeneity is part of HPC infrastructures, not only at the intra-node but at the system level. Applications with multiple phases with distinct resource necessities can take advantage of this inter-node heterogeneity to improve performance and reduce resource idleness. Such an application is ExaGeoStat, a task-based machine learning framework specifically designed for geostatistics data. This work presents strategies to efficiently distribute multi-phase applications in system-level heterogeneous resources. We both (1) improve application phase overlap by optimizing runtime and scheduling decisions and (2) compute the optimal distribution for all the phases using a linear program leveraging node heterogeneity while limiting communication overhead. The performance gains of our phase overlap improvements are between 36% and 50% compared to the original base synchronous and homogeneous execution. We show that by adding some slow nodes to a homogeneous set of fast nodes, we can improve the performance by another 25% compared to a standard block-cyclic distribution, thereby harnessing any machine.

References

[1]
Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, and David E. Keyes. 2018. ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems. IEEE Transactions on Parallel and Distributed Systems 29, 12 (2018), 2771–2784.
[2]
Emmanuel Agullo 2010. Faster, Cheaper, Better – a Hybridization Methodology to Develop Linear Algebra Software for GPUs. In GPU Computing Gems, Wen mei W. Hwu (Ed.). Vol. 2. Morgan Kaufmann.
[3]
Cédric Augonnet 2011. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. Conc. Comp.: Pract. Exp., SI: EuroPar’09 23 (2011), 187–198.
[4]
Olivier Beaumont, Vincent Boudet, Fabrice Rastello, and Yves Robert. 2001. Matrix multiplication on heterogeneous platforms. IEEE Trans. Parallel Distributed Systems 12, 10 (2001), 1033–1051.
[5]
Olivier Beaumont, Arnaud Legrand, Fabrice Rastello, and Yves Robert. 2001. Static LU Decomposition on Heterogeneous Platforms. Int. Journal of High Performance Computing Applications 15 (2001), 310–323.
[6]
Laura S. Blackford 1997. ScaLAPACK User’s Guide. Society for Industrial and Applied Mathematics, USA.
[7]
George Bosilca 2013. PaRSEC: Exploiting Heterogeneity to Enhance Scalability. Computing in Science Engineering 15, 6 (2013), 36–45.
[8]
Alexandre Denis. 2019. Scalability of the NewMadeleine Communication Library for Large Numbers of MPI Point-to-Point Requests. In 19th IEEE/ACM Int. Symposium in Cluster, Cloud, and Grid Computing. IEEE, Cyprus, 371–380.
[9]
Jack J Dongarra 2017. With Extreme Computing, the Rules Have Changed. Computing in Science & Engineering 19, 3 (2017), 52.
[10]
Jack J Dongarra, Hans W Meuer, Erich Strohmaier, 1997. TOP500 supercomputer sites. Supercomputer 13(1997), 89–111.
[11]
Alejandro Duran, Eduard Ayguadé, Rosa M Badia, Jesús Labarta, Luis Martinell, Xavier Martorell, and Judit Planas. 2011. OmpSs: a proposal for programming heterogeneous multi-core architectures. Paral. Proces. Letters 21 (2011), 173–193.
[12]
Vinícius Garcia Pinto 2018. A visual performance analysis framework for task-based parallel applications running on hybrid clusters. Concurrency and Computation: Practice and Experience 30, 18(2018), e4472.
[13]
Mark Gates 2019. SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library. In Proceedings of the Int. Conference for High Performance Computing, Networking, Storage and Analysis. ACM, United States, 18 pages.
[14]
Robert B. Gramacy. 2020. Surrogates: Gaussian Process Modeling, Design, and Optimization for the Applied Sciences. CRC Press, United States.
[15]
Julien Herrmann 2016. Assessing the cost of redistribution followed by a computational kernel: Complexity and performance results. Par. Comp. 52(2016).
[16]
Alexey Kalinov and Alexey Lastovetsky. 2001. Heterogeneous Distribution of Computations Solving Linear Algebra Problems on Networks of Heterogeneous Computers. J. of Par. and Distr. Comp. 61, 4 (2001), 520.
[17]
Lucas Leandro Nesi, Lucas Mello Schnorr, and Arnaud Legrand. 2020. Communication-Aware Load Balancing of the LU Factorization over Heterogeneous Clusters. In 26th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2020. IEEE, Hong Kong, 54–63.
[18]
Loïc Prylli and Bernard Tourancheau. 1996. Efficient block cyclic data redistribution. In Euro-Par’96 Parallel Processing, Luc Bougé et al. (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 155–164.
[19]
Eduardo Roloff, Matthias Diener, Luciano Gaspary, and Philippe Navaux. 2019. Exploring Instance Heterogeneity in Public Cloud Providers for HPC Applications. In The 9th Intl. Conf. on Cloud Comp. and Services Sci.SciTePress, 210–222.
[20]
Luka Stanisic 2015. Faithful Performance Prediction of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures. Concurrency and Computation: Practice and Experience 27, 16(2015), 4075–4090.

Cited By

View all
  • (2023)Summarizing task-based applications behavior over many nodes through progression clustering2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)10.1109/PDP59025.2023.00014(35-42)Online publication date: Mar-2023
  • (2022)Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS53621.2022.00042(357-367)Online publication date: May-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '21: Proceedings of the 50th International Conference on Parallel Processing
August 2021
927 pages
ISBN:9781450390682
DOI:10.1145/3472456
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Load Balancing
  2. Partitioning
  3. Scheduling
  4. Task-Based

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES)
  • Brazilian National Council for Scientific and Technological Development (CNPq)
  • FAPERGS

Conference

ICPP 2021

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)3
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Summarizing task-based applications behavior over many nodes through progression clustering2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)10.1109/PDP59025.2023.00014(35-42)Online publication date: Mar-2023
  • (2022)Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS53621.2022.00042(357-367)Online publication date: May-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media