Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1383422.1383447acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

Harmony: an execution model and runtime for heterogeneous many core systems

Published: 23 June 2008 Publication History

Abstract

The emergence of heterogeneous many core architectures presents a unique opportunity for delivering order of magnitude performance increases to high performance applications by matching certain classes of algorithms to specifically tailored architectures. Their ubiquitous adoption, however, has been limited by a lack of programming models and management frameworks designed to reduce the high degree of complexity of software development intrinsic to heterogeneous architectures. This paper proposes Harmony, a runtime supported programming and execution model that provides: (1) semantics for simplifying parallelism management, (2) dynamic scheduling of compute intensive kernels to heterogeneous processor resources, and (3) online monitoring driven performance optimization for heterogeneous many core systems. We are particulably concerned with simplifying development and ensuring binary portability and scalability across system configurations and sizes. Initial results from ongoing development demonstrate the binary compatibility with variable number of cores, as well as dynamic adaptation of schedules to data sets. We present preliminary results of key features for some benchmark applications.

References

[1]
V. T. Barry Minor, Gordon Fossum, "Terrain rendering engine (tre)," tech. rep., IBM, 2005.
[2]
J. Suh, E.-G. Kim, S. Crago, L. Srinivasan, and M. French, "A performance analysis of pim, stream processing, and tiled processing on memory-intensive signal processing kernels," Computer Architecture, 2003. Proceedings. 30th Annual International Symposium on, pp. 410--419, 9-11 June 2003.
[3]
M. Gordon, W. Thies, and S. Amarasinghe, "Exploiting coarse-grained task, data, and pipeline parallelism in stream programs," in International Conference on Architectural Support for Programming Languages and Operating Systems, (San Jose, CA), Oct. 2006.
[4]
K. Fatahalian, D. R. Horn, T. J. Knight, L. Leem, M. Houston, J. Y. Park, M. Erez, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan, "Sequoia: programming the memory hierarchy," in SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, (New York, NY, USA), p. 83, ACM, 2006.
[5]
M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng, "Merge: a programming model for heterogeneous multi-core systems," in ASPLOS XIII: Proceedings of the 13th international conference on Architectural support for programming languages and operating systems, (New York, NY, USA), pp. 287--296, ACM, 2008.

Cited By

View all
  • (2023)Regular Composite Resource Partitioning and Reconfiguration in Open SystemsACM Transactions on Embedded Computing Systems10.1145/360942422:5(1-29)Online publication date: 26-Sep-2023
  • (2022)Scheduling of multiple network packet processing applications using PythiaComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2022.109006212:COnline publication date: 20-Jul-2022
  • (2021)Device HoppingACM Transactions on Architecture and Code Optimization10.1145/347190918:4(1-25)Online publication date: 29-Sep-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HPDC '08: Proceedings of the 17th international symposium on High performance distributed computing
June 2008
252 pages
ISBN:9781595939975
DOI:10.1145/1383422
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 June 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dependency graph
  2. gpgpu
  3. harmony
  4. heterogeneous
  5. many core
  6. optimization
  7. performance monitoring
  8. runtime
  9. scheduling

Qualifiers

  • Research-article

Conference

HPDC '08
Sponsor:

Acceptance Rates

Overall Acceptance Rate 166 of 966 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)2
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Regular Composite Resource Partitioning and Reconfiguration in Open SystemsACM Transactions on Embedded Computing Systems10.1145/360942422:5(1-29)Online publication date: 26-Sep-2023
  • (2022)Scheduling of multiple network packet processing applications using PythiaComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2022.109006212:COnline publication date: 20-Jul-2022
  • (2021)Device HoppingACM Transactions on Architecture and Code Optimization10.1145/347190918:4(1-25)Online publication date: 29-Sep-2021
  • (2020)A Survey on Parallel Architectures and Programming Models2020 43rd International Convention on Information, Communication and Electronic Technology (MIPRO)10.23919/MIPRO48935.2020.9245341(999-1005)Online publication date: 28-Sep-2020
  • (2020)He..ro DB: A Concept for Parallel Data Processing on Heterogeneous HardwareArchitecture of Computing Systems – ARCS 202010.1007/978-3-030-52794-5_7(82-96)Online publication date: 9-Jul-2020
  • (2019)Predictable Data-Driven Resource Management: an Implementation using Autoware on Autonomous Platforms2019 IEEE Real-Time Systems Symposium (RTSS)10.1109/RTSS46320.2019.00038(339-352)Online publication date: Dec-2019
  • (2019)Using meta-heuristics and machine learning for software optimization of parallel computing systemsComputing10.1007/s00607-018-0614-9101:8(893-936)Online publication date: 1-Aug-2019
  • (2019)Performance Evaluation of Two Load Balancing Algorithms for Hybrid ClustersWater Governance and Management in India10.1007/978-3-030-15996-2_9(119-131)Online publication date: 26-Mar-2019
  • (2018)A Review of Machine Learning and Meta-heuristic Methods for Scheduling Parallel Computing SystemsProceedings of the International Conference on Learning and Optimization Algorithms: Theory and Applications10.1145/3230905.3230906(1-6)Online publication date: 2-May-2018
  • (2018)Minimizing Thermal Variation in Heterogeneous HPC Systems with FPGA Nodes2018 IEEE 36th International Conference on Computer Design (ICCD)10.1109/ICCD.2018.00086(537-544)Online publication date: Oct-2018
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media