Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/762761.762771acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article

Active harmony: towards automated performance tuning

Published: 16 November 2002 Publication History

Abstract

In this paper, we present the Active Harmony automated runtime tuning system. We describe the interface used by programs to make applications tunable. We present the Library Specification Layer which helps program library developers expose multiple variations of the same API using different algorithms. The Library Specification Language helps to select the most appropriate program library to tune the overall performance. We also present the optimization algorithm used to adjust parameters in the application and the libraries. Finally, we present results that show how the system is able to tune several real applications. The automated tuning system is able to tune the application parameters to within a few percent of the best value after evaluating only 11 out of over 1,700 possible configurations.

References

[1]
Berman, F. and R. Wolski. Scheduling from the perspective of the application. in Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing. 1996. Syracuse, NY, USA 6--9 Aug. 1996.
[2]
Borovikov, E., A. Sussman, and L. Davis. An Efficient System for Multi-Perspective Imaging and Volumetric Shape Analysis. in Workshop on Parallel and Distributed Computing in Image Processing, Video Processing Multimedia (PDIVM'20001). 2001: IEEE Computer Society Press.
[3]
Dongarra, J. and e. al., LAPACK - Linear Algebra PACKage.
[4]
Foster, I. and C. Kesselman, eds. The Grid: Blueprint for a New Computing Infrastructure. 1998, Morgan-Kaufmann: San Francisco.
[5]
Gailly, J.-l. and M. Adler, zlib - A Massively Spiffy Yet Delicately Unobtrusive Compression Library.
[6]
Geist, A.G., J.A. Kohl, and P.M. Papadopoulos, CUMULVS: Providing Fault tolerance, Visualization, and Seering of Parallel Applications. International Journal of Supercomputer Applications and High Performance Computing, 1997. 11(3): p. 224--35.
[7]
Gu, W., et al. Falcon: On-line Monitoring and Steering of Large-Scale Parallel Programs. in Frontiers '95. 1995. McLean, VA: IEEE Press.
[8]
Hollingsworth, J.K. and P.J. Keleher. Prediction and Adaptation in Active Harmony. in The 7th International Symposium on High Performance Distributed Computing. 1998. Chicago.
[9]
Keleher, P.J., J.K. Hollingsworth, and D. Perkovic. Exposing Application Alternatives. in ICDCS. 1999. Austin, TX.
[10]
Kurc, T., et al. Querying Very Large Multidimensional Datasets in ADR. in Proceedings of SC99. 1999. Orlando, FL: ACM Press.
[11]
Lagarias, J.C., et al., Convergence properties of the Nelder-Mead simplex method in low dimensions. SIAM Journal for Optimizations. 9(1): p. 112--147.
[12]
Nelder, J.A. and R. Mead, A Simplex Methd for Function Minimization. Comput. J., 1965. 7(4): p. 308--313.
[13]
Okamoto, T., LHa for UNIX 1.1.4i. 2000.
[14]
Osterhout, J.K. Tcl: An Embeddable Command Language. in USENIX Winter Conf. 1990.
[15]
Parker, S.G. and C.R. Johnson. SCIRun: a scientific programming environment for computational steering. in Supercomputing'95. 1995. San Diego.
[16]
Reed, D.A., et al. The next frontier: interactive and closed loop performance steering. in ICPP Workshop on Challenges for Parallel Process. 1996. Bloomingdale, Ill.
[17]
Ribler, R.L., H. Simitci, and D.A. Reed, The Autopilot Performance-Directed Adaptive Control System. Future Generation Computer Systems, special issue (Performance Data Mining), 2001. 18(1): p. 175--187.
[18]
Ribler, R.L., et al. Autopilot: Adaptive Control of Distributed Applications. in High Performance Distributed Computing. 1998. Chicago, IL.
[19]
Whaley, R.C. and J.J. Dongarra. Automatically tuned linear algebra software (ATLAS). in Super-computing. 1998. Orlando, FL.
[20]
Wolski, R. Forecasting Network Performance to Support Dynamic Scheduling Using the Network Weather Service. in High Performance Distributed Computing (HPDC). 1997. Portland, Oregon: IEEE Press.
[21]
Ziv, J. and A. Lempel, A Universal Algorithm for Sequential Data Compression. IEEE Transactions on Information Theory. 23(3): p. 337--343.

Cited By

View all
  • (2022)A Survey of Binary Code Fingerprinting Approaches: Taxonomy, Methodologies, and FeaturesACM Computing Surveys10.1145/348686055:1(1-41)Online publication date: 17-Jan-2022
  • (2019)Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime SystemsProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337915(1-10)Online publication date: 5-Aug-2019
  • (2019)Massively Parallel Automated Software TuningProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337908(1-10)Online publication date: 5-Aug-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing
November 2002
952 pages
ISBN:076951524X

Sponsors

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 16 November 2002

Check for updates

Qualifiers

  • Article

Conference

SC '02
Sponsor:

Acceptance Rates

SC '02 Paper Acceptance Rate 67 of 230 submissions, 29%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)A Survey of Binary Code Fingerprinting Approaches: Taxonomy, Methodologies, and FeaturesACM Computing Surveys10.1145/348686055:1(1-41)Online publication date: 17-Jan-2022
  • (2019)Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime SystemsProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337915(1-10)Online publication date: 5-Aug-2019
  • (2019)Massively Parallel Automated Software TuningProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337908(1-10)Online publication date: 5-Aug-2019
  • (2019)Efficient hierarchical online-autotuningProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3330377(354-366)Online publication date: 26-Jun-2019
  • (2019)An Autotuning Protocol to Rapidly Build AutotunersACM Transactions on Parallel Computing10.1145/32915275:2(1-25)Online publication date: 4-Jan-2019
  • (2018)Unconventional Parallelization of Nondeterministic ApplicationsACM SIGPLAN Notices10.1145/3296957.317318153:2(432-447)Online publication date: 19-Mar-2018
  • (2018)Dynamic Tuning of OpenMP Memory Bound Applications in Multisocket Systems using MATEWorkshop Proceedings of the 47th International Conference on Parallel Processing10.1145/3229710.3229748(1-10)Online publication date: 13-Aug-2018
  • (2018)Bootstrapping Parameter Space Exploration for Fast TuningProceedings of the 2018 International Conference on Supercomputing10.1145/3205289.3205321(385-395)Online publication date: 12-Jun-2018
  • (2018)A Survey on Compiler Autotuning using Machine LearningACM Computing Surveys10.1145/319797851:5(1-42)Online publication date: 18-Sep-2018
  • (2018)An Auto-Tuning Framework for a NUMA-Aware Hessenberg Reduction AlgorithmCompanion of the 2018 ACM/SPEC International Conference on Performance Engineering10.1145/3185768.3186304(5-8)Online publication date: 2-Apr-2018
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media