DOI: 10.1145/369028.369063

Profiling a parallel language based on fine-grained communication

Published: 17 November 1996

Abstract

Fine-tuning the performance of large parallel programs is difficult. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present a profiler for the fine-grained parallel programming language Split-C, which provides a simple global address space memory model. As our experience shows, it is much more challenging to profile programs that make use of efficient, low-overhead communication. We incorporated techniques that minimize profiling effects on the running program. We quantify the profiling overhead and present several Split-C applications that show the profiler is useful in determining performance bottlenecks.

Cited By

  • (2004) High-level data-access analysis for characterisation of (sub)task-level parallelism in Java. Ninth International Workshop on High-Level Parallel Programming Models and Supportive Environments, 2004. Proceedings, pp. 31-40. DOI: 10.1109/HIPS.2004.1299188
  • (2004) Design-Time Data-Access Analysis for Parallel Java Programs with Shared-Memory Communication Model. Euro-Par 2004 Parallel Processing, pp. 206-213. DOI: 10.1007/978-3-540-27866-5_26

Published In

Supercomputing '96: Proceedings of the 1996 ACM/IEEE conference on Supercomputing
November 1996
898 pages
ISBN: 0897918541

Publisher

IEEE Computer Society

United States

Author Tags

  1. Active Messages
  2. Parallel programming
  3. Split-C
  4. fine-grained communication
  5. performance analysis
  6. profiling

Qualifiers

  • Article

Conference

SC '96
Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%
