Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

A Scalable Debugger for Massively Parallel Message-Passing Programs

Published: 01 June 1994 Publication History

Abstract

In a message-passing program, there are at least as many threads as processors, and the programmer must deal with large numbers of them on a massively parallel machine. On our target machine, the CM-5, we had previously developed Prism, a programming environment that supports debugging, data visualization, and performance analysis of data-parallel programs. We discuss how our new version, Node Prism, extends Prism's capabilities for message-passing programs. It looks and feels like the data-parallel version, but it uses new methods for user-debugger interaction that promote greater understanding of parallel programs. It offers scalable expression, execution, and interpretation of all debugging operations, making it easier to debug and understand message-passing programs.

References

[1]
1. S. Sistare et al., "Data Visualization and Performance Analysis in the Prism Programming Environment," Programming Environments for Parallel Computing, N.P. Topham, R.N. Ibbett, and T. Bemmerl, eds., Elsevier Science Publishers, Amsterdam, 1992, pp. 37-52.
[2]
2. Prism User's Guide, Thinking Machines, Cambridge, Mass., 1994.
[3]
3. CMMD User's Guide, Chapter 6, Thinking Machines, Cambridge, Mass., 1993, pp. 39-58.
[4]
4. Using the Xtra Programming Environment, BBN Advanced Computers, Cambridge, Mass., 1991.
[5]
5. Convex CXdb User's Guide, Convex Press, Richardson, Tex., 1991.
[6]
6. S. Zimmerman, "UDB: A Parallel Debugger for the KSRI," Proc. Supercomputer Debugging Workshop '92, Los Alamos Nat'l Lab., Los Alamos, N.M., 1992, pp. 95-102.
[7]
7. iPSC/2 and iPSC/860 Interactive Parallel Debugger Mannual, Intel, Santa Clara, Calif., 1991.
[8]
8. C.M. Pancake, "Direct Manipulation Techniques for Parallel Debuggers," Proc. Supercomputer Debugging Workshop '92, Los Alamos Nat'l Lab., Los Alamos, N.M., 1992, pp. 179-208.
[9]
9. D. Breazeal et al., "A Parallel Software Monitor for Debugging and Performance Tools on Distributed-Memory Multicomputers," Proc. Supercomputer Debugging Workshop '92, Los Alamos Nat'l Lab., Los Alamos, N.M., 1992, pp. 221-238.

Cited By

View all
  • (2015)Debugging high-performance computing applications at massive scalesCommunications of the ACM10.1145/266721958:9(72-81)Online publication date: 24-Aug-2015
  • (2014)PGDBProceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment10.1145/2616498.2616535(1-7)Online publication date: 13-Jul-2014
  • (2014)Distributed debugging for mobile networksJournal of Systems and Software10.1016/j.jss.2013.11.109990:C(76-90)Online publication date: 1-Apr-2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Parallel & Distributed Technology: Systems & Technology
IEEE Parallel & Distributed Technology: Systems & Technology  Volume 2, Issue 2
June 1994
78 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 June 1994

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2015)Debugging high-performance computing applications at massive scalesCommunications of the ACM10.1145/266721958:9(72-81)Online publication date: 24-Aug-2015
  • (2014)PGDBProceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment10.1145/2616498.2616535(1-7)Online publication date: 13-Jul-2014
  • (2014)Distributed debugging for mobile networksJournal of Systems and Software10.1016/j.jss.2013.11.109990:C(76-90)Online publication date: 1-Apr-2014
  • (2012)A scalable infiniband network topology-aware performance analysis tool for MPIProceedings of the 18th international conference on Parallel processing workshops10.1007/978-3-642-36949-0_49(439-450)Online publication date: 27-Aug-2012
  • (2011)An approach to creating performance visualizations in a parallel profile analysis toolProceedings of the 2011 international conference on Parallel Processing - Volume 210.1007/978-3-642-29740-3_19(156-165)Online publication date: 29-Aug-2011
  • (2006)On-line automated performance diagnosis on thousands of processesProceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming10.1145/1122971.1122984(69-80)Online publication date: 29-Mar-2006
  • (2005)An approach to the development of debuggers that use semantics of constructs of parallel programsProgramming and Computing Software10.1007/s11086-005-0009-531:1(20-28)Online publication date: 1-Jan-2005
  • (2004)PDBProceedings of the 5th IEEE/ACM International Workshop on Grid Computing10.1109/GRID.2004.49(260-265)Online publication date: 8-Nov-2004
  • (1999)MPI support in the Prism programming environmentProceedings of the 1999 ACM/IEEE conference on Supercomputing10.1145/331532.331554(22-es)Online publication date: 1-Jan-1999
  • (1996)The Mantis parallel debuggerProceedings of the SIGMETRICS symposium on Parallel and distributed tools10.1145/238020.238056(118-126)Online publication date: 1-Jan-1996
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media