Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/125826.126075acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article
Free access

Performance debugging shared memory multiprocessor programs with MTOOL

Published: 01 August 1991 Publication History
First page of PDF

References

[1]
Thomas Anderson and E. Lazowska. Quartz: A tool for tuning parallel program performance. In Proceedings of 1990 ACM SIGMETRICS, pages 115- 125, May 1990.
[2]
D. Bernstein, A. Bolmarchich, and K. So. Performance visualization of parallel programs on a shared memory multiprocessor. In Proceedings of ICPP, vol. iI, pages 1-10, Aug. 1989.
[3]
James Boyle, Ewing Lusk, et al. Portable Programs for Parallel Processors. Holt, Rinehart, and Winston, 1987.
[4]
J. Dongarra, O. Brewer, J. Kohl, and S. Fineberg. A tool to aid in the design, implementation, and understanding of matrix algorithms for parallel processors. Journal of Parallel and Distributed Computing, 9(2): 185-202, 1990.
[5]
A. Goldberg and J. Hennessy. MTOOL: A method for detecting memory bottlenecks. Technical Note WRL TN-17/90, DEC Western Research Laboratory, 1990.
[6]
A. Goldberg and J. Hennessy. Minimum cost basic block counting. Technical Report In Preparation, Stanford Computer Systems Laboratory, 1991.
[7]
A. Goldberg and J. Hennessy. MTOOL: A method for isolating memory bottlenecks in shared memory multiprocessor programs. In Proceedings of ICPP, vol. II, Aug. 1991.
[8]
Mark A. Linton, John M. Vlissides, and Paul R. Calder. Composing user interfaces with interviews. IEEE Computer, 22(2):8-22, 1989.
[9]
Allen Malony. Performance Observability. PhD thesis, University of Illinois at Urbana-Champaign, Oct. 1990.
[10]
Allen Malony. Event-based performance perturbation. In ACM SIGPLAN PPoPP, 1991.
[11]
B. Miller, et al. IPS-2: The second generation of a parallel program measurement system. IEEE Transactions on Parallel and Distributed Systems, 1(2):206-217, 1990.
[12]
E. Rothberg and A. Gupta. Parallel iccg on a hierarchical memory processor-addressing the triangular solve bottleneck. Technical Report STAN-CS- 90-1330, Stanford Department of Computer Science, October 1990.
[13]
E. Rothberg and A. Gupta. Techniques for improving the performance of sparse matrix factorization on multiprocessor workstations. In Proceedings of Supercomputing '90, pages 232-41, November 1990.
[14]
J.H. Saltz, R. Mirchandaney, and D. Baxter. Runtime parallelization and scheduling of loops. Technical Report ICASE 88-70, ICASE, NASA Langley Research Center, 1988.
[15]
Vivek Sarkar. Determining average program execution times and their variance. In Sigplan Conference on Programming Language Design and Implementation, pages 298-312, 1989.

Cited By

View all
  • (2011)Towards a Methodology for Deliberate Sample-Based Statistical Performance AnalysisProceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPS.2011.262(1258-1265)Online publication date: 16-May-2011
  • (2010)The Cilkview scalability analyzerProceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures10.1145/1810479.1810509(145-156)Online publication date: 13-Jun-2010
  • (2006)Synthetic‐perturbation techniques for screening shared memory programsSoftware: Practice and Experience10.1002/spe.438024080224:8(679-701)Online publication date: 30-Oct-2006
  • Show More Cited By

Index Terms

  1. Performance debugging shared memory multiprocessor programs with MTOOL

                        Recommendations

                        Comments

                        Information & Contributors

                        Information

                        Published In

                        cover image ACM Conferences
                        Supercomputing '91: Proceedings of the 1991 ACM/IEEE conference on Supercomputing
                        August 1991
                        920 pages
                        ISBN:0897914597
                        DOI:10.1145/125826
                        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                        Sponsors

                        Publisher

                        Association for Computing Machinery

                        New York, NY, United States

                        Publication History

                        Published: 01 August 1991

                        Permissions

                        Request permissions for this article.

                        Check for updates

                        Qualifiers

                        • Article

                        Conference

                        SC '91
                        Sponsor:

                        Acceptance Rates

                        Supercomputing '91 Paper Acceptance Rate 83 of 215 submissions, 39%;
                        Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

                        Contributors

                        Other Metrics

                        Bibliometrics & Citations

                        Bibliometrics

                        Article Metrics

                        • Downloads (Last 12 months)30
                        • Downloads (Last 6 weeks)5
                        Reflects downloads up to 14 Oct 2024

                        Other Metrics

                        Citations

                        Cited By

                        View all
                        • (2011)Towards a Methodology for Deliberate Sample-Based Statistical Performance AnalysisProceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPS.2011.262(1258-1265)Online publication date: 16-May-2011
                        • (2010)The Cilkview scalability analyzerProceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures10.1145/1810479.1810509(145-156)Online publication date: 13-Jun-2010
                        • (2006)Synthetic‐perturbation techniques for screening shared memory programsSoftware: Practice and Experience10.1002/spe.438024080224:8(679-701)Online publication date: 30-Oct-2006
                        • (2005)TAPEProceedings of the 19th annual international conference on Supercomputing10.1145/1088149.1088176(199-208)Online publication date: 20-Jun-2005
                        • (2005)EMPSProceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 10 - Volume 1110.1109/IPDPS.2005.196Online publication date: 4-Apr-2005
                        • (2005)Using platform-specific performance counters for dynamic compilationProceedings of the 18th international conference on Languages and Compilers for Parallel Computing10.1007/978-3-540-69330-7_23(334-346)Online publication date: 20-Oct-2005
                        • (2005)An adaptive cost system for parallel program instrumentationEuro-Par'96 Parallel Processing10.1007/3-540-61626-8_11(88-97)Online publication date: 8-Jun-2005
                        • (2005)Stanford DASH multiprocessor: The hardware and software approachPARLE '92 Parallel Architectures and Languages Europe10.1007/3-540-55599-4_125(799-805)Online publication date: 14-Jul-2005
                        • (2005)A loop-aware search strategy for automated performance analysisProceedings of the First international conference on High Performance Computing and Communications10.1007/11557654_68(573-584)Online publication date: 21-Sep-2005
                        • (2003)Dynamic instrumentation of large-scale MPI and OpenMP applicationsProceedings International Parallel and Distributed Processing Symposium10.1109/IPDPS.2003.1213161(9)Online publication date: 2003
                        • Show More Cited By

                        View Options

                        View options

                        PDF

                        View or Download as a PDF file.

                        PDF

                        eReader

                        View online with eReader.

                        eReader

                        Get Access

                        Login options

                        Media

                        Figures

                        Other

                        Tables

                        Share

                        Share

                        Share this Publication link

                        Share on social media