Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

Validating the Simulation of Large-Scale Parallel Applications Using Statistical Characteristics

Published: 12 February 2016 Publication History

Abstract

Simulation is a widely adopted method to analyze and predict the performance of large-scale parallel applications. Validating the hardware model is highly important for complex simulations with a large number of parameters. Common practice involves calculating the percent error between the projected and the real execution time of a benchmark program. However, in a high-dimensional parameter space, this coarse-grained approach often suffers from parameter insensitivity, which may not be known a priori. Moreover, the traditional approach cannot be applied to the validation of software models, such as application skeletons used in online simulations. In this work, we present a methodology and a toolset for validating both hardware and software models by quantitatively comparing fine-grained statistical characteristics obtained from execution traces. Although statistical information has been used in tasks like performance optimization, this is the first attempt to apply it to simulation validation. Our experimental results show that the proposed evaluation approach offers significant improvement in fidelity when compared to evaluation using total execution time, and the proposed metrics serve as reliable criteria that progress toward automating the simulation tuning process.

References

[1]
C. Albing, N. Troullier, S. Whalen, R. Olson, J. Glenski, H. Pritchard, and H. Mills. 2011. Scalable node allocation for improved performance in regular and anisotropic 3D torus supercomputers. In Recent Advances in the Message Passing Interface. Springer, 61--70.
[2]
P. N. Clauss, M. Stillwell, S. Genaud, F. Suter, H. Casanova, and M. Quinson. 2011. Single node on-line simulation of MPI applications with SMPI. In 2011 IEEE International Parallel & Distributed Processing Symposium (IPDPS). IEEE, 664--675.
[3]
W. E. Denzel, J. Li, P. Walker, and Y. Jin. 2010. A framework for end-to-end simulation of high-performance computing systems. Simulation 86, 5-6 (2010), 331--350.
[4]
F. Desprez, G. S. Markomanolis, M. Quinson, and F. Suter. 2011. Assessing the performance of MPI applications through time-independent trace replay. In Proceedings of the 2011 40th International Conference on Parallel Processing Workshops (ICPPW). IEEE, 467--476.
[5]
M. Geimer, F. Wolf, B. J. N. Wylie, E. Ábrahám, D. Becker, and B. Mohr. 2010. The Scalasca performance toolset architecture. Concurrency and Computation: Practice and Experience 22, 6 (2010), 702--719.
[6]
S. D. Hammond, G. R. Mudalige, J. A. Smith, S. A. Jarvis, J. A. Herdman, and A. Vadgama. 2009. WARPP: A toolkit for simulating high-performance parallel scientific codes. In Proceedings of the 2nd International Conference on Simulation Tools and Techniques. ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 19.
[7]
T. Hoefler, T. Schneider, and A. Lumsdaine. 2010. LogGOPSim-simulating large-scale applications in the LogGOPS model. In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC), Vol. 10. 597--604.
[8]
K. A. Huck, A. D. Malony, S. Shende, and D. W. Jacobsen. 2014. Integrated measurement for cross-platform OpenMP performance analysis. In Using and Improving OpenMP for Devices, Tasks, and More. Springer, 146--160.
[9]
C. L. Janssen, H. Adalsteinsson, S. Cranford, J. P. Kenny, A. Pinar, D. A. Evensky, and J. Mayo. 2010. A simulator for large-scale parallel computer architectures. International Journal of Distributed Systems and Technologies 1, 2 (2010), 57--73.
[10]
C. Janssen, D. Quinlan, and J. Shalf. 2011. Architectural simulation for exascale hardware/software co-design.
[11]
S. Kamil, L. Oliker, A. Pinar, and J. Shalf. 2010. Communication requirements and interconnect optimization for high-end scientific applications. IEEE Transactions on Parallel and Distributed Systems 21, 2 (2010), 188--202.
[12]
W. E. Nagel, A. Arnold, M. Weber, H. C. Hoppe, and K. Solchenbach. 1996. VAMPIR: Visualization and Analysis of MPI Resources. Citeseer.
[13]
A. Núñez, J. Fernández, J. D. Garcia, F. Garcia, and J. Carretero. 2010. New techniques for simulating high performance MPI applications on large storage networks. The Journal of Supercomputing 51, 1 (2010), 40--57.
[14]
A. J. Peña, R. G. C. Carvalho, J. Dinan, P. Balaji, R. Thakur, and W. Gropp. 2013. Analysis of topology-dependent MPI performance on Gemini networks. In Proceedings of the 20th European MPI Users’ Group Meeting. ACM, 61--66.
[15]
B. Penoff, A. Wagner, M. Tuxen, and I. Rungeler. 2009. MPI-NetSim: A network simulation module for MPI. In Proceedings of the 2009 15th International Conference on Parallel and Distributed Systems (ICPADS). IEEE, 464--471.
[16]
R. Preissl, T. Kockerbauer, M. Schulz, D. Kranzlmuller, B. Supinski, and D. J. Quinlan. 2008. Detecting patterns in MPI communication traces. In Proceedings of the 37th International Conference on Parallel Processing (ICPP’08). IEEE, 230--237.
[17]
R. Reussner, P. Sanders, L. Prechelt, and M. Müller. 1998. SKaMPI: A detailed, accurate MPI benchmark. Recent Advances in Parallel Virtual Machine and Message Passing Interface (1998), 52--59.
[18]
A. F. Rodrigues, K. S. Hemmert, B. W. Barrett, C. Kersey, R. Oldfield, M. Weston, R. Risen, J. Cook, P. Rosenfeld, E. CooperBalls, and others. 2011. The structural simulation toolkit. ACM SIGMETRICS Performance Evaluation Review 38, 4 (2011), 37--42.
[19]
J. Shalf, D. Quinlan, and C. Janssen. 2011. Rethinking hardware-software codesign for exascale systems. Computer 44, 11 (2011), 22--30.
[20]
S. S. Shende and A. D. Malony. 2006. The TAU parallel performance system. International Journal of High Performance Computing Applications 20, 2 (2006), 287--311.
[21]
H. J. Song, X. Liu, D. Jakobsen, R. Bhagwan, X. Zhang, K. Taura, and A. Chien. 2000. The microgrid: A scientific tool for modeling computational grids. In Proceedings of the ACM/IEEE 2000 Conference on Supercomputing. IEEE, 53--53.
[22]
M. Sottile, A. Dakshinamurthy, G. Hendry, and D. Dechev. 2013. Automatic extraction of software skeletons for benchmarking large-scale parallel applications. In ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (PADS).
[23]
C. D. Sudheer and A. Srinivasan. 2012. Optimization of the hop-byte metric for effective topology aware mapping. In Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC). IEEE, 1--9.
[24]
R. Susukita, H. Ando, M. Aoyagi, H. Honda, Y. Inadomi, K. Inoue, S. Ishizuki, Y. Kimura, H. Komatsu, M. Kurokawa, and others. 2008. Performance prediction of large-scale parallell system and application using macro-level simulation. In Proceedings of the 2008 ACM/IEEE Conference on Supercomputing. IEEE, 20.
[25]
M. Tikir, M. Laurenzano, L. Carrington, and A. Snavely. 2009. PSINS: An open source event tracer and execution simulator for MPI applications. Euro-Par 2009 Parallel Processing, 135--148.
[26]
K. D. Underwood, M. Levenhagen, and A. Rodrigues. 2007. Simulating red storm: Challenges and successes in building a system simulation. In IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007). IEEE, 1--10.
[27]
P. Velho and A. Legrand. 2009. Accuracy study and improvement of network simulation in the SimGrid framework. In Proceedings of the 2nd International Conference on Simulation Tools and Techniques. ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 13.
[28]
J. S. Vetter and M. O. McCracken. 2001. Statistical scalability analysis of communication operations in distributed applications. In ACM SIGPLAN Notices, Vol. 36. ACM, 123--132.
[29]
M. Weber, R. Brendel, and H. Brunst. 2012. Trace file comparison with a hierarchical sequence alignment algorithm. In Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA). IEEE, 247--254.
[30]
J. J. Wilke, K. Sargsyan, J. P. Kenny, B. Debusschere, H. N. Najm, and G. Hendry. 2013. Validation and uncertainty assessment of extreme-scale HPC simulation through Bayesian inference. In Euro-Par 2013 Parallel Processing. Springer, 41--52.
[31]
N. J. Wright, W. Pfeiffer, and A. Snavely. 2009. Characterizing parallel scaling of scientific applications using IPM. In Proceedings of the 10th LCI International Conference on High-Performance Clustered Computing. 10--12.
[32]
X. Wu and F. Mueller. 2011. ScalaExtrap: Trace-based communication extrapolation for SPMD programs. In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming. ACM, 113--122.
[33]
J. Zhai, W. Chen, and W. Zheng. 2010. Phantom: Predicting performance of parallel applications on large-scale parallel machines using a single node. In ACM Sigplan Notices, Vol. 45. ACM, 305--314.
[34]
D. Zhang, G. Hendry, and D. Dechev. 2014. Tools for enabling automatic validation of large-scale parallel application simulations. In 2014 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 601--604.
[35]
G. Zheng, G. Kakulapati, and L. V. Kalé. 2004. Bigsim: A parallel simulator for performance prediction of extremely large parallel machines. In Proceedings of the 18th International Parallel and Distributed Processing Symposium, 2004. IEEE, 78.

Cited By

View all
  • (2018)Demand-Response Power Management Strategy Using Time Shifting CapabilitiesProceedings of the Ninth International Conference on Future Energy Systems10.1145/3208903.3213519(480-485)Online publication date: 12-Jun-2018
  • (2018)Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)10.1109/PAW-ATM.2018.00010(44-56)Online publication date: Nov-2018
  • (2018)REMIJournal of Intelligent Information Systems10.1007/s10844-018-0524-551:2(367-388)Online publication date: 1-Oct-2018
  1. Validating the Simulation of Large-Scale Parallel Applications Using Statistical Characteristics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Modeling and Performance Evaluation of Computing Systems
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems  Volume 1, Issue 1
    Inaugural Issue
    March 2016
    118 pages
    ISSN:2376-3639
    EISSN:2376-3647
    DOI:10.1145/2893449
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 February 2016
    Accepted: 01 July 2015
    Revised: 01 May 2015
    Received: 01 November 2014
    Published in TOMPECS Volume 1, Issue 1

    Permissions

    Request permissions for this article.

    Author Tags

    1. Simulation evaluation
    2. evaluation metrics
    3. software skeleton

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)38
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 17 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Demand-Response Power Management Strategy Using Time Shifting CapabilitiesProceedings of the Ninth International Conference on Future Energy Systems10.1145/3208903.3213519(480-485)Online publication date: 12-Jun-2018
    • (2018)Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)10.1109/PAW-ATM.2018.00010(44-56)Online publication date: Nov-2018
    • (2018)REMIJournal of Intelligent Information Systems10.1007/s10844-018-0524-551:2(367-388)Online publication date: 1-Oct-2018

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media