Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids

Published: 31 January 2018 Publication History

Abstract

Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational load across clones, and due to the complex dependencies spanning the clone tree. We present the conceptual simulation framework, algorithmic foundations, and runtime interface of CloneX, a new system we designed for scalable simulation cloning. It efficiently and dynamically creates whole logical copies of a dynamic tree of simulations across a large parallel system without full physical duplication of computation and memory. The performance of a prototype implementation executed on up to 1,024 graphical processing units of a supercomputing system has been evaluated with three benchmarks—heat diffusion, forest fire, and disease propagation models—delivering a speed up of over two orders of magnitude compared to replicated runs. The results demonstrate a significantly faster and scalable way to execute many what-if scenario ensembles of large simulations via cloning using the CloneX interface.

Supplementary Material

a5-yoginath-apndx.pdf (yoginath.zip)
Supplemental movie, appendix, image and software files for, Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids

References

[1]
Maksudul M. Alam, Srikanth B. Yoginath, and Kalyan S. Perumalla. 2016. Performance of point and range queries for in-memory databases using radix trees on GPUs. In Proceedings of the IEEE Conference on Data Science and Systems.
[2]
J. H. Balbi, P. A. Santoni, and J. L. Dupuy. 1999. Dynamic modelling of fire spread across a fuel bed. Int. J. Wildl. Fire 9, 4 (1999), 275--284.
[3]
Azer Bestavros and Biao Wang. 1993. Multi-version Speculative Concurrency Control with Delayed Commit. Technical Report. Boston University Computer Science Department.
[4]
D. Chen, S. J. Turner, W. Cai, B. P. Gan, and M. Y. H Low. 2005. Algorithms for HLA-based distributed simulation cloning. ACM Trans. Model. Comput. Simul. 15, 4 (2005), 316--345.
[5]
D. Chen, S. J. Turner, B. P. Gan, W. Cai, J. Wei, and N. Julka. 2003. Alternative solutions for distributed simulation cloning. Simulation 79, 5--6 (2003), 299--315.
[6]
Joseph E. Flaherty. Accessed on: 2016. Multi-Dimensional Parabolic Problems. Retrieved from http://www.cs.rpi.edu/∼flaherje/pdf/pde5.pdf.
[7]
Masatoshi Hanai, Toyotaro Suzumura, Georgios Theodoropoulos, and Kalyan S. Perumalla. 2015. Exact-differential large-scale traffic simulation. In Proceedings of the ACM SIGSIM Conference on Principles of Advanced Discrete Simulation.
[8]
Philip Heidelberger. 1988. Discrete event simulations and parallel processing: Statistical properties. SIAM J. Sci. Stat. Comput. 9, 6 (1988), 1114--1132.
[9]
Maria Hybinette and Richard M. Fujimoto. 2001. Cloning parallel simulations. ACM Trans. Model. Comput. Simul. 11, 4 (Oct. 2001), 378--407. 1049-3301
[10]
Xiaosong Li, Wentong Cai, and Stephen John Turner. 2015. Cloning agent-based simulation. In Proceedings of the ACM SIGSIM Conference on Principles of Advanced Discrete Simulation. ACM, 173--182.
[11]
Xiaosong Li, Wentong Cai, and Stephen John Turner. 2017. Cloning agent-based simulation on GPU. ACM Trans. Model. Comput. Simul. 27, 2, Article 15 (2017), 15:1--15:24 pages.
[12]
Jeffrey Vetter and Karsten Schwan. 1997. High-performance computational steering of physical simulations. In Proceedings of the 11th International Parallel Processing Symposium. IEEE, 128--132.
[13]
Jeffrey S. Vetter and Daniel A. Reed. 2000. Real-time performance monitoring, adaptive control, and interactive steering of computational grids. Intl. J. High Perf. Comp. Appl. 14, 4 (2000), 357--366.
[14]
John Von Neumann. 1956. Probabilistic logics and the synthesis of reliable organisms from unreliable components. Auto. Studies 34 (1956), 43--98.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Modeling and Computer Simulation
ACM Transactions on Modeling and Computer Simulation  Volume 28, Issue 1
January 2018
163 pages
ISSN:1049-3301
EISSN:1558-1195
DOI:10.1145/3174299
Issue’s Table of Contents
Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 January 2018
Accepted: 01 November 2017
Revised: 01 August 2017
Received: 01 January 2017
Published in TOMACS Volume 28, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CUDA
  2. Graphical processing units
  3. load balancing
  4. supercomputing
  5. time synchronization
  6. what-if decision tree

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • U.S. Department of Energy

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)86
  • Downloads (Last 6 weeks)20
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Advanced TutorialProceedings of the Winter Simulation Conference10.5555/3586210.3586232(268-282)Online publication date: 11-Dec-2022
  • (2022)Advanced Tutorial: Parallel and Distributed Methods for Scalable Discrete Simulation2022 Winter Simulation Conference (WSC)10.1109/WSC57314.2022.10015291(268-282)Online publication date: 11-Dec-2022
  • (2021)Mesoscopic Modeling and Rapid Simulation of Incremental Changes in Epidemic Scenarios on GPUsJournal of the Indian Institute of Science10.1007/s41745-021-00253-1Online publication date: 3-Aug-2021
  • (2020)Optimal solution analysis of octagonal Steiner tree problem based on GPU acceleration2020 International Conference on Virtual Reality and Intelligent Systems (ICVRIS)10.1109/ICVRIS51417.2020.00259(1063-1066)Online publication date: Jul-2020
  • (2019)Energy conservation through cloned execution of simulationsProceedings of the Winter Simulation Conference10.5555/3400397.3400607(2572-2582)Online publication date: 8-Dec-2019
  • (2019)Advancing simulation experimentation capabilities with runtime interventionsProceedings of the Annual Simulation Symposium10.5555/3338027.3338050(1-11)Online publication date: 29-Apr-2019
  • (2019)Advancing Simulation Experimentation Capabilities with Runtime Interventions2019 Spring Simulation Conference (SpringSim)10.23919/SpringSim.2019.8732869(1-11)Online publication date: Apr-2019
  • (2019)From Effects to CausesProceedings of the 2019 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation10.1145/3316480.3322891(173-184)Online publication date: 29-May-2019
  • (2019)A Survey on Agent-based Simulation Using Hardware AcceleratorsACM Computing Surveys10.1145/329104851:6(1-35)Online publication date: 28-Jan-2019
  • (2019)Energy Conservation Through Cloned Execution Of Simulations2019 Winter Simulation Conference (WSC)10.1109/WSC40007.2019.9004821(2572-2582)Online publication date: Dec-2019

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media