Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/11752578_29guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Open MPI: a flexible high performance MPI

Published: 11 September 2005 Publication History

Abstract

A large number of MPI implementations are currently available, each of which emphasize different aspects of high-performance computing or are intended to solve a specific research problem. The result is a myriad of incompatible MPI implementations, all of which require separate installation, and the combination of which present significant logistical challenges for end users. Building upon prior research, and influenced by experience gained from the code bases of the LAM/MPI, LA-MPI, FT-MPI, and PACX-MPI projects, Open MPI is an all-new, production-quality MPI-2 implementation that is fundamentally centered around component concepts. Open MPI provides a unique combination of novel features previously unavailable in an open-source, production-quality implementation of MPI. Its component architecture provides both a stable platform for third-party research as well as enabling the run-time composition of independent software add-ons. This paper presents a high-level overview the goals, design, and implementation of Open MPI, as well as performance results for it's point-to-point implementation.

References

[1]
G. Bosilca, A. Bouteiller, F. Cappello, S. Djilali, G. Fedak, C. Germain, T. Herault, P. Lemarinier, O. Lodygensky, F. Magniette, V. Neri, and A. Selikhov. MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes. In SC'2002 Conference CD, Baltimore, MD, 2002. IEEE/ACM SIGARCH. pap298, LRI.
[2]
D. E. Bernholdt et. all. A component architecture for high-performance scientific computing. Intl. J. High-Performance Computing Applications, 2004.
[3]
G. E. Fagg, E. Gabriel, Z. Chen, T. Angskun, G. Bosilca, A. Bukovski, and J. J. Dongarra. Fault tolerant communication library and applications for high perofrmance. In Los Alamos Computer Science Institute Symposium, Santa Fee, NM, October 27-29 2003.
[4]
R. L. Graham, S.-E. Choi, D. J. Daniel, N. N. Desai, R. G. Minnich, C. E. Rasmussen, L. D. Risinger, and M. W. Sukalksi. A network-failure-tolerant message-passing system for terascale clusters. International Journal of Parallel Programming, 31(4):285-303, August 2003.
[5]
Rainer Keller, Edgar Gabriel, Bettina Krammer, Matthias S. Mueller, and Michael M. Resch. Towards efficient execution of parallel applications on the grid: porting and optimization issues. International Journal of Grid Computing, 1(2):133-149, 2003.
[6]
Jiuxing Liu, Jiesheng Wu, Sushmitha P. Kini, Pete Wyckoff, and Dhabaleswar K. Panda. High performance RDMA-based MPI implementation over infiniband. In ICS '03: Proceedings of the 17th annual international conference on Supercomputing, pages 295-304, New York, NY, USA, 2003. ACM Press.
[7]
Message Passing Interface Forum. MPI: A Message Passing Interface Standard, June 1995. http://www.mpi-forum.org/.
[8]
Message Passing Interface Forum. MPI-2: Extensions to the Message Passing Interface, July 1997. http://www.mpi-forum.org/.
[9]
MPICH2. http://www.mcs.anl.gov/mpi/mpich2/.
[10]
Sriram Sankaran, Jeffrey M. Squyres, Brian Barrett, Andrew Lumsdaine, Jason Duell, Paul Hargrove, and Eric Roman. The LAM/MPI checkpoint/restart framework: System-initiated checkpointing. International Journal of High Performance Computing Applications, To appear, 2004.
[11]
Galen M. Shipman. Infiniband scalability in Open MPI. Master's thesis, University of New Mexico, December 2005.
[12]
Q. O. Snell, A. R. Mikler, and J. L. Gustafson. NetPIPE: A Network Protocol Independent Performace Evaluator. In IASTED International Conference on Intelligent Information Management and Systems, June 1996.
[13]
J. M. Squyres and A. Lumsdaine. A Component Architecture for LAM/MPI. In Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, Venice, Italy, Sept. 2003. Springer.
[14]
Rajeev Thakur, William Gropp, and Ewing Lusk. Data sieving and collective I/O in ROMIO. In Proceedings of the 7th Symposium on the Frontiers of Massively Parallel Computation, pages 182-189. IEEE Computer Society Press, Feb 1999.

Cited By

View all
  • (2024)An ad-hoc file system accelerated workflow application for accidental fire fast responseProceedings of the 2nd Workshop on Workflows in Distributed Environments10.1145/3642978.3652836(21-27)Online publication date: 22-Apr-2024
  • (2023)Implementation-Oblivious Transparent Checkpoint-Restart for MPIProceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624255(1738-1747)Online publication date: 12-Nov-2023
  • (2023)A massively parallel implicit 3D unstructured grid solver for computing turbulent flows on latest distributed memory computational architecturesJournal of Parallel and Distributed Computing10.1016/j.jpdc.2023.104750182:COnline publication date: 1-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
PPAM'05: Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
September 2005
1121 pages
ISBN:3540341412
  • Editors:
  • Roman Wyrzykowski,
  • Jack Dongarra,
  • Norbert Meyer,
  • Jerzy Waśniewski

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 11 September 2005

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)An ad-hoc file system accelerated workflow application for accidental fire fast responseProceedings of the 2nd Workshop on Workflows in Distributed Environments10.1145/3642978.3652836(21-27)Online publication date: 22-Apr-2024
  • (2023)Implementation-Oblivious Transparent Checkpoint-Restart for MPIProceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624255(1738-1747)Online publication date: 12-Nov-2023
  • (2023)A massively parallel implicit 3D unstructured grid solver for computing turbulent flows on latest distributed memory computational architecturesJournal of Parallel and Distributed Computing10.1016/j.jpdc.2023.104750182:COnline publication date: 1-Dec-2023
  • (2023)A Case Study on PMIx-Usage for Dynamic Resource ManagementHigh Performance Computing10.1007/978-3-031-40843-4_4(42-55)Online publication date: 21-May-2023
  • (2022)Faster and scalable algorithms for densest subgraph and decompositionProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602225(26966-26979)Online publication date: 28-Nov-2022
  • (2022)A System-Wide Communication to Couple Multiple MPI Programs for Heterogeneous ComputingParallel and Distributed Computing, Applications and Technologies10.1007/978-3-031-29927-8_25(314-327)Online publication date: 7-Dec-2022
  • (2022)Towards OpenUCX and GPUDirect Technology Support for the Angara InterconnectSupercomputing10.1007/978-3-031-22941-1_43(591-603)Online publication date: 26-Sep-2022
  • (2019)Exploring Untrusted Distributed Storage for High Performance ComputingPractice and Experience in Advanced Research Computing 2019: Rise of the Machines (learning)10.1145/3332186.3332224(1-6)Online publication date: 28-Jul-2019
  • (2018)Network slicing to improve multicasting in HPC clustersCluster Computing10.5555/3287988.328800121:3(1493-1506)Online publication date: 1-Sep-2018
  • (2018)Improving MPI Multi-threaded RMA Communication PerformanceProceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225114(1-11)Online publication date: 13-Aug-2018
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media