article

Development of Parallel Methods for a $1024$-Processor Hypercube

Authors:

John L. Gustafson,

Gary R. Montry,

Robert E. BennerAuthors Info & Claims

SIAM Journal on Scientific and Statistical Computing, Volume 9, Issue 4

Pages 609 - 638

https://doi.org/10.1137/0909041

Published: 01 July 1988 Publication History

Abstract

We have developed highly efficient parallel solutions for three practical, full-scale scientific problems: wave mechanics, fluid dynamics, and structural analysis. Several algorithmic techniques are used to keep communication and serial overhead small as both problem size and number of processors are varied. A new parameter, operation efficiency, is introduced that quantifies the tradeoff between communication and redundant computation. A 1024-processor MIMD ensemble is measured to be 502 to 637 times as fast as a single processor when problem size for the ensemble is fixed, and 1009 to 1020 times as fast as a single processor when problem size per processor is fixed. The latter measure, denoted scaled speedup, is developed and contrasted with the traditional measure of parallel speedup. The scaled-problem paradigm better reveals the capabilities of large ensembles, and permits detection of subtle hardware-induced load imbalances (such as error correction and data-dependent MFLOPS rates) that may become increasingly important as parallel processors increase in node count. Sustained performance for the applications is 70 to 130 MFLOPS, validating the massively parallel ensemble approach as a practical alternative to more conventional processing methods. The techniques presented appear extensible to even higher levels of parallelism than the 1024-processor level explored here.

Cited By

View all

Kremer-Herman NTovar BThain D(2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291708(1-13)Online publication date: 11-Nov-2018
https://dl.acm.org/doi/10.5555/3291656.3291708
Kremer-Herman NTovar BThain D(2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00042(1-13)Online publication date: 11-Nov-2018
https://dl.acm.org/doi/10.1109/SC.2018.00042
Bell GBailey DDongarra JKarp AWalsh K(2017)A look back on 30 years of the Gordon Bell PrizeInternational Journal of High Performance Computing Applications10.1177/109434201773861031:6(469-484)Online publication date: 1-Nov-2017
https://dl.acm.org/doi/10.1177/1094342017738610
Show More Cited By

Development of Parallel Methods for a $1024$-Processor Hypercube
1. Computing methodologies
  1. Parallel computing methodologies
2. Theory of computation
  1. Design and analysis of algorithms
  2. Models of computation
    1. Concurrency

Recommendations

The Effects of Problem Partitioning, Allocation, and Granularity on the Performance of Multiple-Processor Systems

In this paper we analyze the effects of the problem decomposition, the allocation of subproblems to processors, and the grain size of subproblems on the performance of a multiple- processor shared-memory architecture. Our results indicate that for ...
PASM: A Partitionable SIMD/MIMD System for Image Processing and Pattern Recognition

PASM, a large-scale multimicroprocessor system being designed at Purdue University for image processing and pattern recognition, is described. This system can be dynamically reconfigured to operate as one or more independent SIMD and/or MIMD machines. ...
Parallelizing Image Feature Extraction on Coarse-Grain Machines

In this paper, we present a fast parallel algorithm for feature extraction on coarse-grain MIMD machines. By maintaining algorithmic threads at each node, our algorithm enhances processor utilization and obtains large speed-ups. Our implementations show ...

Comments

Information & Contributors

Information

Published In

cover image SIAM Journal on Scientific and Statistical Computing

SIAM Journal on Scientific and Statistical Computing Volume 9, Issue 4

1988

163 pages

ISSN:0196-5204

Issue’s Table of Contents

Publisher

Society for Industrial and Applied Mathematics

United States

Publication History

Published: 01 July 1988

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

93
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 28 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Kremer-Herman NTovar BThain D(2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291708(1-13)Online publication date: 11-Nov-2018
https://dl.acm.org/doi/10.5555/3291656.3291708
Kremer-Herman NTovar BThain D(2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00042(1-13)Online publication date: 11-Nov-2018
https://dl.acm.org/doi/10.1109/SC.2018.00042
Bell GBailey DDongarra JKarp AWalsh K(2017)A look back on 30 years of the Gordon Bell PrizeInternational Journal of High Performance Computing Applications10.1177/109434201773861031:6(469-484)Online publication date: 1-Nov-2017
https://dl.acm.org/doi/10.1177/1094342017738610
Yang TTseng HShin SShin DLencastre M(2017)Numerical similarity-aware data partitioning for recommendations as a serviceProceedings of the Symposium on Applied Computing10.1145/3019612.3019676(887-892)Online publication date: 3-Apr-2017
https://dl.acm.org/doi/10.1145/3019612.3019676
Feng DTsolakis CChernikov AChrisochoides N(2017)Scalable 3D hybrid parallel Delaunay image-to-mesh conversion algorithm for distributed shared memory architecturesComputer-Aided Design10.1016/j.cad.2016.07.01085:C(10-19)Online publication date: 1-Apr-2017
https://dl.acm.org/doi/10.1016/j.cad.2016.07.010
Feng DChernikov AChrisochoides N(2016)Two-level locality-aware parallel Delaunay image-to-mesh conversionParallel Computing10.1016/j.parco.2016.01.00759:C(60-70)Online publication date: 1-Nov-2016
https://dl.acm.org/doi/10.1016/j.parco.2016.01.007
Numrich R(2014)Computer performance analysis and the Pi TheoremComputer Science - Research and Development10.1007/s00450-010-0147-829:1(45-71)Online publication date: 1-Feb-2014
https://dl.acm.org/doi/10.1007/s00450-010-0147-8
Prakash AChaudhury ARamachandran R(2013)Parallel simulation of population balance model-based particulate processes using multicore CPUs and GPUsModelling and Simulation in Engineering10.1155/2013/4754782013(2-2)Online publication date: 1-Jan-2013
https://dl.acm.org/doi/10.1155/2013/475478
Lobachev OLoogen RLoulergue F(2010)Estimating parallel performance, a skeleton-based approachProceedings of the fourth international workshop on High-level parallel programming and applications10.1145/1863482.1863489(25-34)Online publication date: 25-Sep-2010
https://dl.acm.org/doi/10.1145/1863482.1863489
Gupta AKoric SGeorge TPinfold W(2009)Sparse matrix factorization on massively parallel computersProceedings of the Conference on High Performance Computing Networking, Storage and Analysis10.1145/1654059.1654061(1-12)Online publication date: 14-Nov-2009
https://dl.acm.org/doi/10.1145/1654059.1654061
Show More Cited By

Abstract

Cited By

Recommendations

The Effects of Problem Partitioning, Allocation, and Granularity on the Performance of Multiple-Processor Systems

PASM: A Partitionable SIMD/MIMD System for Image Processing and Pattern Recognition

Parallelizing Image Feature Extraction on Coarse-Grain Machines

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Share

Share this Publication link

Share on social media

Affiliations