Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Development of Parallel Methods for a $1024$-Processor Hypercube

Published: 01 July 1988 Publication History

Abstract

We have developed highly efficient parallel solutions for three practical, full-scale scientific problems: wave mechanics, fluid dynamics, and structural analysis. Several algorithmic techniques are used to keep communication and serial overhead small as both problem size and number of processors are varied. A new parameter, operation efficiency, is introduced that quantifies the tradeoff between communication and redundant computation. A 1024-processor MIMD ensemble is measured to be 502 to 637 times as fast as a single processor when problem size for the ensemble is fixed, and 1009 to 1020 times as fast as a single processor when problem size per processor is fixed. The latter measure, denoted scaled speedup, is developed and contrasted with the traditional measure of parallel speedup. The scaled-problem paradigm better reveals the capabilities of large ensembles, and permits detection of subtle hardware-induced load imbalances (such as error correction and data-dependent MFLOPS rates) that may become increasingly important as parallel processors increase in node count. Sustained performance for the applications is 70 to 130 MFLOPS, validating the massively parallel ensemble approach as a practical alternative to more conventional processing methods. The techniques presented appear extensible to even higher levels of parallelism than the 1024-processor level explored here.

Cited By

View all
  • (2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291708(1-13)Online publication date: 11-Nov-2018
  • (2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00042(1-13)Online publication date: 11-Nov-2018
  • (2017)A look back on 30 years of the Gordon Bell PrizeInternational Journal of High Performance Computing Applications10.1177/109434201773861031:6(469-484)Online publication date: 1-Nov-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image SIAM Journal on Scientific and Statistical Computing
SIAM Journal on Scientific and Statistical Computing  Volume 9, Issue 4
1988
163 pages

Publisher

Society for Industrial and Applied Mathematics

United States

Publication History

Published: 01 July 1988

Author Tags

  1. 65W05
  2. 68M20
  3. 68Q05
  4. 68Q10
  5. MIMD machines
  6. fluid dynamics
  7. hypercubes
  8. multiprocessor performance
  9. parallel computing
  10. structural analysis
  11. supercomputing
  12. wave mechanics

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291708(1-13)Online publication date: 11-Nov-2018
  • (2018)A lightweight model for right-sizing master-worker applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00042(1-13)Online publication date: 11-Nov-2018
  • (2017)A look back on 30 years of the Gordon Bell PrizeInternational Journal of High Performance Computing Applications10.1177/109434201773861031:6(469-484)Online publication date: 1-Nov-2017
  • (2017)Numerical similarity-aware data partitioning for recommendations as a serviceProceedings of the Symposium on Applied Computing10.1145/3019612.3019676(887-892)Online publication date: 3-Apr-2017
  • (2017)Scalable 3D hybrid parallel Delaunay image-to-mesh conversion algorithm for distributed shared memory architecturesComputer-Aided Design10.1016/j.cad.2016.07.01085:C(10-19)Online publication date: 1-Apr-2017
  • (2016)Two-level locality-aware parallel Delaunay image-to-mesh conversionParallel Computing10.1016/j.parco.2016.01.00759:C(60-70)Online publication date: 1-Nov-2016
  • (2014)Computer performance analysis and the Pi TheoremComputer Science - Research and Development10.1007/s00450-010-0147-829:1(45-71)Online publication date: 1-Feb-2014
  • (2013)Parallel simulation of population balance model-based particulate processes using multicore CPUs and GPUsModelling and Simulation in Engineering10.1155/2013/4754782013(2-2)Online publication date: 1-Jan-2013
  • (2010)Estimating parallel performance, a skeleton-based approachProceedings of the fourth international workshop on High-level parallel programming and applications10.1145/1863482.1863489(25-34)Online publication date: 25-Sep-2010
  • (2009)Sparse matrix factorization on massively parallel computersProceedings of the Conference on High Performance Computing Networking, Storage and Analysis10.1145/1654059.1654061(1-12)Online publication date: 14-Nov-2009
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media