Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Scalability versus Execution Time in Scalable Systems

Published: 01 February 2002 Publication History

Abstract

Parallel programming is elusive. The relative performance of different parallel implementations varies with machine architecture, system and problem size. How to compare different implementations over a wide range of machine architectures and problem sizes has not been well addressed due to its difficulty. Scalability has been proposed in recent years to reveal scaling properties of parallel algorithms and machines. In this paper, the relation between scalability and execution time is carefully studied. The concepts of crossing point analysis and range comparison are introduced. Crossing point analysis finds slow/fast performance crossing points of parallel algorithms and machines. Range comparison compares performance over a wide range of ensemble and problem size via scalability and crossing point analysis. Three algorithms from scientific computing are implemented on an Intel Paragon and an IBM SP2 parallel computer. Experimental and theoretical results show how the combination of scalability, crossing point analysis, and range comparison provides a practical solution for scalable performance evaluation and prediction. While our testings are conducted on homogeneous parallel computers, the proposed methodology applies to heterogeneous and network computing as well.

References

[1]
1. P. Worley, The effect of time constraints on scaled speedup, SIAM J. Sci. Stat. Comput. 11 (Sept. 1990), 838-858.]]
[2]
2. A. Y. Grama, A. Gupta, and V. Kumar, Isoefficiency: Measuring the scalability of parallel algorithms and architectures, IEEE Parallel Distrib. Technol. 1 (Aug. 1993), 12-21.]]
[3]
3. X.-H. Sun and D. Rover, Scalability of parallel algorithm-machine combinations, IEEE Trans. Parallel Distrib. Systems5 (June 1994), 599-613.]]
[4]
4. X. Zhang, Y. Yan, and K. He, Latency metric: An experimental method for measuring d evaluating parallel program and architecture scalability, J. Parallel Distrib. Comput. 22 (Sept. 1994), 392-410.]]
[5]
5. S. Sahni and V. Thanvantri, Performance metrics: Keeping the focus on runtime, IEEE Parallel Distrib. Technol. (Spring 1996), 43-56.]]
[6]
6. K. Hwang and Z. Xu, "Scalable Parallel Computing," McGraw-Hill, New York, 1998.]]
[7]
7. X.-H. Sun and J. Zhu, Performance considerations of shared virtual memory machines, IEEE Trans. Parallel Distrib. Systems (Nov. 1995), 1185-1194.]]
[8]
8. X.-H. Sun and J. Zhu, Performance prediction: A case study using a scalable shared-virtual-memory machine, IEEE Parallel Distrib. Technol. (Winter 1996), 36-49.]]
[9]
9. X.-H. Sun, "Scalability versus Execution Time in Scalable Systems," Louisiana State Uiversity, Computer Science TR-97-003, 1997. {Revised May 1998.}]]
[10]
10. C. Ho and S. Johnsson, Optimizing tridiagonal solvers for alternating direction methods on boolean cube multiprocessors, SIAM J. of. of Sci. and Stat. Computing11 (1990), 563-592.]]
[11]
11. X.-H. Sun, Application and accuracy of the parallel diagonal dominant algorithm, Parallel Computing (Aug. 1995), 1241-1267.]]
[12]
12. X.-H. Sun, H. Zhang, and L. Ni, Efficient tridiagonal solvers on multicomputers, IEEE Transactions on Computers41 (1992), 286-296.]]
[13]
13. V. Kumar et al., "Introduction to Parallel Computing: Design and Analysis of Algorithms," Benjamin-Commings, Redwood City, CA, 1994.]]
[14]
14. V. Bala et al., Ccl: A portable and tunable collective communication library for scalable parallel computers, IEEE Trans. Parallel Distrib. Systems6 (Feb. 1995), 154-164.]]
[15]
15. C. Hirsch, "Numerical Computation of Internal and External Flows," Wiley, New York, 1988.]]
[16]
16. X.-H. Sun, M. Pantano, and T. Fahringer, Integrated range comparison for data-parallel compilation systems, IEEE Trans. Parallel Distrib. Systems10 (May, 1999), 448-458.]]

Cited By

View all
  • (2023)The Memory-Bounded Speedup Model and Its Impacts in ComputingJournal of Computer Science and Technology10.1007/s11390-022-2911-138:1(64-79)Online publication date: 31-Jan-2023
  • (2008)Scalability measurement of a proxy-based multimedia content repurposing systemInternational Journal of Advanced Media and Communication10.1504/IJAMC.2008.0201802:3(267-287)Online publication date: 1-Sep-2008
  • (2004)On performance analysis of heterogeneous parallel algorithmsParallel Computing10.1016/j.parco.2004.07.00730:11(1195-1216)Online publication date: 1-Nov-2004

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Journal of Parallel and Distributed Computing
Journal of Parallel and Distributed Computing  Volume 62, Issue 2
February 2002
154 pages

Publisher

Academic Press, Inc.

United States

Publication History

Published: 01 February 2002

Author Tags

  1. execution time
  2. numerical algorithms
  3. parallel processing
  4. performance evaluation and measurement
  5. scalability

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)The Memory-Bounded Speedup Model and Its Impacts in ComputingJournal of Computer Science and Technology10.1007/s11390-022-2911-138:1(64-79)Online publication date: 31-Jan-2023
  • (2008)Scalability measurement of a proxy-based multimedia content repurposing systemInternational Journal of Advanced Media and Communication10.1504/IJAMC.2008.0201802:3(267-287)Online publication date: 1-Sep-2008
  • (2004)On performance analysis of heterogeneous parallel algorithmsParallel Computing10.1016/j.parco.2004.07.00730:11(1195-1216)Online publication date: 1-Nov-2004

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media