Benchmarking parallel compilers: a UPC case study

Published: 01 August 2006

Abstract

Unified Parallel C (UPC) is an explicit parallel extension of ISO C that follows the Partitioned Global Address Space (PGAS) programming model. UPC therefore combines the ability to express parallelism with the ability to exploit data locality. To deliver on this, compilers must implement effective UPC-specific optimizations. In this paper we present a strategy for evaluating the performance of PGAS compilers. It is based on emulating possible optimizations and comparing the resulting performance with the raw compiler performance. We show that this technique uncovers missed optimization opportunities. The results also demonstrate that, with such optimizations applied automatically, UPC performance compares favorably with that of other paradigms.
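The evaluation strategy described in the abstract can be illustrated with a minimal sketch in plain C. It is not the paper's benchmark code: the cyclic layout, the constants THREADS and PER_THREAD, and every function name below are hypothetical, and the shared array is only emulated with an ordinary two-dimensional array. The "raw" kernel translates a global index on every access, while the second kernel emulates one possible optimization (privatization of local shared data) by walking a private pointer. Timing both gives the kind of comparison the abstract refers to.

```c
#include <assert.h>
#include <stddef.h>
#include <time.h>

#define THREADS 4        /* hypothetical number of emulated UPC threads */
#define PER_THREAD 1024  /* elements with affinity to each emulated thread */

/* Emulated shared array: global element i lives in storage[i % THREADS],
   an assumed cyclic layout used only for this illustration. */
static double storage[THREADS][PER_THREAD];

/* Give every element a distinct, exactly representable value. */
void fill_storage(void)
{
    for (int t = 0; t < THREADS; t++)
        for (size_t k = 0; k < PER_THREAD; k++)
            storage[t][k] = (double)(t * PER_THREAD) + (double)k;
}

/* "Raw" access path: each reference pays for index translation. */
static double shared_read(size_t i)
{
    return storage[i % THREADS][i / THREADS];
}

/* Sum the elements owned by thread `me` through the generic path. */
double sum_raw(int me)
{
    double s = 0.0;
    for (size_t i = (size_t)me; i < (size_t)THREADS * PER_THREAD; i += THREADS)
        s += shared_read(i);
    return s;
}

/* Same sum with the optimization emulated: a private pointer to local data. */
double sum_privatized(int me)
{
    const double *local = storage[me];
    double s = 0.0;
    for (size_t k = 0; k < PER_THREAD; k++)
        s += local[k];
    return s;
}

/* CPU seconds spent running `kernel` for thread `me`, `reps` times. */
double seconds_for(double (*kernel)(int), int me, int reps)
{
    assert(reps > 0);
    volatile double sink = 0.0;  /* keeps the calls from being optimized away */
    clock_t start = clock();
    for (int r = 0; r < reps; r++)
        sink += kernel(me);
    (void)sink;
    return (double)(clock() - start) / CLOCKS_PER_SEC;
}
```

After calling fill_storage(), the two kernels return the same value for a given thread, so the ratio of seconds_for(sum_raw, me, reps) to seconds_for(sum_privatized, me, reps) isolates the cost of the access path. A ratio well above one would suggest an optimization the compiler did not apply on its own. In this plain C emulation an optimizing compiler may narrow the gap by itself, so the numbers only illustrate the method.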

Cited By

  • (2017) Performance Evaluation of a Two-Dimensional Lattice Boltzmann Solver Using CUDA and PGAS UPC Based Parallelisation. ACM Transactions on Mathematical Software 44:1 (1-22). doi:10.1145/3085590. Online publication date: 14-Jul-2017
  • (2015) Enabling PGAS Productivity with Hardware Support for Shared Address Mapping. ACM Transactions on Architecture and Code Optimization 12:4 (1-26). doi:10.1145/2842686. Online publication date: 22-Dec-2015
  • (2009) UPC performance evaluation on a multicore system. Proceedings of the Third Conference on Partitioned Global Address Space Programing Models (1-7). doi:10.1145/1809961.1809974. Online publication date: 5-Oct-2009

Published In

Future Generation Computer Systems, Volume 22, Issue 7
Systems performance analysis and evaluation
August 2006
127 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Author Tags

  1. NAS parallel benchmark
  2. benchmark
  3. compilers
  4. distributed shared memory
  5. non-uniform memory accesses architecture
  6. unified parallel C

Qualifiers

  • Article
