Article
DOI: 10.1109/IPDPS.2005.123

Benchmark Measurements of Current UPC Platforms

Published: 04 April 2005

    Abstract

    UPC is a parallel programming language based on the concept of partitioned shared memory. There are now several UPC compilers available and several different parallel architectures that support one or more of these compilers. This paper is the first to compare the performance of most of the currently available UPC implementations on several commonly used parallel platforms. These compilers are the GASNet UPC compiler from UC Berkeley, the v1.1 MuPC compiler from Michigan Tech, the Hewlett-Packard v2.2 compiler, and the Intrepid UPC compiler. The parallel architectures used in this study are a 16-node x86 Myrinet cluster, a 32-processor AlphaServer SC-40, and a 48-processor Cray T3E. A STREAM-like microbenchmark was developed to measure fine- and coarse-grained shared memory accesses. Also measured are five NPB kernels using existing UPC implementations. These measurements and associated observations provide a snapshot of the relative performance of current UPC platforms.

    Published In

    IPDPS '05: Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
    April 2005
    ISBN: 0769523129

    Publisher

    IEEE Computer Society

    United States

    Qualifiers

    • Article

    Cited By

    • (2015) Enabling PGAS Productivity with Hardware Support for Shared Address Mapping. ACM Transactions on Architecture and Code Optimization 12:4 (1-26). DOI: 10.1145/2842686. Online publication date: 22-Dec-2015
    • (2011) Optimizing the Barnes-Hut algorithm in UPC. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (1-11). DOI: 10.1145/2063384.2063485. Online publication date: 12-Nov-2011
    • (2010) Development and performance analysis of a UPC Particle-in-Cell code. Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model (1-9). DOI: 10.1145/2020373.2020383. Online publication date: 12-Oct-2010
    • (2009) UPC performance evaluation on a multicore system. Proceedings of the Third Conference on Partitioned Global Address Space Programing Models (1-7). DOI: 10.1145/1809961.1809974. Online publication date: 5-Oct-2009
    • (2009) A practical study of UPC using the NAS Parallel Benchmarks. Proceedings of the Third Conference on Partitioned Global Address Space Programing Models (1-7). DOI: 10.1145/1809961.1809973. Online publication date: 5-Oct-2009
    • (2007) UPC collective operations optimization. Proceedings of the 2007 international conference on Computational science and its applications - Volume Part I (536-549). DOI: 10.5555/1802834.1802887. Online publication date: 26-Aug-2007
    • (2007) Automatic nonblocking communication for partitioned global address space programs. Proceedings of the 21st annual international conference on Supercomputing (158-167). DOI: 10.1145/1274971.1274995. Online publication date: 17-Jun-2007
    • (2006) A performance model for fine-grain accesses in UPC. Proceedings of the 20th international conference on Parallel and distributed processing (65-65). DOI: 10.5555/1898953.1898999. Online publication date: 25-Apr-2006
    • (2006) A characterization of shared data access patterns in UPC programs. Proceedings of the 19th international conference on Languages and compilers for parallel computing (111-125). DOI: 10.5555/1757112.1757125. Online publication date: 2-Nov-2006
