Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/602770.602838acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article
Free access

Application-specific protocols for user-level shared memory

Published: 14 November 1994 Publication History

Abstract

Recent distributed shared memory (DSM) systems and proposed shared-memory machines have implemented some or all of their cache coherence protocols in software. One way to exploit the flexibility of this software is to tailor a coherence protocol to match an application's communication patterns and memory semantics. This paper presents evidence that this approach can lead to large performance improvements. It shows that application-specific protocols substantially improved the performance of three application programs--appbt, em3d, and barnes--over carefully tuned transparent shared memory implementations. The speed-ups were obtained on Blizzard, a fine-grained DSM system running on a 32-node Thinking Machines CM-5.

References

[1]
David Bailey, John Barton, Thomas Lasinski, and Horst Simon. The NAS Parallel Benchmarks. Technical Report RNR-91-002 Revision 2, Ames Research Center, August 1991.]]
[2]
John B. Carter, John K. Bennett, and Willy Zwaenepoel. Implementation and Performance of Munin. In Proceedings of the Thirteenth ACM Symposium on Operating System Principles (SOSP), pages 152-164, October 1991.]]
[3]
David Chaiken, John Kubiatowicz, and Anant Agarwal. LimitLESS Directories: A Scalable Cache Coherence Scheme. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS IV), pages 224-234, April 1991.]]
[4]
D. E. Culler, A. Dusseau, S. C. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, and K. Yelick. Parallel Programming in Split-C. In Proceedings of Supercomputing 93, pages 262-273, November 1993.]]
[5]
Matthew I. Frank and Mary K. Vernon. A Hybrid Shared Memory/Message Passing Parallel Machine. In Proceedings of the 1993 International Conference on Parallel Processing (Vol. I Architecture), pages 232-236, August 1993.]]
[6]
Dana S. Henry and Christopher F. Joerg. A Tightly-Coupled Processor-Network Interface. In Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS V), pages 111-122, October 1992.]]
[7]
Mark D. Hill, James R. Larus, Steven K. Reinhardt, and David A. Wood. Cooperative Shared Memory: Software and Hardware for Scalable Multiprocessors. ACM Transactions on Computer Systems, 11(4):300-318, November 1993. Earlier version appeared in ASPLOS V, Oct. 1992.]]
[8]
W. Daniel Hillis and Lewis W. Tucker. The CM-5 Connection Machine: A Scalable Supercomputer. Communications of the ACM, 36(11):31-40, November 1993.]]
[9]
Kendall Square Research. Kendall Square Research Technical Summary, 1992.]]
[10]
David Kranz, Kirk Johnson, Anant Agarwal, John Kubiatowicz, and Beng-Hong Lim. Integrating Message-Passing and Shared-Memory: Early Experience. In Fifth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), pages 54-63, May 1993.]]
[11]
Jeffrey Kuskin et al. The Stanford FLASH Multiprocessor. In Proceedings of the 21st Annual International Symposium on Computer Architecture, pages 302-313, April 1994.]]
[12]
James R. Larus. Compiling for Shared-Memory and Message-Passing Computers. ACM Letters on Programming Languages and Systems, 2(1-4):165-180, March-December 1994.]]
[13]
James R. Larus and Thomas Ball. Rewriting Executable Files to Measure Program Behavior. Software Practice & Experience, 24(2):197-218, February 1994.]]
[14]
James R. Larus, Brad Richards, and Guhan Viswanathan. LCM: Memory System Support for Parallel Language Implementation. In Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS VI), October 1994. To appear.]]
[15]
Daniel Lenoski, James Laudon, Kourosh Gharachorloo, Wolf-Dietrich Weber, Anoop Gupta, John Hennessy, Mark Horowitz, and Monica Lam. The Stanford DASH Multiprocessor. IEEE Computer, 25(3):63-79, March 1992.]]
[16]
Kai Li and Paul Hudak. Memory Coherence in Shared Virtual Memory Systems. ACM Transactions on Computer Systems, 7(4):321-359, November 1989.]]
[17]
John M. Mellor-Crummey and Michael L. Scott. Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors. ACM Transactions on Computer Systems, 9(1):21-65, February 1991.]]
[18]
R. S. Nikhil, G. M. Papadopoulos, and Arvind. *T: A Multithreaded Massively Parallel Architecture. In Proceedings of the 19th Annual International Symposium on Computer Architecture, pages 156-167, May 1992.]]
[19]
Steven K. Reinhardt, Babak Falsafi, and David A. Wood. Kernel Support for the Wisconsin Wind Tunnel. In Proceedings of the Usenix Symposium on Microkernels and Other Kernel Architectures, September 1993.]]
[20]
Steven K. Reinhardt, Mark D. Hill, James R. Larus, Alvin R. Lebeck, James C. Lewis, and David A. Wood. The Wisconsin Wind Tunnel: Virtual Prototyping of Parallel Computers. In Proceedings of the 1993 ACM Sigmetrics Conference on Measurement and Modeling of Computer Systems, pages 48-60, May 1993.]]
[21]
Steven K. Reinhardt, James R. Larus, and David A. Wood. Tempest and Typhoon: User-Level Shared Memory. In Proceedings of the 21st Annual International Symposium on Computer Architecture, pages 325-337, April 1994.]]
[22]
Ioannis Schoinas, Babak Falsafi, Alvin R. Lebeck, Steven K. Reinhardt, James R. Larus, and David A. Wood. Fine-grain Access Control for Distributed Shared Memory. In Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS VI), October 1994. To appear.]]
[23]
Jaswinder Pal Singh, Wolf-Dietrich Weber, and Anoop Gupta. SPLASH: Stanford Parallel Applications for Shared Memory. Computer Architecture News, 20(1):5-44, March 1992.]]
[24]
Daniel Stodolsky, J. Brad Chen, and Brian Bershad. Fast Interrupt Priority Management in Operating Systems. In Second USENIX Symposium on Microkernels and Other Kernel Archtitectures, pages 105-110, September 1993. San Diego, CA.]]
[25]
Thorsten von Eicken, David E. Culler, Seth Copen Goldstein, and Klaus Erik Schauser. Active Messages: a Mechanism for Integrating Communication and Computation. In Proceedings of the 19th Annual International Symposium on Computer Architecture, pages 256-266, May 1992.]]
[26]
David A. Wood, Satish Chandra, Babak Falsafi, Mark D. Hill, James R. Larus, Alvin R. Lebeck, James C. Lewis, Shubhendu S. Mukherjee, Subbarao Palacharla, and Steven K. Reinhardt. Mechanisms for Cooperative Shared Memory. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 156-168, May 1993.]]
[27]
William A Wulf. Compilers and Computer Architecture. IEEE Computer, 14(7):41-47, July 1981.]]

Cited By

View all
  • (2015)Manycore network interfaces for in-memory rack-scale computingACM SIGARCH Computer Architecture News10.1145/2872887.275041543:3S(567-579)Online publication date: 13-Jun-2015
  • (2015)Manycore network interfaces for in-memory rack-scale computingProceedings of the 42nd Annual International Symposium on Computer Architecture10.1145/2749469.2750415(567-579)Online publication date: 13-Jun-2015
  • (2014)Scale-out NUMAACM SIGARCH Computer Architecture News10.1145/2654822.254196542:1(3-18)Online publication date: 24-Feb-2014
  • Show More Cited By
  1. Application-specific protocols for user-level shared memory

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    Supercomputing '94: Proceedings of the 1994 ACM/IEEE conference on Supercomputing
    November 1994
    840 pages
    ISBN:0818666056

    Sponsors

    Publisher

    IEEE Computer Society Press

    Washington, DC, United States

    Publication History

    Published: 14 November 1994

    Check for updates

    Qualifiers

    • Article

    Conference

    SC '94
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)68
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 09 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)Manycore network interfaces for in-memory rack-scale computingACM SIGARCH Computer Architecture News10.1145/2872887.275041543:3S(567-579)Online publication date: 13-Jun-2015
    • (2015)Manycore network interfaces for in-memory rack-scale computingProceedings of the 42nd Annual International Symposium on Computer Architecture10.1145/2749469.2750415(567-579)Online publication date: 13-Jun-2015
    • (2014)Scale-out NUMAACM SIGARCH Computer Architecture News10.1145/2654822.254196542:1(3-18)Online publication date: 24-Feb-2014
    • (2014)Scale-out NUMAACM SIGPLAN Notices10.1145/2644865.254196549:4(3-18)Online publication date: 24-Feb-2014
    • (2014)Scale-out NUMAProceedings of the 19th international conference on Architectural support for programming languages and operating systems10.1145/2541940.2541965(3-18)Online publication date: 24-Feb-2014
    • (2010)On-chip communication and synchronization mechanisms with cache-integrated network interfacesProceedings of the 7th ACM international conference on Computing frontiers10.1145/1787275.1787328(217-226)Online publication date: 17-May-2010
    • (2008)Extending CC-NUMA systems to support write update optimizationsProceedings of the 2008 ACM/IEEE conference on Supercomputing10.5555/1413370.1413401(1-12)Online publication date: 15-Nov-2008
    • (2006)Exploiting localityProceedings of the 20th international conference on Parallel and distributed processing10.5555/1898953.1898969(33-33)Online publication date: 25-Apr-2006
    • (2004)Coherence decouplingACM SIGOPS Operating Systems Review10.1145/1037949.102440638:5(97-106)Online publication date: 7-Oct-2004
    • (2004)Coherence decouplingACM SIGARCH Computer Architecture News10.1145/1037947.102440632:5(97-106)Online publication date: 7-Oct-2004
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media