A localizing directory coherence protocol

WMPI '04: Proceedings of the 3rd workshop on Memory performance issues: in conjunction with the 31st international symposium on computer architecture

Pages 23-29

https://doi.org/10.1145/1054943.1054947

Published: 20 June 2004

Abstract

User-controllable coherence revives the idea of cooperation between software and hardware in an attempt to bridge the gap between efficient small-scale shared memory machines and massive distributed memory machines. It proposes a new multiprocessor architecture which has both a global address-space and multiple processor-local address-spaces, with new memory instructions and a new coherence protocol to manage the dual address-spaces.

The purpose of this paper is twofold. First, we solidify the semantics of instruction set extensions that enable "localization" (the act of moving data from the global address-space to a processor's local address-space), thus clearly defining the requirements for a localizing coherence protocol. Second, we demonstrate the feasibility of localizing coherence by describing the workings of a full-scale directory-based protocol that we have implemented and tested using an existing protocol specification tool.

Cited By

J. Meng and K. Skadron. Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling. 2009 IEEE International Conference on Computer Design, pages 282-288, October 2009. https://doi.org/10.1109/ICCD.2009.5413143

Published In

WMPI '04: Proceedings of the 3rd workshop on Memory performance issues: in conjunction with the 31st international symposium on computer architecture

June 2004

146 pages

ISBN: 159593040X

DOI: 10.1145/1054943

Conference Chairs: John Carter (University of Utah), Lixin Zhang (IBM Austin Research Lab)

Publisher

Association for Computing Machinery

New York, NY, United States
