DOI: 10.5555/548716.822670

Reducing Remote Conflict Misses: NUMA with Remote Cache versus COMA

Published: 01 February 1997

Abstract

Many future applications for scalable shared-memory multiprocessors are likely to have large working sets that overflow secondary or tertiary caches. Two possible solutions to this problem are to add a very large cache, called a remote cache, that caches remote data (NUMA-RC), or to organize the machine as a cache-only memory architecture (COMA). This paper tries to determine which solution is better. To compare the performance of the two organizations for the same amount of total memory, we introduce a model of data sharing. The model uses three data sharing patterns: replication, read-mostly migration, and read-write migration. Replication data is accessed in read-mostly mode by several processors, while migration data is accessed largely by one processor at a time. For large working sets, the weight of the migration data largely determines whether COMA outperforms NUMA-RC. Ideally, COMA only needs to fit the replication data in its extra memory; the migration data will simply be swapped between attraction memories. The remote cache of NUMA-RC, instead, needs to house both the replication and the migration data. However, simulations of seven Splash2 applications show that COMA does not outperform NUMA-RC. This is due to two reasons. First, the extra memory added has more associativity in NUMA-RC than in COMA and, therefore, can be utilized better by the working set in NUMA-RC. Second, COMA memory accesses are more expensive. Of course, our results are affected by the applications used, which have been optimized for a cache-coherent NUMA machine. Overall, since NUMA-RC is cheaper, NUMA-RC is more cost-effective for these applications.
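
The capacity argument in the abstract (ideally, COMA's extra memory holds only the replication data, while the NUMA-RC remote cache must hold both replication and migration data) can be illustrated with a toy calculation. This is only a sketch of that idealized reasoning, not the paper's model: the names `RemoteWorkingSet`, `extra_memory_needed_kb`, and `overflow_kb`, the two-way split of the working set, and the sizes in kilobytes are all assumptions made here for illustration. It deliberately ignores associativity and access cost, which the abstract gives as the reasons COMA does not outperform NUMA-RC in the simulations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RemoteWorkingSet:
    """Hypothetical per-node remote working set, split by sharing pattern."""
    replication_kb: int  # data read by several processors
    migration_kb: int    # data used largely by one processor at a time


def extra_memory_needed_kb(ws: RemoteWorkingSet, organization: str) -> int:
    """Idealized extra memory a node needs to hold its remote working set."""
    if organization == "COMA":
        # Ideal case from the abstract: migration data is swapped between
        # attraction memories, so only replication data uses extra memory.
        return ws.replication_kb
    if organization == "NUMA-RC":
        # The remote cache must house both replication and migration data.
        return ws.replication_kb + ws.migration_kb
    raise ValueError(f"unknown organization: {organization!r}")


def overflow_kb(ws: RemoteWorkingSet, organization: str, extra_memory_kb: int) -> int:
    """Amount of the working set that does not fit (0 if it fits)."""
    return max(0, extra_memory_needed_kb(ws, organization) - extra_memory_kb)


if __name__ == "__main__":
    # Made-up sizes: a migration-heavy working set and 512 KB of extra memory.
    ws = RemoteWorkingSet(replication_kb=256, migration_kb=768)
    for org in ("COMA", "NUMA-RC"):
        print(org, overflow_kb(ws, org, extra_memory_kb=512))
```

With these made-up sizes the idealized COMA node has no overflow, while the NUMA-RC remote cache is 512 KB short. The gap grows with the share of migration data, which is the effect the abstract describes for large working sets.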

Published In

HPCA '97: Proceedings of the 3rd IEEE Symposium on High-Performance Computer Architecture
February 1997
ISBN: 0818677643

Publisher

IEEE Computer Society, United States

Author Tags

1. COMA
2. NUMA with Remote Cache
3. cache coherent
4. remote conflict miss
5. shared-memory multiprocessor

