Article

Adaptive insertion policies for high performance caching

Authors:

Moinuddin K. Qureshi,

Simon C. Steely,

Joel EmerAuthors Info & Claims

ISCA '07: Proceedings of the 34th annual international symposium on Computer architecture

Pages 381 - 391

https://doi.org/10.1145/1250662.1250709

Published: 09 June 2007 Publication History

Abstract

The commonly used LRU replacement policy is susceptible to thrashing for memory-intensive workloads that have a working set greater than the available cache size. For such applications, the majority of lines traverse from the MRU position to the LRU position without receiving any cache hits, resulting in inefficient use of cache space. Cache performance can be improved if some fraction of the working set is retained in the cache so that at least that fraction of the working set can contribute to cache hits.

We show that simple changes to the insertion policy can significantly reduce cache misses for memory-intensive workloads. We propose the LRU Insertion Policy (LIP) which places the incoming line in the LRU position instead of the MRU position. LIP protects the cache from thrashing and results in close to optimal hitrate for applications that have a cyclic reference pattern. We also propose the Bimodal Insertion Policy (BIP) as an enhancement of LIP that adapts to changes in the working set while maintaining the thrashing protection of LIP. We finally propose a Dynamic Insertion Policy (DIP) to choose between BIP and the traditional LRU policy depending on which policy incurs fewer misses. The proposed insertion policies do not require any change to the existing cache structure, are trivial to implement, and have a storage requirement of less than two bytes. We show that DIP reduces the average MPKI of the baseline 1MB 16-way L2 cache by 21%, bridging two-thirds of the gap between LRU and OPT.

References

[1]

L. A. Belady. A study of replacement algorithms for a virtual-storage computer. In IBM Systems journal, pages 78--101, 1966.

[2]

W. fen Lin et al. Reducing dram latencies with an integrated memory hierarchy design. In HPCA-7, pages 301--312, 2001.

Digital Library

[3]

A. Gonzalez, C. Aliagas, and M. Valero. A data cache with multiple caching strategies tuned to different types of locality. In ICS-9, 1995.

Digital Library

[4]

F. Guo and Y. Solihin. An analytical model for cache replacement policy performance. SIGMETRICS Perform. Eval. Rev., 34(1):228--239, 2006.

Digital Library

[5]

E. G. Hallnor and S. K. Reinhardt. A fully associative software-managed cache design. In ISCA-27, 2000.

Digital Library

[6]

T. L. Johnson. Run-time adaptive cache management. PhD thesis, University of Illinois, Urbana, IL, May 1998.

Digital Library

[7]

S. Kaxiras et al. Cache decay: exploiting generational behavior to reduce cache leakage power. In ISCA-28, 2001.

Digital Library

[8]

A. Lai, C. Fide, and B. Falsafi. Dead-block prediction & dead-block correlating prefetchers. In ISCA-28, 2001.

Digital Library

[9]

W. Lin and S. Reinhardt. Predicting last-touch references under optimal replacement. In Technical Report CSE-TR-447-02, University of Michigan, 2002.

[10]

S. McFarling. Cache replacement with dynamic exclusion. In ISCA-19, pages 191--200, 1992.

Digital Library

[11]

N. Megiddo and D. S. Modha. ARC: A self-tuning, low overhead replacement cache. In Proceeding of the 2nd USENIX Conference on File and Storage Technologies, 2003.

Digital Library

[12]

E. Perelman et al. Using simpoint for accurate and efficient simulation. SIGMETRICS Perform. Eval. Rev., 31(1):318--319, 2003.

Digital Library

[13]

M. K. Qureshi, D. N. Lynch, O. Mutlu, and Y. N. Patt. A case for MLP-aware cache replacement. In ISCA-33, 2006.

Digital Library

[14]

M. K. Qureshi, D. Thompson, and Y. N. Patt. The V-Way Cache: Demand Based Associativity via Global Replacement. In ISCA-32, pages 544--555, 2005.

Digital Library

[15]

J. T. Robinson and M. V. Devarakonda. Data cache management using frequency-based replacement. In SIGMETRICS '90, 1990.

Digital Library

[16]

S. Ross. A First Course in Probability. Pearson Prentice Hall, 2006.

[17]

Y. Smaragdakis et al. The EELRU adaptive replacement algorithm. Performance Evaluation, 53(2):93--123, 2003.

Digital Library

[18]

R. Subramanian et al. Adaptive caches: Effective shaping of cache behavior to workloads. In MICRO-39, 2006.

Digital Library

[19]

G. Tyson, M. Farrens, J. Matthews, and A. R. Pleszkun. A modified approach to data cache management. In MICRO-28, 1995.

Digital Library

[20]

Z. Wang et al. Using the compiler to improve cache replacement decisions. In PACT, page 199, 2002.

Digital Library

[21]

W. A. Wong and J.-L. Baer. Modified LRU policies for improving second-level cache behavior. In HPCA-6, 2000.

Cited By

Wang YMeng YWang JYang C(2024)LSTM-CRP: Algorithm-Hardware Co-Design and Implementation of Cache Replacement Policy Using Long Short-Term MemoryBig Data and Cognitive Computing10.3390/bdcc81001408:10(140)Online publication date: 21-Oct-2024
https://doi.org/10.3390/bdcc8100140
Zhao ZMorrison AFletcher CTorrellas JTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)Last-Level Cache Side-Channel Attacks Are Feasible in the Modern Public CloudProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640403(582-600)Online publication date: 27-Apr-2024
https://dl.acm.org/doi/10.1145/3620665.3640403
Ding CReber BPatru DKneipp AFigorito M(2023)E=MC^2: Efficient Mobility Centric CachingProceedings of the International Symposium on Memory Systems10.1145/3631882.3631892(1-5)Online publication date: 2-Oct-2023
https://dl.acm.org/doi/10.1145/3631882.3631892
Show More Cited By

Index Terms

Adaptive insertion policies for high performance caching
1. Hardware
  1. Integrated circuits
    1. Semiconductor memory
      1. Dynamic memory

Recommendations

SHiP: signature-based hit predictor for high performance caching
MICRO-44: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture

The shared last-level caches in CMPs play an important role in improving application performance and reducing off-chip memory bandwidth requirements. In order to use LLCs more efficiently, recent research has shown that changing the re-reference ...
High performance cache replacement using re-reference interval prediction (RRIP)
ISCA '10

Practical cache replacement policies attempt to emulate optimal replacement by predicting the re-reference interval of a cache block. The commonly used LRU replacement policy always predicts a near-immediate re-reference interval on cache hits and ...
Adaptive insertion policies for high performance caching

The commonly used LRU replacement policy is susceptible to thrashing for memory-intensive workloads that have a working set greater than the available cache size. For such applications, the majority of lines traverse from the MRU position to the LRU ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ISCA '07: Proceedings of the 34th annual international symposium on Computer architecture

June 2007

542 pages

ISBN:9781595937063

DOI:10.1145/1250662

General Chair:
Dean Tullsen
University of California, San Diego
,
Program Chair:
Brad Calder
Microsoft & University of California, San Diego

ACM SIGARCH Computer Architecture News Volume 35, Issue 2
May 2007
527 pages
ISSN:0163-5964
DOI:10.1145/1273440
Issue’s Table of Contents

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGARCH: ACM Special Interest Group on Computer Architecture
IEEE-CS: Computer Society

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2007

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

SPAA07

Sponsor:

SIGARCH
IEEE-CS

SPAA07: 19th ACM Symposium on Parallelism in Algorithms and Architectures

June 9 - 13, 2007

California, San Diego, USA

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Upcoming Conference

ISCA '25

Sponsor:
sigarch

The 52nd Annual International Symposium on Computer Architecture

June 21 - 25, 2025

Tokyo , Japan

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

564
Total Citations
View Citations
5,505
Total Downloads

Downloads (Last 12 months)605
Downloads (Last 6 weeks)78

Reflects downloads up to 23 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Wang YMeng YWang JYang C(2024)LSTM-CRP: Algorithm-Hardware Co-Design and Implementation of Cache Replacement Policy Using Long Short-Term MemoryBig Data and Cognitive Computing10.3390/bdcc81001408:10(140)Online publication date: 21-Oct-2024
https://doi.org/10.3390/bdcc8100140
Zhao ZMorrison AFletcher CTorrellas JTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)Last-Level Cache Side-Channel Attacks Are Feasible in the Modern Public CloudProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640403(582-600)Online publication date: 27-Apr-2024
https://dl.acm.org/doi/10.1145/3620665.3640403
Ding CReber BPatru DKneipp AFigorito M(2023)E=MC^2: Efficient Mobility Centric CachingProceedings of the International Symposium on Memory Systems10.1145/3631882.3631892(1-5)Online publication date: 2-Oct-2023
https://dl.acm.org/doi/10.1145/3631882.3631892
Zhao YLiu WDong FYang TLi YYang KLiu ZJia ZYang YSchulzrinne HKohler EMaltz DMisra V(2023)P4LRU: Towards An LRU Cache Entirely in Programmable Data PlaneProceedings of the ACM SIGCOMM 2023 Conference10.1145/3603269.3604813(967-980)Online publication date: 10-Sep-2023
https://dl.acm.org/doi/10.1145/3603269.3604813
Wu YZhu JWei WChen LWang LWei SLiu LSolihin YHeinrich M(2023)Shogun: A Task Scheduling Framework for Graph Mining AcceleratorsProceedings of the 50th Annual International Symposium on Computer Architecture10.1145/3579371.3589086(1-15)Online publication date: 17-Jun-2023
https://dl.acm.org/doi/10.1145/3579371.3589086
Singh SSurana NPrasad KJain PMekie JAwasthi M(2023)HyGain: High-performance, Energy-efficient Hybrid Gain Cell-based Cache HierarchyACM Transactions on Architecture and Code Optimization10.1145/357283920:2(1-20)Online publication date: 1-Mar-2023
https://dl.acm.org/doi/10.1145/3572839
Zhou FWu SYue JJin HShen J(2023)Object Fingerprint Cache for Heterogeneous Memory SystemIEEE Transactions on Computers10.1109/TC.2023.325185272:9(2496-2507)Online publication date: 1-Sep-2023
https://doi.org/10.1109/TC.2023.3251852
Joseph DAragón JParcerisa JGonzález A(2023)Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUsProceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques10.1109/PACT58117.2023.00019(124-136)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1109/PACT58117.2023.00019
Jimenez DTeran EGratz P(2023)Last-Level Cache Insertion and Promotion Policy in the Presence of Aggressive PrefetchingIEEE Computer Architecture Letters10.1109/LCA.2023.324217822:1(17-20)Online publication date: Jan-2023
https://doi.org/10.1109/LCA.2023.3242178
García SGracioli GHoornaert DKloda TCaccamo M(2023)Improving the Execution Time of Industrial Applications through Planned Cache Eviction Policy Selection2023 IEEE 32nd International Symposium on Industrial Electronics (ISIE)10.1109/ISIE51358.2023.10228033(1-6)Online publication date: 19-Jun-2023
https://doi.org/10.1109/ISIE51358.2023.10228033
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents