Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/775832.776052acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
Article

Improved indexing for cache miss reduction in embedded systems

Published: 02 June 2003 Publication History

Abstract

The increasing use of microprocessor cores in embedded systems as well as mobile and portable devices creates an opportunity for customizing the cache subsystem for improved performance. In traditional cache design, the index portion of the memory address bus consists of the K least significant bits, where K=log2(D) and D is the depth of the cache. However, in devices where the application set is known and characterized (e.g., systems that execute a fixed application set) there is an opportunity to improve cache performance by choosing an optimal set of bits used as index into the cache. This technique does not add any overhead in terms of area or delay. We give an efficient heuristic algorithm for selecting K index bits for improved cache performance. We show the feasibility of our algorithm by applying it to a large number of embedded system applications as well as the integer SPEC CPU 2000 benchmarks.

References

[1]
Technology Roadmap for Semiconductors. <http://www.itrs.net>.
[2]
C. Kozyrakis, D. Patterson. A New Direction for Computer Architecture Research, IEEE Computer, pp. 24--32, 1998.
[3]
F. Vahid, T. Givargis. The Case for a Configure-and-Execute Paradigm. Conference on HW/SW Codesign, 1999.
[4]
A. Malik, B. Moyer, D. Cermak. A Lower Power Unified Cache Architecture Providing Power and Performance Flexibility. Symposium on Low Power Electronics and Design, 2000.
[5]
P. Petrov, A. Orailoglu. Towards Effective Embedded Processors in Codesigns: Customizable Partitioned Caches. Workshop on HW/SW Codesign, 2001.
[6]
K. Suzuki, T. Arai, N. Kouhei, I. Kuroda. V830R/AV: Embedded Multimedia Superscalar RISC Processor. IEEE Micro, 1998.
[7]
P. Petrov, A. Orailoglu. Towards Effective Embedded Processors in Codesigns: Customizable Partitioned Caches. Workshop on Hardware/Software Codesign, 2001.
[8]
C. Su, A.M. Despain. Cache Design Trade-offs for Power and Performance Optimization: A Case Study. International Symposium on Low Power Electronics and Design, 1995.
[9]
R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, S. Dwarkadas. Memory Hierarchy Reconfiguration for Energy and Performance in GP Processor Architectures. International Symposium on Microarchitecture, 2000.
[10]
D.A. Patterson, J.L. Hennessy. Computer Organization and Design: The Hardware/Software Interface, Second Edition. Morgan Kaufmann, 1997.
[11]
P.R. Panda, H. Nakamura, N.D. Dutt, A. Nicolau. Improving Cache Performance through Tiling and Data Alignment. Workshop on Parallel Algorithms for Irregularly Structured Problems, 1997.
[12]
A. Vandecappelle, M. Miranda, E. Brockmeyer, F. Catthoor, D. Verkest. Global Multimedia System Design Exploration using Accurate Memory Organization Feedback. Design Automation Conference, 1999.
[13]
G. Rivera, C. Tseng. Compiler Optimizations for Eliminating Cache Conflict Misses. Department of Computer Science, Technical Report, UM, 1997.
[14]
T.L. Johnson, M.C. Merten, W.W. Hwu. Run-Time Spatial Locality Detection and Optimization. International Symposium on Microarchitecture, 1997.
[15]
M. Waldvogel, et al. Scalable High Speed IP Routing Lookups. ACM Special Interest Group on Data Communication, 1997.
[16]
S. Nilsson, G. Karlsson. Fast address lookup for Internet routers. IEEE Broadband Communications, 1998.
[17]
E. Gurari. An Introduction to the Theory of Computation. Ohio State University, Computer Science Press, 1989.
[18]
SPEC CPU 2000. <http://www.spec.org>.
[19]
D. Burger, T.M. Austin, The SimpleScalar Tool Set, Version 2.0. University of Wisconsin-Madison Computer Sciences Department Technical Report #1342, June 1997.

Cited By

View all
  • (2024)FASTA: Revisiting Fully Associative Memories in Computer MicroarchitectureIEEE Access10.1109/ACCESS.2024.335596112(13923-13943)Online publication date: 2024
  • (2020)Evolution of application-specific cache mappingsInternational Journal of Hybrid Intelligent Systems10.3233/HIS-200281(1-13)Online publication date: 3-Jul-2020
  • (2020)CHASM: Security Evaluation of Cache Mapping SchemesEmbedded Computer Systems: Architectures, Modeling, and Simulation10.1007/978-3-030-60939-9_17(245-261)Online publication date: 7-Oct-2020
  • Show More Cited By

Index Terms

  1. Improved indexing for cache miss reduction in embedded systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DAC '03: Proceedings of the 40th annual Design Automation Conference
    June 2003
    1014 pages
    ISBN:1581136889
    DOI:10.1145/775832
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 June 2003

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cache optimization
    2. design space exploration
    3. index hashing

    Qualifiers

    • Article

    Conference

    DAC03
    Sponsor:

    Acceptance Rates

    DAC '03 Paper Acceptance Rate 152 of 628 submissions, 24%;
    Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

    Upcoming Conference

    DAC '25
    62nd ACM/IEEE Design Automation Conference
    June 22 - 26, 2025
    San Francisco , CA , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)FASTA: Revisiting Fully Associative Memories in Computer MicroarchitectureIEEE Access10.1109/ACCESS.2024.335596112(13923-13943)Online publication date: 2024
    • (2020)Evolution of application-specific cache mappingsInternational Journal of Hybrid Intelligent Systems10.3233/HIS-200281(1-13)Online publication date: 3-Jul-2020
    • (2020)CHASM: Security Evaluation of Cache Mapping SchemesEmbedded Computer Systems: Architectures, Modeling, and Simulation10.1007/978-3-030-60939-9_17(245-261)Online publication date: 7-Oct-2020
    • (2020)Optimization of Application-Specific L1 Cache Translation Functions of the LEON3 ProcessorProceedings of the 11th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2019)10.1007/978-3-030-49345-5_28(266-276)Online publication date: 1-Aug-2020
    • (2018)Reducing the second-level cache conflict misses using a set folding techniqueThe Journal of Supercomputing10.1007/s11227-017-2174-874:2(970-993)Online publication date: 1-Feb-2018
    • (2017)An access pattern based adaptive mapping function for GPGPU scratchpad memoryIEICE Electronics Express10.1587/elex.14.2017037314:12(20170373-20170373)Online publication date: 2017
    • (2017)Contention-Aware Selective Caching to Mitigate Intra-Warp Contention on GPUs2017 16th International Symposium on Parallel and Distributed Computing (ISPDC)10.1109/ISPDC.2017.17(1-8)Online publication date: Jul-2017
    • (2017)Evolvable caches: Optimization of reconfigurable cache mappings for a LEON3/Linux-based multi-core processor2017 International Conference on Field Programmable Technology (ICFPT)10.1109/FPT.2017.8280144(215-218)Online publication date: Dec-2017
    • (2017)Evaluation methodology for complex non-deterministic functions: A case study in metaheuristic optimization of caches2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS)10.1109/AHS.2017.8046380(206-213)Online publication date: Jul-2017
    • (2017)Memory ArchitecturesHandbook of Hardware/Software Codesign10.1007/978-94-017-7358-4_14-1(1-31)Online publication date: 8-Apr-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media