Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2616606.2616707acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdateConference Proceedingsconference-collections
research-article

Reducing set-associative L1 data cache energy by early load data dependence detection (ELD3)

Published: 24 March 2014 Publication History

Abstract

Fast set-associative level-one data caches (L1 DCs) access all ways in parallel during load operations for reduced access latency. This is required in order to resolve data dependencies as early as possible in the pipeline, which otherwise would suffer from stall cycles. A significant amount of energy is wasted due to this fast access, since the data can only reside in one of the ways. While it is possible to reduce L1 DC energy usage by accessing the tag and data memories sequentially, hence activating only one data way on a tag match, this approach significantly increases execution time due to an increased number of stall cycles. We propose an early load data dependency detection (ELD3) technique for in-order pipelines. This technique makes it possible to detect if a load instruction has a data dependency with a subsequent instruction. If there is no such dependency, then the tag and data accesses for the load are sequentially performed so that only the data way in which the data resides is accessed. If there is a dependency, then the tag and data arrays are accessed in parallel to avoid introducing additional stall cycles. For the MiBench benchmark suite, the ELD3 technique enables about 49% of all load operations to access the L1 DC sequentially. Based on 65-nm data using commercial SRAM blocks, the proposed technique reduces L1 DC energy by 13%.

References

[1]
D. Williamson, ARM Cortex A8: A High Performance Processor for Low Power Applications, ARM.
[2]
M. D. Powell, A. Agarwal, T. N. Vijaykumar, B. Falsafi, and K. Roy, "Reducing set-associative cache energy via way-prediction and selective direct-mapping," in Proc. ACM/IEEE MICRO, Dec. 2001, pp. 54--65.
[3]
K. Inoue, T. Ishihara, and K. Murakami, "Way-predicting set-associative cache for high performance and low energy consumption," in Proc. IEEE ISLPED, Aug. 1999, pp. 273--275.
[4]
D. Nicolaescu, B. Salamat, A. Veidenbaum, and M. Valero, "Fast speculative address generation and way caching for reducing L1 data cache energy," in Proc. IEEE ICCD, Oct. 2006, pp. 101--107.
[5]
A. Bardizbanyan, M. Själander, D. Whalley, and P. Larsson-Edefors, "Speculative tag access for reduced energy dissipation in set-associative L1 data caches," in Proc. IEEE ICCD, Oct. 2013, pp. 302--308.
[6]
T. R. Halfhill, "ARM's midsize multiprocessor," Microprocessor, Oct. 2009.
[7]
MIPS® 1004K#8482; Coherent Processing System Datasheet, MIPS Technologies, Jul. 2009.
[8]
M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown, "MBench: A free, commercially representative embedded benchmark suite," in Proc. Int. Workshop on Workload Characterization, Dec. 2001, pp. 3--14.
[9]
V. Saljooghi, A. Bardizbanyan, M. Själander, and P. Larsson-Edefors, "Configurable RTL model for level-1 caches," in Proc. IEEE NORCHIP, Nov. 2012.
[10]
T. Austin, E. Larson, and D. Ernst, "SimpleScalar: An infrastructure for computer system modeling," Computer, vol. 35, no. 2, pp. 59--67, Feb. 2002.

Cited By

View all
  • (2018)Decoupling address generation from loads and stores to improve data access energy efficiencyACM SIGPLAN Notices10.1145/3299710.321134053:6(65-75)Online publication date: 19-Jun-2018
  • (2018)Decoupling address generation from loads and stores to improve data access energy efficiencyProceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems10.1145/3211332.3211340(65-75)Online publication date: 19-Jun-2018
  • (2016)Practical way halting by speculatively accessing halt tagsProceedings of the 2016 Conference on Design, Automation & Test in Europe10.5555/2971808.2972128(1375-1380)Online publication date: 14-Mar-2016

Index Terms

  1. Reducing set-associative L1 data cache energy by early load data dependence detection (ELD3)

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      DATE '14: Proceedings of the conference on Design, Automation & Test in Europe
      March 2014
      1959 pages
      ISBN:9783981537024

      Sponsors

      • EDAA: European Design Automation Association
      • ECSI
      • EDAC: Electronic Design Automation Consortium
      • IEEE Council on Electronic Design Automation (CEDA)
      • The Russian Academy of Sciences: The Russian Academy of Sciences

      In-Cooperation

      Publisher

      European Design and Automation Association

      Leuven, Belgium

      Publication History

      Published: 24 March 2014

      Check for updates

      Qualifiers

      • Research-article

      Conference

      DATE '14
      Sponsor:
      • EDAA
      • EDAC
      • The Russian Academy of Sciences
      DATE '14: Design, Automation and Test in Europe
      March 24 - 28, 2014
      Dresden, Germany

      Acceptance Rates

      Overall Acceptance Rate 518 of 1,794 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 26 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Decoupling address generation from loads and stores to improve data access energy efficiencyACM SIGPLAN Notices10.1145/3299710.321134053:6(65-75)Online publication date: 19-Jun-2018
      • (2018)Decoupling address generation from loads and stores to improve data access energy efficiencyProceedings of the 19th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems10.1145/3211332.3211340(65-75)Online publication date: 19-Jun-2018
      • (2016)Practical way halting by speculatively accessing halt tagsProceedings of the 2016 Conference on Design, Automation & Test in Europe10.5555/2971808.2972128(1375-1380)Online publication date: 14-Mar-2016

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media