Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1016720.1016747acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
Article

Compiler-directed code restructuring for reducing data TLB energy

Published: 08 September 2004 Publication History

Abstract

Prior work on TLB power optimization considered circuit and architectural techniques. A recent software-based technique for data TLBs has considered the possibility of storing the frequently used virtual-to-physical address translations in a set of translation registers (TRs), and using them when necessary instead of going to the data TLB. This paper presents a compiler-based strategy for increasing the effectiveness of TRs. The idea is to restructure the application code in such a fashion that once a TR is loaded, its contents are reused as much as possible. Our experimental evaluation with six array-based benchmarks from the Spec2000 suite indicates that the proposed TR reuse strategy brings significant reductions in data TLB energy over an alternate strategy that employs TRs but does not restructure the code for TR reuse.

References

[1]
S. Amarasinghe, J. Anderson, C. Wilson, S. Liao, R. French, M. Hall, B. Murphy, and M. Lam. The multiprocessor as a general-purpose processor: a software perspective. IEEE Micro, 1996.
[2]
D. Burger, T. M. Austin, and S. Bennett. Evaluating future microprocessors: the Simplescalar toolset. Technical Report CS-TR-1996-1308, Department of Computer Science, UW, 1996.
[3]
T.-C. Chiueh and R. H. Katz. Eliminating. address translation bottleneck for physical address cache. In Proceedings of ASPLOS, 1992.
[4]
J.-H. Choi, J.-H. Leek, S.-W. Jeong, S.-D. Kim, and C. Weems. A low power TLB structure for embedded systems. IEEE Computer Architecture Letters, Volume 1, January 2002.
[5]
L. T. Clark, B. Choi, and M. Wilkerson. Reducing translation lookaside buffer active power. In Proceedings of the International Symposium on Low-Power Electronics and Design, p.10--13, 2003, Seoul, Korea.
[6]
G. De Micheli. Synthesis and Optimization of Digital Circuits. McGraw-Hill, 1994.
[7]
Intel StrongArm Processor. http://www.intel.com/design/pca/applicationsprocessors/1110_brf.htm.
[8]
T. Juan, T. Lang, and J. J. Navarro. Reducing TLB power requirements. In Proceedings of the International Symposium on Low-Power Electronics and Design, 1997.
[9]
I. Kadayif, P. Nath, M. Kandemir, and A. Sivasubramaniam. Compiler-directed physical address generation for reducing data TLB power. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, March 2004.
[10]
J. Knight and P. Rosenfeld. Segmented virtual to real translation assist. IBM Technical Disclosure Bulletin, 27(2):1077--1078, July 1984.
[11]
H.-H. S. Lee and C. S. Ballapuram. Energy efficient data TLB and data cache using semantic-aware multilateral partitioning. In Proceedings of the International Symposium on Low-Power Electronics and Design, 2003.
[12]
J.-H. Lee, G.-H. Park, S.-B. Park, and S.-D. Kim. A selective filter-bank TLB system. In Proceedings of the International Symposium on Low-Power Electronics and Design, pp. 312--317, 2003, Seoul, Korea.
[13]
S. Manne, A. Klauser, D. Grunwald, and F. Somenzi. Low-power TLB design for high-performance microprocessor. Technical Report, Department of Electrical and Computer Engineering and Department of Computer Science, University of Colorado, Boulder, CO, 1997.
[14]
S. S. Muchnick. Advanced Compiler Design and Implementation. Morgan-Kaufmann, 1997.
[15]
G. Reinman and N. P. Jouppi. CACTI 2.0: an integrated cache timing and power model. Research Report 2000/7, Compaq WRL, 2000.
[16]
SH-3 RISC processor family. http://www.hitachi-eu.com/hel/ecg/products/micro/32bit/sh_3.html.
[17]
M. Wolfe. High-performance compiler for parallel computing. Addison-Wesley, 1996.

Cited By

View all
  • (2017)DTLB: Deterministic TLB for Tightly Bound Hard Real-Time Systems2017 30th International Conference on VLSI Design and 2017 16th International Conference on Embedded Systems (VLSID)10.1109/VLSID.2017.50(207-212)Online publication date: Jan-2017
  • (2016)Energy-efficient address translation2016 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2016.7446100(631-643)Online publication date: Mar-2016
  • (2016)A survey of techniques for architecting TLBsConcurrency and Computation: Practice and Experience10.1002/cpe.406129:10Online publication date: 22-Dec-2016
  • Show More Cited By

Index Terms

  1. Compiler-directed code restructuring for reducing data TLB energy

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CODES+ISSS '04: Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
    September 2004
    266 pages
    ISBN:158113 9373
    DOI:10.1145/1016720
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 September 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tag

    1. code restructuring

    Qualifiers

    • Article

    Conference

    CODES/ISSS04

    Acceptance Rates

    Overall Acceptance Rate 280 of 864 submissions, 32%

    Upcoming Conference

    ESWEEK '24
    Twentieth Embedded Systems Week
    September 29 - October 4, 2024
    Raleigh , NC , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)DTLB: Deterministic TLB for Tightly Bound Hard Real-Time Systems2017 30th International Conference on VLSI Design and 2017 16th International Conference on Embedded Systems (VLSID)10.1109/VLSID.2017.50(207-212)Online publication date: Jan-2017
    • (2016)Energy-efficient address translation2016 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2016.7446100(631-643)Online publication date: Mar-2016
    • (2016)A survey of techniques for architecting TLBsConcurrency and Computation: Practice and Experience10.1002/cpe.406129:10Online publication date: 22-Dec-2016
    • (2012)Composable Virtual Memory for an Embedded SoCProceedings of the 2012 15th Euromicro Conference on Digital System Design10.1109/DSD.2012.32(766-773)Online publication date: 5-Sep-2012
    • (2011)Towards virtual memory support in real-time and memory-constrained embedded applications: the interval page tableIET Computers & Digital Techniques10.1049/iet-cdt.2009.00305:4(287)Online publication date: 2011
    • (2010)Context-aware TLB preloading for interference reduction in embedded multi-tasked systemsProceedings of the 20th symposium on Great lakes symposium on VLSI10.1145/1785481.1785574(401-404)Online publication date: 16-May-2010
    • (2010)Code Transformations for TLB Power ReductionInternational Journal of Parallel Programming10.1007/s10766-009-0123-838:3-4(254-276)Online publication date: 21-Jan-2010
    • (2009)Direct address translation for virtual memory in energy-efficient embedded systemsACM Transactions on Embedded Computing Systems10.1145/1457246.14572518:1(1-31)Online publication date: 4-Jan-2009
    • (2008)Heterogeneously tagged caches for low-power embedded systems with virtual memory supportACM Transactions on Design Automation of Electronic Systems10.1145/1344418.134442813:2(1-24)Online publication date: 23-Apr-2008
    • (2008)Low-power and real-time address translation through arithmetic operations for virtual memory support in embedded systemsIET Computers & Digital Techniques10.1049/iet-cdt:200700902:2(75)Online publication date: 2008
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media