Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1735997.1736018acmconferencesArticle/Chapter ViewAbstractPublication PagesveeConference Proceedingsconference-collections
research-article

DBT path selection for holistic memory efficiency and performance

Published: 17 March 2010 Publication History

Abstract

Dynamic binary translators(DBTs) provide powerful platforms for building dynamic program monitoring and adaptation tools. DBTs, however, have high memory demands because they cache translated code and auxiliary code to a software code cache and must also maintain data structures to support the code cache. The high memory demands make it difficult for memory-constrained embedded systems to take advantage of DBT-based tools. Previous research on DBT memory management focused on the translated code and auxiliary code only. However, we found that data structures are comparable to the code cache in size. We show that the translated code size, auxiliary code size and the data structure size interact in a complex manner, depending on the path selection (trace selection and link formation) strategy. Therefore, holistic memory efficiency (comprising translated code, auxiliary code and data structures) cannot be improved by focusing on the code cache only. In this paper, we use path selection for improving holistic memory efficiency which in turn impacts performance in memory-constrained environments. Although there has been previous research on path selection, such research only considered performance in memory-unconstrained environments.
The challenge for holistic memory efficiency is that the path selection strategy results in complex interactions between the memory demand components. Also, individual aspects of path selection and the holistic memory efficiency may impact performance in complex ways. We explore these interactions to motivate path selection targeting holistic memory demand. We enumerate all the aspects involved in a path selection design and evaluate a comprehensive set of approaches for each aspect. Finally, we propose a path selection strategy that reduces memory demands by 20% and at the same time improves performance by 5-20% compared to an industrial-strength DBT.

References

[1]
J. Baiocchi, B. R. Childers, J. W. Davidson, J. D. Hiser, and J. Misurda. Fragment cache management for dynamic binary translators in embedded systems with scratchpad. In International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, pages 75--84, Salzburg, Austria, 2007.
[2]
J. A. Baiocchi, B. R. Childers, J. W. Davidson, and J. D. Hiser. Reducing pressure in bounded DBT code caches. In International Conference on Compilers, Architectures and Synthesis for Embedded Systems, pages 109--118, Atlanta, GA, USA, 2008.
[3]
V. Bala, E. Duesterwald, and S. Banerjia. Dynamo: a transparent dynamic optimization system. In Conference on Programming Language Design and Implementation, pages 1--12, Vancouver, British Columbia, Canada, 2000.
[4]
D. Bruening and S. Amarasinghe. Maintaining consistency and bounding capacity of software code caches. In International Symposium on Code Generation and Optimization, pages 74--85, San Jose, California, 2005.
[5]
D. Bruening and E. Duesterwald. Exploring optimal compilation unit shapes for an embedded just-in-time compiler. In In Proceedings of the 2000 ACM Workshop on Feedback-Directed and Dynamic Optimization FDDO-3, pages 13--20, 2000.
[6]
D. Bruening, T. Garnett, and S. Amarasinghe. An infrastructure for adaptive dynamic optimization. In International Symposium on Code Generation and Optimization, pages 265--275, San Francisco, California, 2003.
[7]
D. Bruening, V. Kiriansky, T. Garnett, and S. Banerji. Thread-shared software code caches. In 4th Int'l Symposium on Code Generation and Optimization, pages 28--38, Manhattan, New York, NY, March 2006.
[8]
D. L. Bruening. Efficient, Transparent and Comprehensive Runtime Code Manipulation. PhD thesis, Massachusetts Institute of Technology, Cambridge, MA, September 2004.
[9]
G. Desoli, N. Mateev, E. Duesterwald, P. Faraboschi, and J. A. Fisher. Deli: a new run-time control point. In 35th International Symposium on Microarchitecture, pages 257--268, Istanbul, Turkey, 2002.
[10]
E. Duesterwald and V. Bala. Software profiling for hot path prediction: less is more. In ASPLOS-IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems, pages 202--211, Cambridge, Massachusetts, United States, 2000.
[11]
A. Guha, K. Hazelwood, and M. L. Soffa. Reducing exit stub memory consumption in code caches. In International Conference on High-Performance Embedded Architectures and Compilers (HiPEAC), pages 87--101, Ghent, Belgium, January 2007.
[12]
A. Guha, K. Hazelwood, and M. L. Soffa. Code lifetime based memory reduction for virtual execution environments. In 6th Workshop on Optimizations for DSP and Embedded Systems (ODES), Boston, MA, March 2008.
[13]
M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown. Mibench : A free, commercially representative embedded benchmark suite. In Workshop on Workload Characterization, pages 3--14, 2001.
[14]
K. Hazelwood and A. Klauser. A dynamic binary instrumentation engine for the ARM architecture. In International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, pages 261--270, Seoul, Korea, 2006.
[15]
K. Hazelwood, G. Lueck, and R. Cohn. Scalable support for multithreaded applications on dynamic binary instrumentation systems. In ISMM '09: Proceedings of the 2009 international symposium on Memory management, pages 20--29, Dublin, Ireland, 2009.
[16]
K. Hazelwood and M. D. Smith. Managing bounded code caches in dynamic binary optimization systems. Transactions on Code Generation and Optimization (TACO), 3(3):263--294, September 2006.
[17]
J. L. Henning. Spec cpu2000: Measuring CPU performance in the new millennium. Computer, 2000.
[18]
D. J. Hiniker, K. Hazelwood, and M. D. Smith. Improving region selection in dynamic optimization systems. In 38th International Symposium on Microarchitecture, pages 141--154, Barcelona, Spain, November 2005.
[19]
J. D. Hiser, D.Williams, A. Filipi, J.W. Davidson, and B. R. Childers. Evaluating fragment construction policies for SDT systems. In Conference on Virtual Execution Environments, pages 122--132, Ottawa, Ontario, Canada, 2006.
[20]
W. Hu, J. Hiser, D.Williams, A. Filipi, J.W. Davidson, D. Evans, J. C.Knight, A. Nguyen-Tuong, and J. Rowanhill. Secure and practical defense against code-injection attacks using software dynamic translation. In Conference on Virtual Execution Environments, pages 2--12, Ottawa, Canada, 2006.
[21]
V. Janapareddi, D. Connors, R. Cohn, and M. D. Smith. Persistent code caching: Exploiting code reuse across executions and applications. In International Symposium on Code Generation and Optimization, pages 74--88, San Jose, California, 2007.
[22]
W. ke Chen, S. Lerner, R. Chaiken, and D. Gilles. Mojo: A dynamic optimization system. In Proceedings of the 4th ACM Workshop on Feedback-Directed and Dynamic Optimization, pages 81--90, 2000.
[23]
V. Kiriansky, D. Bruening, and S. Amarasinghe. Secure execution via program shepherding. In 11th USENIX Security Symposium, pages 191--206, San Francisco, CA, 2002.
[24]
N. Kumar, B. R. Childers, D. Williams, J. W. Davidson, and M. L.Soffa. Compile-time planning for overhead reduction in software dynamic translators. Int. J. Parallel Program., 33(2):103--114, 2005.
[25]
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. Janapareddi, and K. Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In Conference on Programming Language Design and Implementation, pages 190--200, Chicago, IL, June 2005.
[26]
R.W. Moore, J. A. Baiocchi, B. R. Childers, J.W. Davidson, and J. D.Hiser. Addressing the challenges of DBT for the ARM architecture. In LCTES '09: Proceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems, pages 147--156, Dublin, Ireland, 2009.
[27]
N. Nethercote and J. Seward. Valgrind: a framework for heavyweight dynamic binary instrumentation. In PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, pages 89--100, San Diego, California, USA, 2007.
[28]
K. Scott, N. Kumar, S. Velusamy, B. Childers, J. Davidson, and M. L.Soffa. Reconfigurable and retargetable software dynamic translation. In 1st Int'l Symposium on Code Generation and Optimization, pages 36--47, San Francisco, California, March 2003.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
VEE '10: Proceedings of the 6th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
March 2010
176 pages
ISBN:9781605589107
DOI:10.1145/1735997
  • cover image ACM SIGPLAN Notices
    ACM SIGPLAN Notices  Volume 45, Issue 7
    VEE '10
    July 2010
    161 pages
    ISSN:0362-1340
    EISSN:1558-1160
    DOI:10.1145/1837854
    Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 March 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dynamic binary translation
  2. embedded systems
  3. memory management
  4. path selection
  5. virtual machines

Qualifiers

  • Research-article

Conference

VEE '10

Acceptance Rates

Overall Acceptance Rate 80 of 235 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 24 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2016)Hardware-Accelerated Cross-Architecture Full-System VirtualizationACM Transactions on Architecture and Code Optimization10.1145/299679813:4(1-25)Online publication date: 25-Oct-2016
  • (2014)SPTUProceedings of International Conference on Systems and Storage10.1145/2611354.2611368(1-12)Online publication date: 30-Jun-2014
  • (2013)SPIREACM SIGPLAN Notices10.1145/2517326.245151648:7(1-12)Online publication date: 16-Mar-2013
  • (2013)SPIREProceedings of the 9th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments10.1145/2451512.2451516(1-12)Online publication date: 16-Mar-2013
  • (2011)LnQProceedings of the 2011 International Conference on Parallel Processing10.1109/ICPP.2011.57(226-234)Online publication date: 13-Sep-2011
  • (2011)SEntre: A dynamic binary instrumentation infrastructure2011 International Conference on Electronics, Communications and Control (ICECC)10.1109/ICECC.2011.6066387(1016-1019)Online publication date: Sep-2011
  • (2010)Balancing memory and performance through selective flushing of software code cachesProceedings of the 2010 international conference on Compilers, architectures and synthesis for embedded systems10.1145/1878921.1878923(1-10)Online publication date: 24-Oct-2010
  • (2022)Dynamic Binary ModificationundefinedOnline publication date: 7-Mar-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media