Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1023833.1023843acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
Article

Balancing design options with Sherpa

Published: 22 September 2004 Publication History
  • Get Citation Alerts
  • Abstract

    Application specific processors offer the potential of rapidly designed logic specifically constructed to meet the performance and area demands of the task at hand. Recently, there have been several major projects that attempt to automate the process of transforming a predetermined processor configuration into a low level description for fabrication. These projects either leave the specification of the processor to the designer, which can be a significant engineering burden, or handle it in a fully automated fashion, which completely removes the designer from the loop.In this paper we introduce a technique for guiding the design and optimization of application specific processors. The goal of the Sherpa design framework is to automate certain design tasks and provide early feedback to help the designer navigate their way through the architecture design space. Our approach is to decompose the overall problem of choosing an optimal architecture into a set of sub-problems that are, to the first order, independent. For each sub-problem, we create a model that relates performance to area. From this, we build a constraint system that can be solved using integer-linear programming techniques, and arrive at an ideal parameter selection for all architectural components. Our approach only takes a few minutes to explore the design space allowing the designer or compiler to see the potential benefits of optimizations rapidly. We show that the expected performance using our model correlates strongly to detailed pipeline simulations, and present results showing design tradeoffs for several different benchmarks.

    References

    [1]
    S. Abraham, B. Rau, R. Schreiber, G. Snider, and M. Schlansker. Efficient design space exploration in pico. In Proc. of International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, pages 71--79, San Jose, California, November 2000.
    [2]
    S. G. Abraham and S. A. Mahlke. Automatic and efficient evaluation of memory hierarchies for embedded systems. In 32nd International Symposium on Microarchitecture, 1999.
    [3]
    Anant Agarwal, Mark Horowitz, and John Hennessy. An analytical cache model. ACM Transactions on Computer Systems, 7(2):184--215, 1989.
    [4]
    ARC. Whitepaper: Customizing a soft microprocessor core. http://www.arccores.com, 2001.
    [5]
    L. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In 27th Annual International Symposium on Computer Architecture, Vancouver, Canada, June 2000.
    [6]
    M. Berkelaar. lp solve: a mixed integer linear program solver. ftp://ftp.es.ele.tue.nl/pub/lp_solve, September 1997.
    [7]
    D. C. Burger and T. M. Austin. The simplescalar tool set, version 2.0. Technical Report CS-TR-97-1342, University of Wisconsin, Madison, June 1997.
    [8]
    Paolo Faraboschi, Geoffrey Brown, Joseph A. Fisher, Giuseppe Desoli, and Fred Homewood. Lx: a technology platform for customizable vliw embedded processing. In 27th Annual International Symposium on Computer Architecture, pages 203--213, 2000.
    [9]
    J. A. Fisher, P. Faraboschi, and G. Desoli. Custom-fit processors: Letting applications define architectures. In 29th International Symposium on Microarchitecture, pages 324--335, December 1996.
    [10]
    Joseph A. Fisher. Customized instruction-sets for embedded processors. In Proceedings of the Design Automation Conference, 1999, pages 253--257, 1999.
    [11]
    T. Givargis and F. Vahid. Platune: A tuning framework for system-on-a-chip platforms. IEEE Transactions on Computer Aided Design, 21(11), November 2002.
    [12]
    T. Givargis, F. Vahid, and J. Henkel. System-level exploration for pareto-optimal configurations in parameterized systems-on-a-chip. In International Conference on Computer Aided Design, November 2001.
    [13]
    R. E. Gonzalez. Xtensa: A configurable and extensible processor. IEEE Micro, 20(2):60--70, March-April 2000.
    [14]
    G. Hadjiyiannis, P. Russo, and S. Devadas. A methodology for accurate performance evaluation in architecture exploration. In In Proceedings of the Design Automation Conference (DAC 99), pages 927--932, 1999.
    [15]
    M. Itoh, S. Higaki, J. Sato, A. Shiomi, Y. Takeuchi, A. Kitajima, and M. Imai. Effectiveness of the asip design system peas-iii in design of pipelined processors. In In Proceedings of Asia and South Pacific Desing Automation Conference 2001 (ASP--DAC 2001), pages 649--654, 2001.
    [16]
    E. Lawler and D. Wood. Branch and bound methods: A survey. Operations Research, 14(291):699--719, 1966.
    [17]
    S. Leibson. Xscale (strongarm-2) muscles in. Microprocessor Report, September 2000.
    [18]
    T. Morimoto, K. Saito, H. Nakamura, T. Boku, and K. Nakazawa. Advanced processor design using hardware description language aidl. In In Proceedings of Asia and South Pacific Desing Automation Conference 1997 (ASP--DAC 1997), pages 387--390, 1997.
    [19]
    J. Mulder. An area model for on-chip memories and its applications. IEEE Journal of Solid States Circuits, 26(2):98--106, February 1991.
    [20]
    M. Puig-Medina, G. Ezer, and P. Konas. Verification of configurable processor cores. In Proceedings of the Design Automation Conference (DAC2000), pages 426--431, 2000.
    [21]
    G. Reinman and N. Jouppi. Cacti version 2.0. http://www.research.digital.com/wrl/people/jouppi/CACTI.html, June 1999.
    [22]
    S. Santhanam. Strongarm 110: A 160mhz 32b 0.5w cmos arm processor. In Proceedings of HotChips VIII, pages 119--130, 1996.
    [23]
    T. Sherwood and B. Calder. Automated design of finite state machine predictors for customized processors. In Annual International Symposium on Computer Architecture, June 2001.
    [24]
    C. Snyder. Synthesizable core makeover: Is lexra's seven-stage pipelined core the speed king? In Microprocessor Report, June 2001.
    [25]
    C.D. Snyder. Fpga processors cores get serious. Microprocessor Report, 14(9), September 2000.
    [26]
    A. Srivastava and A. Eustace. ATOM: A system for building customized program analysis tools. In Proceedings of the Conference on Programming Language Design and Implementation, pages 196--205. ACM, 1994.
    [27]
    Rabin A. Sugumar and Santosh G. Abraham. Set-associative cache simulation using generalized binomial trees. ACM Transactions on Computer Systems, 13(1):32--56, 1995.
    [28]
    S. Wilton and N. Jouppi. Cacti: An enhanced cache access and cycle time model. In IEEE Journal of Solid-State Circuits, May 1996.
    [29]
    Lisa Wu, Chris Weaver, and Todd Austin. Cryptomaniac: a fast flexible architecture for secure communication. In 28th Annual International Symposium on Computer Architecture, pages 110--119, 2001.

    Cited By

    View all

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CASES '04: Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems
    September 2004
    324 pages
    ISBN:1581138903
    DOI:10.1145/1023833
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 September 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. application specific processor (ASIP)
    2. area minimization
    3. computer architecture
    4. design space exploration
    5. peicewise linear model

    Qualifiers

    • Article

    Conference

    CASES04

    Acceptance Rates

    Overall Acceptance Rate 52 of 230 submissions, 23%

    Upcoming Conference

    ESWEEK '24
    Twentieth Embedded Systems Week
    September 29 - October 4, 2024
    Raleigh , NC , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)A Procedural Approach to Creating American Second Empire HousesJournal on Computing and Cultural Heritage 10.1145/334319613:1(1-19)Online publication date: 5-Feb-2020
    • (2011)Towards automatic generation of formal specifications to validate and verify reliable distributed systemsACM SIGPLAN Notices10.1145/2189751.204788847:3(147-156)Online publication date: 22-Oct-2011
    • (2011)Less is moreACM SIGPLAN Notices10.1145/2189751.204788747:3(137-146)Online publication date: 22-Oct-2011
    • (2009)Energy- and area-efficient architectures through application clustering and architectural heterogeneityACM Transactions on Architecture and Code Optimization10.1145/1509864.15098686:1(1-31)Online publication date: 2-Apr-2009
    • (2006)Automatic application-specific microarchitecture reconfigurationProceedings of the 20th international conference on Parallel and distributed processing10.5555/1898953.1899153(200-200)Online publication date: 25-Apr-2006
    • (2006)Selection of instruction set extensions for an FPGA embedded processor coreProceedings of the 20th international conference on Parallel and distributed processing10.5555/1898953.1899151(199-199)Online publication date: 25-Apr-2006
    • (2006)Efficient design space exploration of high performance embedded out-of-order processorsProceedings of the conference on Design, automation and test in Europe: Proceedings10.5555/1131481.1131578(351-356)Online publication date: 6-Mar-2006
    • (2006)Application-specific customization of parameterized FPGA soft-core processorsProceedings of the 2006 IEEE/ACM international conference on Computer-aided design10.1145/1233501.1233553(261-268)Online publication date: 5-Nov-2006
    • (2006)The Future of SimulationComputer10.1109/MC.2006.40439:11(22-29)Online publication date: 1-Nov-2006
    • (2006)Automatic application-specific microarchitecture reconfigurationProceedings 20th IEEE International Parallel & Distributed Processing Symposium10.1109/IPDPS.2006.1639457(8 pp.)Online publication date: 2006
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media