Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Limits on multiple instruction issue

Published: 01 April 1989 Publication History
  • Get Citation Alerts
  • Abstract

    This paper investigates the limitations on designing a processor which can sustain an execution rate of greater than one instruction per cycle on highly-optimized, non-scientific applications. We have used trace-driven simulations to determine that these applications contain enough instruction independence to sustain an instruction rate of about two instructions per cycle. In a straightforward implementation, cost considerations argue strongly against decoding more than two instructions in one cycle. Given this constraint, the efficiency in instruction fetching rather than the complexity of the execution hardware limits the concurrency attainable at the instruction level.

    References

    [1]
    R.D. Acosta, J. Kjelstrup, and H.C. Torng, "An Instruction Issuing Approacl~ to Enhancing Performance ill I~'iultiple Functional Unit Processors". }EEE Transactions on Computers, Vol. C-35 (September 1986), pp. 815-828.
    [2]
    A.V. Aho, R. Sethi. and J.D. Ullman, Compilers Principles, Techniques, and Tools. Addison-Wesley Publishing Company, 1986.
    [3]
    Apollo Computer Inc. Marketing Brochure. The series 10000 Personal Supercomputer. Chelmsford. MA, 1988.
    [4]
    C.C. Foster and E.M. Riseman, "Percolation of Code to Enhance Parallel Dispatching and Execution IEEE Transactions on. Computers, Vol. C-21 (December 1972), pp. 1411-1415.
    [5]
    J.L. Hennessy, "RISC-Based Processors: Concepts and Prospects". New Froniiers in Corn p,ler Architecture Conference Proceedings (hlarch 1986), pp. 95- 103.
    [6]
    R.M. Keller, "Look-Ahead Processors". Computing Surveys, Vol.7, No.4 (December 1975). pp. 177-195.
    [7]
    D.J. Kuck. Y. Muraol,'a, and S. C. hen, "On the Number of Opera. Lions Simultaneously Executable in Fortran-like Programs and Their resulting Speedup". IEEE Transaciions on computers, Vol. C-21 (December 1972), pp. 1293--1310.
    [8]
    J.K.F. Lee and A.J. Smith, "Branch Prediction Strategies and Branch Target Buffer Design" IEEE Computer (January 1984). pp. 6-22.
    [9]
    L. Logrippo. "Renamillg in Program Schemas". Proceedings of the IEEE 13th Annual Symposium on switching and Automata Theory. (October 1972), Pp. 67- 70.
    [10]
    S. McFarling and J. Hennessy, "Reducing t. he Cost of Branches". Proc. 13th Annnual Symposium on Computer Architccture (June 1986), pp. 396-404.
    [11]
    MIPS Computer Systems, Inc., MIPS Language Programmers Guide (1986).
    [12]
    A. Nicolau and J.A. Fisher, "Mea.suring tile Parallelism Availa.ble for Very Long Instruction Word Architectures". IEEE Transactions on Computers. vol. C-33 (November 1984), pp. 968-976.
    [13]
    E.hl. Riselnan and C.C. Foster, "The Inhibition of Potential Parallelism by Conditional Pumps". IEEE Transactions on Computers, Vol. C-21 (December 1972), pp. 1405-1411.
    [14]
    G.A. Slaxenburg, Phillips Research Laboratories Sunnyvale, Signetics Corporation, Sunnyvale, CA. Personal Correspondence, 12 May 1988.
    [15]
    J.E. Smith, et. a l, "The ZS-1 Central Processor". Proceedings. Second Internalional Conference on Architectural Support for Programming Languages and Operating Systems (October 1987), pp. 199- 204.
    [16]
    G.S. Sohi and S. Vajapeyam, "Instruction Issue Logic for High-Performance, Interruptable Pipelined Processors". Proceedings, 14th Annual International Symposium on Computer Architecture (June 1987), pp.27-34.
    [17]
    G.S. Tjaden and M.J. Flynn, "Detection and Parallel Execution of Independent. Instructions". IEEE Transacactions on Computers, Vol. C-19 (October 1970), pp. 889-895.
    [18]
    R.M. Tomasulo, "An Efficient Algorithm for Exploiting Multiple Arithmetic Units'. IBM Journal, Vol. 11 (January 1967), pp. 25-33.
    [19]
    S. Weiss and J.E. Smith, "Instrtlction Issue Logic in Pipelined Supercomputers". IEEE Transaction"s on Computer's, Vol. C-33 (November 1984), pp. 1013-1022.

    Cited By

    View all
    • (2000)Limits and Graph Structure of Available Instruction-Level ParallelismEuro-Par 2000 Parallel Processing10.1007/3-540-44520-X_144(1018-1022)Online publication date: 18-Aug-2000
    • (1999)Branch Prediction, Instruction-Window Size, and Cache SizeIEEE Transactions on Computers10.1109/12.81111548:11(1260-1281)Online publication date: 1-Nov-1999
    • (1998)Modeled and Measured Instruction Fetching Performance for Superscalar MicroprocessorsIEEE Transactions on Parallel and Distributed Systems10.1109/71.6894449:6(570-578)Online publication date: 1-Jun-1998
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 17, Issue 2
    Special issue: Proceedings of ASPLOS-III: the third international conference on architecture support for programming languages and operating systems
    April 1989
    291 pages
    ISSN:0163-5964
    DOI:10.1145/68182
    Issue’s Table of Contents
    • cover image ACM Conferences
      ASPLOS III: Proceedings of the third international conference on Architectural support for programming languages and operating systems
      April 1989
      303 pages
      ISBN:0897913000
      DOI:10.1145/70082
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 April 1989
    Published in SIGARCH Volume 17, Issue 2

    Check for updates

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)108
    • Downloads (Last 6 weeks)9

    Other Metrics

    Citations

    Cited By

    View all
    • (2000)Limits and Graph Structure of Available Instruction-Level ParallelismEuro-Par 2000 Parallel Processing10.1007/3-540-44520-X_144(1018-1022)Online publication date: 18-Aug-2000
    • (1999)Branch Prediction, Instruction-Window Size, and Cache SizeIEEE Transactions on Computers10.1109/12.81111548:11(1260-1281)Online publication date: 1-Nov-1999
    • (1998)Modeled and Measured Instruction Fetching Performance for Superscalar MicroprocessorsIEEE Transactions on Parallel and Distributed Systems10.1109/71.6894449:6(570-578)Online publication date: 1-Jun-1998
    • (1996)Energy dissipation in general purpose microprocessorsIEEE Journal of Solid-State Circuits10.1109/4.53541131:9(1277-1284)Online publication date: Jan-1996
    • (1996)Efficient Exploitation of Instruction-Level Parallelism for Superscalar Processors by the Conjugate Register File SchemeIEEE Transactions on Computers10.1109/12.48556745:3(278-293)Online publication date: 1-Mar-1996
    • (1995)Three Architectural Models for Compiler-Controlled Speculative ExecutionIEEE Transactions on Computers10.1109/12.37616444:4(481-494)Online publication date: 1-Apr-1995
    • (1991)How many operation units are adequate?ACM SIGARCH Computer Architecture News10.1145/122576.12258619:4(94-108)Online publication date: 1-Jul-1991
    • (2023)EVMTracer: Dynamic Analysis of the Parallelization and Redundancy Potential in the Ethereum Virtual MachineIEEE Access10.1109/ACCESS.2023.326727711(47159-47178)Online publication date: 2023
    • (2016)Bounded distortion parametrization in the space of metricsACM Transactions on Graphics10.1145/2980179.298242635:6(1-16)Online publication date: 5-Dec-2016
    • (2016)Scalable inside-out image-based renderingACM Transactions on Graphics10.1145/2980179.298242035:6(1-11)Online publication date: 5-Dec-2016
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media