Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/143369.143419acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
Article
Free access

Register requirements of pipelined processors

Published: 01 August 1992 Publication History
  • Get Citation Alerts
  • Abstract

    To enable concurrent instruction execution, scientific computers generally rely on pipelining, which combines with faster system clocks to achieve greater throughput. Each concurrently executing instruction requires buffer space, usually implemented as a register, to receive its result. This paper focuses on the issue of how many registers are required to achieve optimal performance in pipelined scientific computers. Four machine models are considered: single, double, and triple issue scalar machines, and vector machines with various register lengths. A model is presented that accurately relates the register requirements for optimum performance cyclically scheduled loops with tree-dependence graphs to the degree of function unit pipelining, the instruction issue bandwidth, and code properties. A method for finding upper and lower bounds on the minimum register requirements is also presented.
    The result of this work is a theory for assessing register requirements that can be used to reveal fundamental differences among machines within a space of architectural and implementation design choices. Some experimental data is also provided to support the theory.

    References

    [1]
    M. J. Flynn, "Very High-Speed Computing Systems," Proc. IEEB, vol. 54, pp. 1901-1909, December 1966.
    [2]
    R. P. Colwell, R. P. Nix, J. J. O'Donnell, D. B. Papworth, and P. K. Rodman, "A VLIW Architecture for a Trace Scheduling Compiler," in Proc. ASPLOS.II, pp. 180-192, 1982.
    [3]
    B. J. Smith, "Architecture and Applications of the HEP Multiprocessor Computer System," Real Time Signal Procesing IV, vol. 298, August 1981.
    [4]
    Y. N. Patt, W.-M. W. Hwu, and M. Shebanow, "HPS, a New Microarchitecture: Rationale and Introduction," in Proc. of 18th Microprogramming Workshop, 1985.
    [5]
    D. W. Anderson, F. J. Sparacio, and R. M. Tomasulo, "IBM System/360 Model 91: Machine Philosophy and Instruction Handling," IBM Journal of Research and Development, pp. 8-24, January 1967.
    [6]
    W. H. Mangione-Smith, S. G. Abraham, and E. S. Davidson, "Vector Register Design for Polycyclic Vector Scheduling," in Proc. Fourth Conference on Architectural ~upport for Programming Languages and Operating Systems, April 1991.
    [7]
    P. Y. Hsu, Highly Concurrent Scalar Processing. PAD thesis, Coordinated Science Laboratory Report ~CSG-49, University of Illinois at Urbana- Champaign, 1986.
    [8]
    R. F. Touzeau, "A Fortran Compiler for the FPS- 164 Scientific Computer," in Proc. A CM SIGPLAN '84 Syrup on Compiler Construction, pp. 48-57, June 1984.
    [9]
    J. H. Patel and E. S. Davidson, "Improving the Throughput of a Pipeline by Insertion of Delays," in Proc. Third Annual ISCA, pp. 159-164, 1976.
    [10]
    B. R. Ran, C. D. Glaeser, and R. L. Picard, "Efficient Code Generation for Horizontal Architectures: Compiler Techniques and Architectural Support," in Proc. Int. Symp. on Computer Architecture, pp. 295-304, 1982.
    [11]
    B. R. Ran, D. W. L. Yen, W. Yen, and R. A. Towle, "The Cydra 5 Departmental Supercomputer," IEEE Computer, pp. 12-35, 1989.
    [12]
    C. Eisenbeis, W. Jalby, and A. Lichnewsky, "Compile-Time Optimization of Memory and Register Usage on the Cray-2," in Proceedings of the Second Workshop on Languages and Compilers, Urbana llinois, August 1989.
    [13]
    M. Lam, A Systolic Array Optimizing Compiler. PAD thesis, Carnegie Mellon University, 1987.
    [14]
    M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP- Completeness. W. H. Freeman and Company, 1979.
    [15]
    J. E. Smith, "Decoupled Access/Execute Architecture Computer Architectures," A CM Transactions on Computer Systems, pp. 289-308, November 1984.
    [16]
    W. H. Mangione-Smith, S. G. Abraham, and E. S. Davidson, "A Performance Comparison of the IBM RS/6000 and the Astronautics ZS-I," IEEE Computer, vol. 24, January 1991.
    [17]
    W. H. Mangione-Smith, Performance Bounds and Buffer Space Requirements for Concurrent Processors. PAD thesis, Univ. of Mich., EECS Dept., Univ. of Mich., Ann Arbor, MI, 1992.

    Cited By

    View all
    • (2016)Minimizing Register Requirements of a Modulo Schedule via Optimum Stage SchedulingInternational Journal of Parallel Programming10.1007/BF0335674424:2(103-132)Online publication date: 26-May-2016
    • (2014)Optimum modulo schedules for minimum register requirementsACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2667171(227-236)Online publication date: 10-Jun-2014
    • (2014)Author retrospective for optimum modulo schedules for minimum register requirementsACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2591653(35-36)Online publication date: 10-Jun-2014
    • Show More Cited By

    Index Terms

    1. Register requirements of pipelined processors

                            Recommendations

                            Comments

                            Information & Contributors

                            Information

                            Published In

                            cover image ACM Conferences
                            ICS '92: Proceedings of the 6th international conference on Supercomputing
                            August 1992
                            495 pages
                            ISBN:0897914856
                            DOI:10.1145/143369
                            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                            Sponsors

                            Publisher

                            Association for Computing Machinery

                            New York, NY, United States

                            Publication History

                            Published: 01 August 1992

                            Permissions

                            Request permissions for this article.

                            Check for updates

                            Qualifiers

                            • Article

                            Conference

                            ICS92
                            Sponsor:

                            Acceptance Rates

                            Overall Acceptance Rate 629 of 2,180 submissions, 29%

                            Contributors

                            Other Metrics

                            Bibliometrics & Citations

                            Bibliometrics

                            Article Metrics

                            • Downloads (Last 12 months)54
                            • Downloads (Last 6 weeks)6
                            Reflects downloads up to 12 Aug 2024

                            Other Metrics

                            Citations

                            Cited By

                            View all
                            • (2016)Minimizing Register Requirements of a Modulo Schedule via Optimum Stage SchedulingInternational Journal of Parallel Programming10.1007/BF0335674424:2(103-132)Online publication date: 26-May-2016
                            • (2014)Optimum modulo schedules for minimum register requirementsACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2667171(227-236)Online publication date: 10-Jun-2014
                            • (2014)Author retrospective for optimum modulo schedules for minimum register requirementsACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2591653(35-36)Online publication date: 10-Jun-2014
                            • (2011)Minimizing Schedule Length via Cooperative Register Allocation and Loop Scheduling for Embedded SystemsProceedings of the 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications10.1109/TrustCom.2011.142(1038-1044)Online publication date: 16-Nov-2011
                            • (2007)On Periodic Register Need in Software PipeliningIEEE Transactions on Computers10.1109/TC.2007.7075256:11(1493-1504)Online publication date: 1-Nov-2007
                            • (2005)Using sacks to organize registers in VLIW machinesParallel Processing: CONPAR 94 — VAPP VI10.1007/3-540-58430-7_55(628-639)Online publication date: 3-Jun-2005
                            • (2004)Register Constrained Modulo SchedulingIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2004.127809915:5(417-430)Online publication date: 1-May-2004
                            • (2003)MIRS: Modulo Scheduling with Integrated Register SpillingLanguages and Compilers for Parallel Computing10.1007/3-540-35767-X_16(239-253)Online publication date: 13-May-2003
                            • (2002)Partitioning and scheduling DSP applications with maximal memory access hidingEURASIP Journal on Advances in Signal Processing10.5555/1283100.12831942002:1(926-935)Online publication date: 1-Jan-2002
                            • (2001)MIRSProceedings of the 14th international conference on Languages and compilers for parallel computing10.5555/1769331.1769347(239-253)Online publication date: 1-Aug-2001
                            • Show More Cited By

                            View Options

                            View options

                            PDF

                            View or Download as a PDF file.

                            PDF

                            eReader

                            View online with eReader.

                            eReader

                            Get Access

                            Login options

                            Media

                            Figures

                            Other

                            Tables

                            Share

                            Share

                            Share this Publication link

                            Share on social media