Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/139669.140380acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

Memory latency effects in decoupled architectures with a single data memory module

Published: 01 April 1992 Publication History

Abstract

Decoupled computer architectures partition the memory access and execute functions in a computer program and achieve high performance by exploiting the fine-grain parallelism between the two. These architectures make use of an access processor to perform the data fetch ahead of demand by the execute process and hence are often less sensitive to memory access delays than conventional architectures. Past performance studies of decoupled computers used memory systems that are interleave or pipelined. We undertake a simulation study of the latency effects in decoupled computers when connected to a single, conventional non-interleaved data memory module so that the effect of decoupling is isolated from the improvement caused by interleaving. We compare decoupled computer performance to single processors with caches, study the memory latency sensitivity of the decoupled systems, and also perform simulations to determine the significance of data caches in a decoupled computer architecture. The Lawrence Livermore Loops and two signal processing algorithms are used as the simulation benchmark.

References

[1]
Alpert, D.B., Flynn M.}., "Performance Trade-offs for Microprocessor Cache Memories", IEEE Micro, August 1988, pp. 44-53.
[2]
Brantley W.C., Weiss J.,"Organization and architecture tradeoffs in FOM", presented at IEEE int. Workshop Comp. Syst. Organization, New Orleans, LA, March 1983.
[3]
Coraor L.D., Hulina P.T. and Mannai D.N.,"A Queue- based Instruction Cache Memory", Proc. of the International Symposium on Computer Architecture and Digital Signal Processing, October 1989, Hong Kong, pp.281-286.
[4]
Cohler E.U., Storer J.E., "Functionally parallel architectures for array processors", IEEE Computer, vol. 14, pp.28-36, Sept. 1981.
[5]
Eickemeyer R.J., Patel J.H., "Performance Evaluation of On-chip Register and Cache Organizations", Proc. 15th Int. Symp. on Computer Architecture, 1988, pp. 64-72.
[6]
Fattens M.K.,Pleszkun A.R., "Improving Performance of Small on-chip Instruction Caches", Proc. 16th Int. Symp. on Computer Architecture, 1989, pp. 234-241.
[7]
Farrens M.K.,Pleszkun A.R., "Implementation of the PIPE processor", IEEE Computer, Jan. 1991, pp. 65-70.
[8]
Goodman J.R., Hsieh J.T., Liou K., Pleszkun A.R., Schechter P.B., Young H.C., "PIPE: A VLSI Decoupled Architecture", 12th Annum International Symposium on Computer Architecture, June 17-19, 1985, Boston, Massachusetts, pp.20-27.
[9]
Hsieh J.T., Pteszkun A.R. and Goodman J.R., "Performance Evaluation of the PIPE Computer Architecture", Technical Report ://=566, Computer Sciences Department, University of Wisconsin- Madison, Nov. 1984.
[10]
Hulina P.T., Coraor L.D., Sun S.W., " Performance Analysis of an Address Generation Coprocessor", IEEE International Conference on Parallel Processing, 1991.
[11]
Kane G., "MIPS RISC Architecture", Prentice- Hall, Englewood Cliffs, N.J., 1988.
[12]
Kurian L., Hulina P.T., Coraor L.D., Mannai D.N., "Classification and Performance Evaluation of Instruction Buffering Techniques", 18th Intl. Symposium on Computer Architecture, May 1991, Toronto, Canada, pp. 150-159.
[13]
Pleszkun A.R., "A Structured Memory Access Architecture", Computer Systems Group report CSG-10, Coordinated Science Lab, University of Illinois, Urbana, Oct 1982.
[14]
Pleszkun A.R., Davidson E.S., "Structured Memory Access Architecture, Proc. IEEE International Conference on Parallel Processing 1983, pp 461-471.
[15]
Pleszkun A.R., Kahalleh B.Z., Davidson E.S., "Features of the Structured Memory Access (SMA) Architecture", Third IEEE Computer Society International Conference, San Francisco, CA, March 1986.
[16]
Shivley R.R., "Architecture of a Programmable Digital Signal Processor", IEEE Trans. on Computers, voi.C-31, Jan. 1982.
[17]
Smith A.J., "Cache Memories", ACM Computing Surveys, Vo1.14, No. 3, September 1982, pp. 473- 53O.
[18]
Smith :I.E., "Decoupled Access/Execute Computer Architecture", ACM Transactions on Computer Systems, Vol.2, No.4, November 1984, pp 289-308.
[19]
Smith J.E., et .al., "The ZS-1 Central Processor", Proceedings of the Second International Conference on Architectural Support for Programming Languages and Operating Systems, Palo Alto, CA, pp 199-204, October 1987.
[20]
Smith A.J., "Line (Block) Size Choice for CPU Cache Memories", IEEE Transactions on Computers, Vol. C-36, No. 9, September 1987, pp. 1063- 1075.
[21]
Smith :I.E., " Dynamic Instruction Scheduhng and the Astronautics ZS-I", IEEE Computer, July 1989.
[22]
Smith J.E., Pleszkun A.R., Katz R.H., Goodman J.R., "PIPE: A High Performance VLSI Architecture", IEEE Workshop on Computer System Organization, New Orleans, LA, pp.131-138, March 1983.
[23]
Smith :I.E., Weiss S. and Pang N.Y, "A Simulation Study of Decoupled Architecture Computers", IEEE Transactions on Computers, Vol.C-35, No.8, August 1986.
[24]
Smith W.M, Abraham S.G. and Davidson E.S, "A performance comparison of the IBM RS/6000 and the Astronautics ZS-I", IEEE Computer, January 1991.
[25]
Smith W.M., Abraham S.G., Davidson E.S., "The Effects of Memory Latency and Fine-Grain Parallelism on Astronautics ZS-1 Performance", Proceedings of the 23rd Annual Hawaii Intl. Conf. on System Sciences, CS Press, Los Alamitos, Calif, Order No.2008, 1990, pp.288-296.
[26]
Sohi G.S, Davidson E.S., "Performance of the Structured Memory Access Architecture", Proc. of the International Conference on Parallel Processing, pp.506-513, August 1984.

Cited By

View all
  • (2019)Stream-based memory access specialization for general purpose processorsProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322229(736-749)Online publication date: 22-Jun-2019
  • (2018)The Processing-in-Memory Paradigm: Mechanisms to Enable AdoptionBeyond-CMOS Technologies for Next Generation Computer Design10.1007/978-3-319-90385-9_5(133-194)Online publication date: 21-Aug-2018
  • (2011)OUTRIDERACM SIGARCH Computer Architecture News10.1145/2024723.200007939:3(117-128)Online publication date: 4-Jun-2011
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ISCA '92: Proceedings of the 19th annual international symposium on Computer architecture
May 1992
439 pages
ISBN:0897915097
DOI:10.1145/139669
  • cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 20, Issue 2
    Special Issue: Proceedings of the 19th annual international symposium on Computer architecture (ISCA '92)
    May 1992
    429 pages
    ISSN:0163-5964
    DOI:10.1145/146628
    Issue’s Table of Contents

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 April 1992

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ISCA92
Sponsor:
ISCA92: International Conference on Computer Architecture
May 19 - 21, 1992
Queensland, Australia

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Upcoming Conference

ISCA '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)42
  • Downloads (Last 6 weeks)11
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Stream-based memory access specialization for general purpose processorsProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322229(736-749)Online publication date: 22-Jun-2019
  • (2018)The Processing-in-Memory Paradigm: Mechanisms to Enable AdoptionBeyond-CMOS Technologies for Next Generation Computer Design10.1007/978-3-319-90385-9_5(133-194)Online publication date: 21-Aug-2018
  • (2011)OUTRIDERACM SIGARCH Computer Architecture News10.1145/2024723.200007939:3(117-128)Online publication date: 4-Jun-2011
  • (2011)OUTRIDERProceedings of the 38th annual international symposium on Computer architecture10.1145/2000064.2000079(117-128)Online publication date: 4-Jun-2011
  • (2010)Speculative-aware executionProceedings of the 19th international conference on Parallel architectures and compilation techniques10.1145/1854273.1854326(421-430)Online publication date: 11-Sep-2010
  • (2005)SSTProceedings of the 2005 International Conference on Computer Design10.1109/ICCD.2005.98(662-665)Online publication date: 2-Oct-2005
  • (1998)Load latency tolerance in dynamically scheduled processorsProceedings of the 31st annual ACM/IEEE international symposium on Microarchitecture10.5555/290940.290973(148-159)Online publication date: 1-Nov-1998
  • (1996)Improving the parallelism and concurrency in decoupled architecturesProceedings of SPDP '96: 8th IEEE Symposium on Parallel and Distributed Processing10.1109/SPDP.1996.570325(130-137)Online publication date: 1996
  • (1995)Program balance and its impact on high performance RISC architecturesProceedings of the 1st IEEE Symposium on High-Performance Computer Architecture10.5555/527072.822633Online publication date: 22-Jan-1995
  • (1995)Program balance and its impact on high performance RISC architecturesProceedings of 1995 1st IEEE Symposium on High Performance Computer Architecture10.1109/HPCA.1995.386526(370-379)Online publication date: 1995

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media