Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/74925.74956acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

Exploring the benefits of multiple hardware contexts in a multiprocessor architecture: preliminary results

Published: 01 April 1989 Publication History

Abstract

A fundamental problem that any scalable multiprocessor must address is the ability to tolerate high latency memory operations. This paper explores the extent to which multiple hardware contexts per processor can help to mitigate the negative effects of high latency. In particular, we evaluate the performance of a directory-based cache coherent multiprocessor using memory reference traces obtained from three parallel applications. We explore the case where there are a small fixed number (2-4) of hardware contexts per processor and the context switch overhead is low. In contrast to previously proposed approaches, we also use a very simple context switch criterion, namely a cache miss or a write-hit to shared data. Our results show that the effectiveness of multiple contexts depends on the nature of the applications, the context switch overhead, and the inherent latency of the machine architecture. Given reasonably low overhead hardware context switches, we show that two or four contexts can achieve substantial performance gains over a single context. For one application, the processor utilization increased by about 46% with two contexts and by about 80% with four contexts.

References

[1]
Anant Agarwal, Richard Simoni, John Hennessy, and Mark Horowitz. An Evaluation of Directory Schemes for Cache Coherence. In 15th International Symposium on Computer Architecture, 1988.
[2]
Arvind and R. A. Iannucci. A Critique of MultiProcessing von Neumann Style. In 10th International Symposium on Computer Architecture, pages 42w36, 1983.
[3]
W. Crowther, J. Goodhue, E. Starr, R. Thomas, W. Millnen, and T. Blackadar. Performance Measurements on a 128-node Butterfly Parallel Processor. In Intl. Conf. on Parallel Processing, pages 531-540, 1985.
[4]
WiIliam J. Dally et al. Architecture of a Message-Driven Processor. The 14th Annual International Symposium on Computer Architecture, pages 189-196, June 1987.
[5]
Stephen R. Goldschmidt. Simulating Multiprocessor Memory Traces. EE390 Report, Stanford University, December 1987.
[6]
T. Gross, J. Hennessy, S. Pnybylski, and C. Rowen. Measurement and Evaluation of the MIPS Architecture and Processor. ACM TOCS, 6, August 1988.
[7]
Anoop Gupta et al. Parallel Implementation of OPS5 on the Encore Multiprocessor: Results and Analysis. Znternutional Journal of Parallel Programming, 17, 1988.
[8]
R. H. Halstead and T. Fujita MAS,A: A Multithreaded Processor Architecture for Parallel Symbolic Computing. In 15th Znternational Symposium on Compuler Architecture, pages 443- 451, 1988.
[9]
J. P. Hayes et al. A Microprocessor Based Hybrid Supercomputer. IEEE Micro, 6, October 1986.
[10]
S. K. Heller. An I-Structure Memory Controller (ISMC). Technical report, Massachusetts Institute of Technology, June 1983.
[11]
R. A. Iannucci. Toward a Dataflow / von Neumann Hybrid Architecture. In 15th International Symposium on Computer Architecture, pages 131-140, 1988.
[12]
Robert A. Iannucci. A Dataflow f van Neumann Hybrid Architecture. PhD thesis, Massachusetts Institute of Technology, 1988.
[13]
Jeffrey D. McDonald and Donald Baganoff. Vectorization of a Particle Simulation Method for Hypersonic Rarified Flow. In AUA Thermdynamics, Plasmadynamics and Lasers Conference, June 1988.
[14]
D. Patterson. Reduced Instruction Set Computers. Comm. ACM, 28, January 1985.
[15]
G.F. PEster, WC. Brantley, et al. The IBM Research Parallel Processor Prototype (RP3): Introduction and Architecture. In International Conference on Parallel Processing. IEEE, 1985.
[16]
Jonathan Rose. LocusRoute: A Parallel Global Router for Standard Cells. In Design Automation Conference, pages 189-195, June 1988.
[17]
Jonathan Rose. The Parallel Decomposition and Implementation of an Integrated Circuit Global Router. In Proc ACM SIGPLAN PPEALS, pages 138-145, July 1988.
[18]
Charles L. Seitz, WiIliam C. Athas, Charles M. Flaig, Alain J. Martin, Jakov Seizovic, Craig S. Steele, and Wen-King Su. The Arehiteetun and Programming of the Ametek Series 2010 Multicomputer. In Hypercube Concurrent Computers and Applications. 1988.
[19]
B. J. Smith. Architecture and applications of the HEP multiprocessor computer system. In SPIE, volume 298, pages 241-248, 1981.
[20]
Larry Soule and Anoop Gupta Characterization of Parallelism and Deadlocks in Distributed Digital Logic Simulation. ln 26th Design Automation Conference, June 1989.
[21]
C. P. Thacker, E. M. McCreight, et al. Alto: A Personal Computer. Inc. Gordon Bell Daniel P. Siewiorek and Allen Newell, editors, Computer Strut tures: Principles and Examples, pages 549-572. McGraw-Hill, 1982.
[22]
Andrew Tucker and Anoop Gupta Process Control and Scheduling Issues for Multiprogrammed Shared-Memory Multiprocessors. submitted for publication, March 1989.

Cited By

View all
  • (2020)Processing in storage class memoryProceedings of the 12th USENIX Conference on Hot Topics in Storage and File Systems10.5555/3488733.3488746(13-13)Online publication date: 13-Jul-2020
  • (2014)A survey of computational intelligence approaches for software reliability predictionACM SIGSOFT Software Engineering Notes10.1145/2579281.257929339:2(1-10)Online publication date: 29-Mar-2014
  • (2014)Neural network based black box testingACM SIGSOFT Software Engineering Notes10.1145/2579281.257929239:2(1-6)Online publication date: 29-Mar-2014
  • Show More Cited By

Index Terms

  1. Exploring the benefits of multiple hardware contexts in a multiprocessor architecture: preliminary results

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ISCA '89: Proceedings of the 16th annual international symposium on Computer architecture
      April 1989
      426 pages
      ISBN:0897913191
      DOI:10.1145/74925
      • cover image ACM SIGARCH Computer Architecture News
        ACM SIGARCH Computer Architecture News  Volume 17, Issue 3
        Special Issue: Proceedings of the 16th annual international symposium on Computer Architecture
        June 1989
        400 pages
        ISSN:0163-5964
        DOI:10.1145/74926
        Issue’s Table of Contents

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 April 1989

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 543 of 3,203 submissions, 17%

      Upcoming Conference

      ISCA '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)215
      • Downloads (Last 6 weeks)45
      Reflects downloads up to 28 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)Processing in storage class memoryProceedings of the 12th USENIX Conference on Hot Topics in Storage and File Systems10.5555/3488733.3488746(13-13)Online publication date: 13-Jul-2020
      • (2014)A survey of computational intelligence approaches for software reliability predictionACM SIGSOFT Software Engineering Notes10.1145/2579281.257929339:2(1-10)Online publication date: 29-Mar-2014
      • (2014)Neural network based black box testingACM SIGSOFT Software Engineering Notes10.1145/2579281.257929239:2(1-6)Online publication date: 29-Mar-2014
      • (2013)A first measurement look at the deployment and evolution of thelocator/id separation protocolACM SIGCOMM Computer Communication Review10.1145/2479957.247996343:2(37-43)Online publication date: 29-Apr-2013
      • (2013)Routing topology inference for wireless sensor networksACM SIGCOMM Computer Communication Review10.1145/2479957.247996143:2(21-28)Online publication date: 29-Apr-2013
      • (2013)Understanding the latency benefits of multi-cloud webservice deploymentsACM SIGCOMM Computer Communication Review10.1145/2479957.247996043:2(13-20)Online publication date: 29-Apr-2013
      • (2013)A forensic case study on as hijackingACM SIGCOMM Computer Communication Review10.1145/2479957.247995943:2(5-12)Online publication date: 29-Apr-2013
      • (2010)A low cost split-issue technique to improve performance of SMT clustered VLIW processors2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)10.1109/IPDPS.2010.5470351(1-12)Online publication date: May-2010
      • (2009)Hybrid multithreading for VLIW processorsProceedings of the 2009 international conference on Compilers, architecture, and synthesis for embedded systems10.1145/1629395.1629403(37-46)Online publication date: 11-Oct-2009
      • (2009)Thread Merging Schemes for Multithreaded Clustered VLIW ProcessorsProceedings of the 2009 International Conference on Parallel Processing10.1109/ICPP.2009.48(445-452)Online publication date: 22-Sep-2009
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media