Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/266800.266803acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free access

Alternative fetch and issue policies for the trace cache fetch mechanism

Published: 01 December 1997 Publication History

Abstract

The increasing widths of superscalar processors are placing greater demands upon the fetch mechanism. The trace cache meets these demands by placing logically contiguous instructions in physically contiguous storage. It is capable of supplying multiple fetch blocks each cycle. In this paper we examine two fetch and issue techniques, partial matching and inactive issue, that improve the overall performance of the trace cache by improving the effective fetch rate. We show that for the SPECint95 benchmarks partial matching increases the overall performance by 12% and inactive issue by 15%. Furthermore we apply these two techniques to issue blocks from trace segments which contain multiple execution paths. We conclude with a performance comparison between a trace cache implementing partial matching and inactive issue and an aggressive single block fetch mechanism. The trace cache increases performance by an average of 25% over the instruction cache.

References

[1]
J. O. Bondi, A. K. Nanda, and S. Dutta, "integrating a misprediction recovery cache (mrc) into a superscalar pipeline," in Proceedings of the 29th Annual A CM/IEEE International Symposium on Microarchitecture, pp. 14- 23, December 1996.
[2]
D. Burger, T. Austin, and S. Bennett, "Evaluating future microprocessors: The simplescalar tool set," Technical Report 1308, University of Wisconsin- Madison Technical Report, July 1996.
[3]
P.-Y. Chang, E. Hao, T.-Y. Yeh, and Y. N. Patt, "Branch classification: A new mechanism for improving branch predictor performance," in Proceedlngs of the 27ih Annual A CM/IEEE International Symposium on Microarchileclure, pp. 22- 31, 1994.
[4]
M. Franklin and M. Smotherman, "A fill-unit approach to multiple instruction issue," in Proceedzngs of the 27th Annual A CM/IEEE International Symposium on Microarchilecture, pp. 162- 171, 1994.
[5]
W. W. Hwu and Y. N. Patt, "Checkpoint repair for out-of-order execution machines," in Proceedings of the ldih Annual Inlernational Symposium on Computer Architecture, pp. 18-26, 1987.
[6]
E. Jacobsen, E. Rotenberg, and J. E. Smith, "Assigning confidence to conditional branch predict. ions,'' in micro96, pp. 142-152, 1996.
[7]
J. Keller, The 2126~: A Superscalar Alpha Processor wilh Out-of-Order Execution, Digital Equipment Corporation, Hudson, MA, October 1996.
[8]
S. McFarling, "Combining branch predictors," Technical Report TN-36, Digital Western Research Laboratory, June 1993.
[9]
S. W. Melvin and Y. N. Patt, "Performance benefits of large execution atomic units in dynamically scheduled machines," in Proceedings of the 1989 Inlernatio~al Conference on Supercompuling, pp. 427-432, 1989.
[10]
S. W. Melvin, M. C. Shebanow, and Y. N. Patt, "Hardware support for large atomic units in dynamically scheduled machines," in Proceedings of the ~1st Annual A CM/IEEE International Symposium on Microarchitecture, pp. 60-63, 1988.
[11]
S. J. Patel, D. H. Friendly, and Y. N. Patt, "Critical issues regarding the trace cache fetch mechanism,'' Technical Report CSE-TR-335-97, University of Michigan Technical Report., May 1997.
[12]
Y. Patt, W. Hwu, and M. Shebanow, "HPS, a new microarchitecture: Rationale and introduction,'' in Proceedings of lhe 18th Annual A CM/IEEE Iniernalional Symposium on Microarchiteclure, pp. 103-107, 1985.
[13]
A. Peleg and U. Weiser. Dynamic Flow Instruclion Cache Memory Organized Around Trace Segments Independanl of Virtual Address Line. U.S. Patent Number 5,381,533, 1994.
[14]
E. Rotenberg, S. Bennett, and J. E. Smith, "Trace cache: a low latency approach to high bandwidth instruction fetching," in Proceedings of the 2gth Annual A CM/IEEE Internaiional Symposium on Microarchiiecture, pp. 24- 34, December 1996.
[15]
M. Smotherman and M. Franklin, "Improving cisc instruction decoding performance using a fill unit," in Proceedings of the ~Sth Annual A CM/iEEE International Symposium on Microarchitecture, pp. 219-229, 1995.

Cited By

View all
  • (2014)Software trace cacheACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2667175(261-268)Online publication date: 10-Jun-2014
  • (2014)Author retrospective for software trace cacheACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2594508(45-47)Online publication date: 10-Jun-2014
  • (2006)Wide and efficient trace prediction using the local trace predictorProceedings of the 20th annual international conference on Supercomputing10.1145/1183401.1183411(55-65)Online publication date: 28-Jun-2006
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
December 1997
369 pages
ISBN:0818679778

Sponsors

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 December 1997

Check for updates

Author Tags

  1. high bandwidth fetch mechanisms
  2. inactive issue
  3. partial matching
  4. speculative execution
  5. trace cache
  6. wide issue machines

Qualifiers

  • Article

Conference

MICRO97
Sponsor:
MICRO97: 30th Annual International Symposium on Microarchitecture
December 1 - 3, 1997
North Carolina, Research Triangle Park, USA

Acceptance Rates

MICRO 30 Paper Acceptance Rate 35 of 103 submissions, 34%;
Overall Acceptance Rate 484 of 2,242 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)2
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2014)Software trace cacheACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2667175(261-268)Online publication date: 10-Jun-2014
  • (2014)Author retrospective for software trace cacheACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2594508(45-47)Online publication date: 10-Jun-2014
  • (2006)Wide and efficient trace prediction using the local trace predictorProceedings of the 20th annual international conference on Supercomputing10.1145/1183401.1183411(55-65)Online publication date: 28-Jun-2006
  • (2006)Do trace cache, value prediction and prefetching improve SMT throughput?Proceedings of the 19th international conference on Architecture of Computing Systems10.1007/11682127_17(232-251)Online publication date: 13-Mar-2006
  • (2005)Energy-efficient and high-performance instruction fetch using a block-aware ISAProceedings of the 2005 international symposium on Low power electronics and design10.1145/1077603.1077614(36-41)Online publication date: 8-Aug-2005
  • (2005)Software Trace CacheIEEE Transactions on Computers10.1109/TC.2005.1354:1(22-35)Online publication date: 1-Jan-2005
  • (2005)Improving instruction delivery with a block-aware ISAProceedings of the 11th international Euro-Par conference on Parallel Processing10.1007/11549468_60(530-539)Online publication date: 30-Aug-2005
  • (2003)Effective ahead pipelining of instruction block address generationACM SIGARCH Computer Architecture News10.1145/871656.85964631:2(241-252)Online publication date: 1-May-2003
  • (2003)Effective ahead pipelining of instruction block address generationProceedings of the 30th annual international symposium on Computer architecture10.1145/859618.859646(241-252)Online publication date: 9-Jun-2003
  • (2003)Selecting long atomic traces for high coverageProceedings of the 17th annual international conference on Supercomputing10.1145/782814.782818(2-11)Online publication date: 23-Jun-2003
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media