Correlation-based hardware prefetching

October 1996

Author:
Mary Jay Charney
Cornell Univ.

Publisher:

Cornell University
PO Box 250, 124 Roberts Place Ithaca, NY
United States

Order Number: UMI Order No. GAX95-42429

Abstract

Correlation-based prefetching is a technique to observe and record spatial links between temporal events and address references in a processor with the goal of predicting what data will be needed by the processor. The main contribution of this dissertation is that correlation-based prefetching is shown to be an effective technique for prefetching data into a data cache, by observing the cache-miss traffic.
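
As a rough illustration of the idea, the sketch below models a pair-based correlation table in Python. It is a minimal sketch under stated assumptions, not the dissertation's design: the class name `CorrelationPrefetcher`, the `capacity` parameter, the least-recently-updated eviction policy, and the choice of the previous miss address as the trigger event are all hypothetical.

```python
from collections import OrderedDict


class CorrelationPrefetcher:
    """Toy model: remember which miss address followed each miss address."""

    def __init__(self, capacity=1024):
        self.capacity = capacity      # hypothetical table size, in pairs
        self.table = OrderedDict()    # trigger miss address -> successor miss address
        self.last_miss = None

    def on_miss(self, addr):
        """Record the (previous miss, this miss) pair, then return the
        address recorded as the successor of `addr`, or None."""
        if self.last_miss is not None:
            self.table[self.last_miss] = addr
            self.table.move_to_end(self.last_miss)
            if len(self.table) > self.capacity:
                self.table.popitem(last=False)  # evict least recently updated pair
        self.last_miss = addr
        return self.table.get(addr)


def replay(misses, capacity=1024):
    """Count the misses that the table predicted from the preceding miss."""
    prefetcher = CorrelationPrefetcher(capacity)
    predicted = None
    hits = 0
    for addr in misses:
        if predicted == addr:
            hits += 1
        predicted = prefetcher.on_miss(addr)
    return hits
```

Replaying a pointer-chasing miss sequence twice, as in `replay([0x100, 0x9F0, 0x230] * 2)`, predicts the second and third misses of the second pass. The model ignores the fact that a successful prefetch would remove the miss from the observed stream.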

This thesis examines the issues that are critical to the performance of correlation-based prefetching. First, it studies the intrinsic qualities of test programs that indicate the potential for using correlations for prefetching, measured with metrics of linked and strided spatial locality.

Hardware prefetching works because there often exist predictable patterns in the references of programs. Hardware correlation-based prefetching seeks to allow prefetching in situations not easily handled by compilers or characterized by simple stride patterns. It is shown to be very efficient in such cases. Correlation-based prefetching complements compiler-controlled prefetching and stride-based prefetching done in hardware.

Specifically, this thesis examines mechanisms for construction of temporal events that are used as triggers for prefetching. These temporal events should be good indicators of the context of a program. A flexible mechanism for pairing temporal events with addresses, called the horizon and skip model, is shown to be useful for increasing the accuracy of prefetching for some applications.

Confirmation mechanisms are studied. They allow for a tradeoff between the coverage of memory accesses predicted and accuracy of prefetching.
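
One way to picture such a tradeoff is a table that issues a prefetch only after a pair has been observed a minimum number of times. The sketch below is an assumption made for illustration, not the confirmation mechanism studied in the thesis: the names `ConfirmedPairTable`, `threshold`, and `count_prefetches` are hypothetical.

```python
class ConfirmedPairTable:
    """Toy pair table that prefetches only from confirmed pairs."""

    def __init__(self, threshold=2):
        self.threshold = threshold  # hypothetical: sightings required before use
        self.entries = {}           # trigger -> [successor, consecutive sightings]
        self.last = None

    def on_miss(self, addr):
        """Update the pair for the previous miss, then return a prefetch
        candidate for `addr` if its pair has been confirmed, else None."""
        if self.last is not None:
            entry = self.entries.get(self.last)
            if entry is not None and entry[0] == addr:
                entry[1] += 1                        # same successor seen again
            else:
                self.entries[self.last] = [addr, 1]  # new or changed successor
        self.last = addr
        entry = self.entries.get(addr)
        if entry is not None and entry[1] >= self.threshold:
            return entry[0]
        return None


def count_prefetches(misses, threshold):
    """Return (prefetches issued, prefetches that matched the next miss)."""
    table = ConfirmedPairTable(threshold)
    issued = correct = 0
    predicted = None
    for addr in misses:
        if predicted is not None:
            issued += 1
            correct += int(predicted == addr)
        predicted = table.on_miss(addr)
    return issued, correct
```

On the repeating sequence `[1, 2, 3] * 3`, a threshold of 1 yields 5 correct prefetches and a threshold of 2 yields only 2, so confirmation costs coverage. On the alternating sequence `[1, 2, 1, 5] * 2`, a threshold of 1 issues 4 prefetches of which 1 is correct, while a threshold of 2 issues none, so confirmation suppresses the inaccurate ones.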

An important issue in prefetching is where to store the accumulated prefetching meta-data. We examine the effects of varying the capacity of a table used to store this meta-data, as well as a novel mechanism for storing the meta-data in a large, pre-existing secondary cache. We also examine an alternative approach: a hybrid prefetcher that combines a stride prefetcher with a pure correlation-based prefetcher.

The hybrid mechanism is shown to dramatically reduce the pair storage requirements of correlation-based prefetching.
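
To see why a hybrid could need fewer stored pairs, consider the following sketch, which skips recording a pair whenever the current miss continues the stride of the previous one. This is a simplified assumption, not the hybrid mechanism evaluated in the thesis: the function `pairs_stored`, the flag `use_stride_filter`, and the single global stride detector are hypothetical.

```python
def pairs_stored(misses, use_stride_filter):
    """Count the pair-table entries needed for a miss sequence.

    With the filter enabled, a transition whose address delta equals the
    previous delta is assumed to be covered by a stride prefetcher, so no
    correlation pair is recorded for it.
    """
    pairs = {}                 # trigger miss address -> successor miss address
    prev = prev_delta = None
    for addr in misses:
        if prev is not None:
            delta = addr - prev
            strided = use_stride_filter and delta == prev_delta
            if not strided:
                pairs[prev] = addr
            prev_delta = delta
        prev = addr
    return len(pairs)
```

For ten misses at a 64-byte stride followed by three irregular misses, `list(range(0, 640, 64)) + [5000, 1234, 77]`, the unfiltered table holds 12 pairs and the filtered one holds 4.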

Index Terms

Correlation-based hardware prefetching
1. Applied computing
  1. Physical sciences and engineering
    1. Electronics

Recommendations

Increasing hardware data prefetching performance using the second-level cache

Techniques to reduce or tolerate large memory latencies are critical for achieving high processor performance. Hardware data prefetching is one of the most heavily studied solutions, but it is essentially applied to first-level caches where it can ...

Effective cache prefetching on bus-based multiprocessors

Compiler-directed cache prefetching has the potential to hide much of the high memory latency seen by current and future high-performance processors. However, prefetching is not without costs, particularly on a shared-memory multiprocessor. Prefetching ...

Improving memory hierarchy performance with hardware prefetching and cache replacement
