A Primer on Hardware Prefetching

Publisher: Morgan & Claypool Publishers

ISBN: 978-1-60845-952-0

Published: 01 June 2014

Abstract

Since the 1970s, microprocessor-based digital platforms have been riding Moore's law, allowing for doubling of density for the same area roughly every two years. However, whereas microprocessor fabrication has focused on increasing instruction execution rate, memory fabrication technologies have focused primarily on increasing capacity, with negligible increase in speed. This divergence in performance between processors and memory has led to a phenomenon referred to as the Memory Wall. To overcome the memory wall, designers have resorted to a hierarchy of cache memory levels, which rely on the principle of memory access locality to reduce the observed memory access time and the performance gap between processors and memory. Unfortunately, important workload classes exhibit adverse memory access patterns that baffle the simple policies built into modern cache hierarchies to move instructions and data across cache levels. As such, processors often spend much time idling upon a demand fetch of memory blocks that miss in higher cache levels. Prefetching (predicting future memory accesses and issuing requests for the corresponding memory blocks in advance of explicit accesses) is an effective approach to hide memory access latency. There have been a myriad of proposed prefetching techniques, and nearly every modern processor includes some hardware prefetching mechanisms targeting simple and regular memory access patterns. This primer offers an overview of the various classes of hardware prefetchers for instructions and data proposed in the research literature, and presents examples of techniques incorporated into modern microprocessors.
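
To make the idea of prefetching for "simple and regular memory access patterns" concrete, the following is a minimal software sketch and is not taken from the book. It models a small, fully associative LRU cache and a basic stride prefetcher that fetches the next block once it has seen the same nonzero block stride twice in a row. The names `TinyCache`, `count_misses`, and `BLOCK`, the 64-byte block size, and the 8-block capacity are all assumptions chosen for illustration. The model also assumes prefetched blocks arrive instantly, so it ignores timeliness, bandwidth, and cache pollution, which real hardware prefetchers must handle.

```python
from collections import OrderedDict

BLOCK = 64  # assumed cache block size in bytes (illustrative)


class TinyCache:
    """Fully associative LRU cache of block numbers (hypothetical model)."""

    def __init__(self, num_blocks=8):
        self.num_blocks = num_blocks
        self.blocks = OrderedDict()

    def contains(self, block):
        return block in self.blocks

    def insert(self, block):
        # Add the block (or refresh it) as the most recently used entry.
        self.blocks[block] = True
        self.blocks.move_to_end(block)
        while len(self.blocks) > self.num_blocks:
            self.blocks.popitem(last=False)  # evict least recently used


def count_misses(addresses, prefetch=False, num_blocks=8):
    """Count demand misses, optionally with a simple stride prefetcher."""
    cache = TinyCache(num_blocks)
    misses = 0
    last_block = None
    last_stride = None
    for addr in addresses:
        block = addr // BLOCK
        if not cache.contains(block):
            misses += 1
        cache.insert(block)  # fill on a miss, refresh LRU position on a hit
        if prefetch and last_block is not None:
            stride = block - last_block
            # Predict only after the same nonzero stride repeats.
            if stride != 0 and stride == last_stride:
                cache.insert(block + stride)  # assumed to arrive instantly
            last_stride = stride
        last_block = block
    return misses


if __name__ == "__main__":
    sequential = [i * BLOCK for i in range(16)]
    irregular = [b * BLOCK for b in [0, 7, 3, 12, 5, 20, 9, 31]]
    print(count_misses(sequential), count_misses(sequential, prefetch=True))
    print(count_misses(irregular), count_misses(irregular, prefetch=True))
```

Under these assumptions, the sequential scan of 16 blocks incurs 16 demand misses without prefetching and 3 with it (the first three accesses are needed to establish the stride), while the irregular sequence incurs 8 misses either way. This mirrors the abstract's point that simple prefetchers help regular patterns but not adverse ones.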

Contributors

Babak Falsafi
EPFL
Thomas F Wenisch
University of Michigan, Ann Arbor

Recommendations

Increasing hardware data prefetching performance using the second-level cache

Techniques to reduce or tolerate large memory latencies are critical for achieving high processor performance. Hardware data prefetching is one of the most heavily studied solutions, but it is essentially applied to first-level caches where it can ...

Designing a Modern Memory Hierarchy with Hardware Prefetching

In this paper, we address the severe performance gap caused by high processor clock rates and slow DRAM accesses. We show that, even with an aggressive, next-generation memory system using four Direct Rambus channels and an integrated one-megabyte level-...

Improving memory hierarchy performance with hardware prefetching and cache replacement
