Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2791188.2791197guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article
Free access

Algorithms to take advantage of hardware prefetching

Published: 06 January 2007 Publication History

Abstract

Cache-oblivious and cache-aware algorithms have been developed to minimize cache misses. Some of the newest processors have hardware prefetching where cache misses are avoided by predicting ahead of time what memory will be needed in the future and bringing that memory into the cache before it is used. It is shown that hardware prefetching permits the standard Floyd-Warshall algorithm for all-pairs shortest paths to outperform cache-oblivious and cache-aware algorithms. A simple improvement to the standard simple dynamic programming algorithm yields an algorithm that takes advantage of prefetching, and outperforms cache-oblivious and cache-aware algorithms. Finally, it is shown that variants of standard FFT algorithms exhibit good prefetching performance.

References

[1]
C. Cherng and R. E. Ladner, Cache efficient simple dynamic programming, Proceedings of the International Conference on the Analysis of Algorithms, 2005, pp. 49--58.
[2]
R. A. Chowdhury and V. Ramachandran, Cache-oblivious dynamic programming, Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), 2006, pp. 591--600.
[3]
T. Cormen, C. Leiserson, R. Rivest, and C. Stein, Introduction to Algorithms, MIT Press, 2nd ed., 2001.
[4]
M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-oblivious algorithms, Proceedings of the 40th Annual Symposium on Foundations of Computer Science (FOCS), 1999, pp. 17--18.
[5]
M. E. Furman, Application of fast multiplication of matrices in the problem of finding the transitive closure of a graph, Dokl. Akad Nauk SSSR, 194:524 (Russian), Soviet Math. Dokl., 11(5):1252, 1970.
[6]
Intel 64 and IA-32 Architectures Software Developer's Manual, Volume 3B, http://www.intel.com/design/processor/manuals/253669.pdf.
[7]
G. Hinton, D. Sager, M. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Roussel, The Microarchitecture of the Pentium 4 Processor, http://www.intel.com/.
[8]
T. Kasami, An efficient recognition and syntax algorithm for context-free languages, Scientific Report AFCRL-65-758, Air Force Cambridge Research Laboratory, Bedford, Mass., 1965.
[9]
I. Munro, Efficient determination of the transitive closure of a directed graph, Information Processing Letters, 1(2):56--58, 1971.
[10]
J.-S. Park, M. Penner and V. K. Prasanna, Optimizing graph algorithms for improved cache performance, IEEE Transactions on Parallel and Distributed Systems, vol. 15(9), pp. 769--782, 2004.
[11]
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, Numerical Recipes in C: The Art of Scientific Computing, Second Edition, Cambridge University Press, Cambridge, UK, 1992.
[12]
L. G. Valiant, General context-free recognition in less than cubic time, Journal of Computer and Systems Sciences, 10:308--315, 1975.
[13]
D. H. Younger, Recognition of context-free languages in time n3, Information and Control, 10(2):189--208, 1967.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
Proceedings of the Meeting on Algorithm Engineering & Expermiments
January 2007
163 pages

Publisher

Society for Industrial and Applied Mathematics

United States

Publication History

Published: 06 January 2007

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 87
    Total Downloads
  • Downloads (Last 12 months)40
  • Downloads (Last 6 weeks)10
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media