Compile-time performance prediction of scientific programs

January 2000

Author:
Gheorghe Calin Cascaval
Adviser:
David A Padua

Publisher:

University of Illinois at Urbana-Champaign
Champaign, IL
United States

ISBN: 978-0-599-97401-2

Order Number: AAI9989955

Pages: 124


Abstract

In this dissertation we present a compile-time performance prediction environment for Fortran scientific programs. The performance data are expressed as symbolic expressions, with variables for program constructs, input data size, and machine parameters. We focus on modeling the processor and its memory hierarchy. The results from the static estimation can be used to drive optimizations or can be displayed using performance visualization tools. The integration of our model within the Delphi system allows the user to do performance tuning and scalability analysis faster and more easily than by using instrumentation.

The main contribution of this work is the estimation of cache behavior using the stack distances algorithm. We have designed and implemented an algorithm that computes the stack histogram at compile time. We use the stack histogram to predict program performance statically with very good accuracy. Experimental results are presented for two processor/memory architectures, the MIPS R10000 and the UltraSparc IIi. The most interesting feature of the stack algorithm is that once the histogram is computed, the number of cache misses can be estimated for any cache size.
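As an illustration of why a single stack histogram serves every cache size, the following minimal sketch computes LRU stack distances from a run-time address trace and then counts misses for a fully associative LRU cache. It is not the dissertation's compile-time algorithm, which derives the histogram statically. The function names, the toy trace, and the naive list-based stack are assumptions made for this example.

```python
from collections import Counter

def stack_distance_histogram(trace):
    """Map each LRU stack distance to the number of references that had it.

    The distance is the 1-based depth of the referenced item in the LRU
    stack (1 = most recently used). First-time references get distance
    infinity. This naive version scans a Python list, so it is slow on
    long traces; it is meant only to show the idea.
    """
    stack = []          # least recently used first, most recent last
    hist = Counter()
    for addr in trace:
        if addr in stack:
            depth = len(stack) - stack.index(addr)
            stack.remove(addr)
        else:
            depth = float("inf")   # cold (compulsory) reference
        hist[depth] += 1
        stack.append(addr)         # addr becomes the most recently used
    return hist

def misses(hist, cache_lines):
    """Misses in a fully associative LRU cache holding `cache_lines` items.

    A reference misses exactly when its stack distance exceeds the
    capacity, so the same histogram answers the question for any size.
    """
    return sum(count for dist, count in hist.items() if dist > cache_lines)

# Hypothetical toy trace of cache-line identifiers.
trace = ["a", "b", "c", "a", "b", "d", "a"]
hist = stack_distance_histogram(trace)
for size in (1, 2, 3, 4):
    print(size, misses(hist, size))
```

The histogram is built once; evaluating another cache size only re-sums its entries, with no need to replay the trace.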

We use stack distances to quantify locality, and we show that the average locality computed using stack distances is a very reliable metric. We also present a new algorithm for stack processing that is 30% faster than the best known algorithm on the suite of programs traced.

