Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Compile-time performance prediction of scientific programs
Publisher:
  • University of Illinois at Urbana-Champaign
  • Champaign, IL
  • United States
ISBN:978-0-599-97401-2
Order Number:AAI9989955
Pages:
124
Reflects downloads up to 12 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

In this disertation we present a compile-time performance prediction environment for Fortran scientific programs. The performance data are expressed as symbolic expressions, with variables for program constructs, input data size and machine parameters. We focus on modeling the processor and its memory hierarchy. The results from the static estimation can be used to drive optimizations or can be displayed using performance visualization tools. The integration of our model within the Delphi system allows the user to do performance tuning and scalability analysis faster and easier than by using instrumentation.

The main contribution of this work is the cache behavior estimation using the stack distances algorithm. We have designed and implemented a compile-time algorithm that computes the stack histogram at compile-time. We use the stack histogram to predict program performance statically with very good accuracy. Experimental results are presented for two processor/memory architectures, the MIPS R10000 and UltraSparc II i . The most interesting feature of the stack algorithm is that once the histogram is computed, the number of cache misses can be estimated for any cache size.

We use stack distances to quantify locality and we show that the average locality computed using stack distances is a very reliable metric. A new algorithm for stack processing, that is 30% faster than the best know algorithm on the suite of programs traced, is also presented.

Cited By

  1. Chen G, Wu B, Li D and Shen X PORPLE Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, (88-100)
  2. ACM
    Wu B, Zhao Z, Zhang E, Jiang Y and Shen X (2013). Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU, ACM SIGPLAN Notices, 48:8, (57-68), Online publication date: 23-Aug-2013.
  3. ACM
    Wu B, Zhao Z, Zhang E, Jiang Y and Shen X Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming, (57-68)
  4. ACM
    Andrade D, Fraguela B and Doallo R (2007). Precise automatable analytical modeling of the cache behavior of codes with indirections, ACM Transactions on Architecture and Code Optimization, 4:3, (16-es), Online publication date: 1-Sep-2007.
  5. Andrade D, Fraguela B and Doallo R Cache behavior modelling for codes involving banded matrices Proceedings of the 19th international conference on Languages and compilers for parallel computing, (205-219)
  6. Fraguela B, Doallo R, Touriño J and Zapata E (2004). A compiler tool to predict memory hierarchy performance of scientific codes, Parallel Computing, 30:2, (225-248), Online publication date: 1-Feb-2004.
  7. ACM
    Ding C and Zhong Y Predicting whole-program locality through reuse distance analysis Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation, (245-257)
  8. ACM
    Ding C and Zhong Y (2003). Predicting whole-program locality through reuse distance analysis, ACM SIGPLAN Notices, 38:5, (245-257), Online publication date: 9-May-2003.
  9. Fraguela B, Doallo R and Zapata E (2003). Probabilistic Miss Equations, IEEE Transactions on Computers, 52:3, (321-336), Online publication date: 1-Mar-2003.
  10. ACM
    Chatterjee S, Parker E, Hanlon P and Lebeck A Exact analysis of the cache behavior of nested loops Proceedings of the ACM SIGPLAN 2001 conference on Programming language design and implementation, (286-297)
  11. ACM
    Chatterjee S, Parker E, Hanlon P and Lebeck A (2001). Exact analysis of the cache behavior of nested loops, ACM SIGPLAN Notices, 36:5, (286-297), Online publication date: 1-May-2001.
Contributors
  • University of Illinois Urbana-Champaign
  • Qualcomm Incorporated

Recommendations