A unified transformation technique for multilevel blocking

Jiménez, M.; Llabería, J. M.; Fernández, A.; Morancho, E.

doi:10.1007/3-540-61626-8_53

M. Jiménez¹,
J. M. Llabería¹,
A. Fernández¹ &
…
E. Morancho¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1123))

Included in the following conference series:

European Conference on Parallel Processing

202 Accesses
2 Citations

Abstract

This paper presents a new unified method for simultaneously tiling the register and cache levels of the memory hierarchy. We will only focus on the code transformation phase of tiling. Our algorithm uses strip-mining and loop interchange on all memory hierarchy levels to determine the tiles as usual, and, afterwards, and due to the special characteristics of the register level, we apply index set splitting, unrolling and scalar replacement to this level. After applying strip-mining, the iteration space is non-convex. To perform in a single step the loop interchange in non-convex iteration spaces, we use non-unimodular matrices. The order proposed to perform index set splitting to the loops guarantees that each loop in the nest has to be processed only once and also avoids code explosion.

Download to read the full chapter text

Chapter PDF

An Analytical Model for Loop Tiling Transformation

Parallel Tiled Cache and Energy Efficient Code for Zuker’s RNA Folding

Loop Nest Tiling for Image Processing and Communication Applications

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

D. Callahan, S. Carr, K. Kennedy. Improving Register Allocation for Subscripted Variables. Int. Conf. on Programming Language Design and Implementation, June 1990, pp. 53–65
Google Scholar
S. Carr. Memory-Hierarchy Management. Ph.D. Dissertation, Rice University, Feb 1993.
Google Scholar
S. Carr, K. McKinley, C-W. Tseng. Compiler Optimizations for Improving Data Locality. Int. Conf. on Architectural Support for Programming Languages and Operating Systems, Aug 1994, pp.252–262
Google Scholar
A. Fernández, J.M. Llabería, M. Valero-García. Loop Transformation using non-unimodular matrices. IEEE Transactions on Parallel and Distributed Systems, Vol. 6, No. 8, Aug 1995, pp. 832–840
Article Google Scholar
M. Jiménez, J.M. Llabería, A. Fernández, E. Morancho. A Unified Transformation Technique for Multilevel Blocking. TR. UPC-DAC-1995-51, Dept. of Computer Architecture, Polytechnic University of Catalonia, Dec 1995.
Google Scholar
M. Lam, E.Rothberg, M. Wolf. The Cache Performance and Optimizations of Blocked Algorithms. Int. Conf. on Architectural Support for Programming Languages and Operating Systems, 1991, pp. 63–74
Google Scholar
J.J Navarro, T. Juan, T. Lang. MOB Forms: A Class of Multilevel Block Algorithms for Dense Linear Algebra Operations. Int. Conf. on Supercomputing, July 1994, pp. 354–363
Google Scholar
M. Wolf. Improving Locality and Parallelism in Nested Loops. Technical Report CSL-TR-92-538, Stanford University, Aug 1992.
Google Scholar
M. Wolf, M. Lam. A Data Optimizing Algorithm. Int. Conf. on Programming Language Design and Implementation, June 1991, pp. 30–44
Google Scholar
M. Wolf, M. Lam. A Loop Transformation Theory and an Algorithm to Maximize Parallelism. IEEE Trans. on Parallel and Distributed System, Vol. 2, No. 4, October 1991, pp. 452–471
Article Google Scholar
M. Wolfe. More Iteration Space Tiling. Int. Conf. on Supercomputing, 1989, pp. 655–664
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Arquitectura de Computadores, Universidad Politécnica de Cataluña Gran Capitán s/n, Módulo D6, E-08034, Barcelona, Spain
M. Jiménez, J. M. Llabería, A. Fernández & E. Morancho

Authors

M. Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
J. M. Llabería
View author publications
You can also search for this author in PubMed Google Scholar
A. Fernández
View author publications
You can also search for this author in PubMed Google Scholar
E. Morancho
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Luc Bougé Pierre Fraigniaud Anne Mignotte Yves Robert

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiménez, M., Llabería, J.M., Fernández, A., Morancho, E. (1996). A unified transformation technique for multilevel blocking. In: Bougé, L., Fraigniaud, P., Mignotte, A., Robert, Y. (eds) Euro-Par'96 Parallel Processing. Euro-Par 1996. Lecture Notes in Computer Science, vol 1123. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61626-8_53

Download citation

DOI: https://doi.org/10.1007/3-540-61626-8_53
Published: 08 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61626-9
Online ISBN: 978-3-540-70633-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

A unified transformation technique for multilevel blocking

Abstract

Chapter PDF

Similar content being viewed by others

An Analytical Model for Loop Tiling Transformation

Parallel Tiled Cache and Energy Efficient Code for Zuker’s RNA Folding

Loop Nest Tiling for Image Processing and Communication Applications

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A unified transformation technique for multilevel blocking

Abstract

Chapter PDF

Similar content being viewed by others

An Analytical Model for Loop Tiling Transformation

Parallel Tiled Cache and Energy Efficient Code for Zuker’s RNA Folding

Loop Nest Tiling for Image Processing and Communication Applications

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation