Hierarchical overlapped tiling

X Zhou, JP Giacalone, MJ Garzarán, RH Kuhn, Y Ni, D Padua
Proceedings of the Tenth International Symposium on Code Generation and …, 2012 - dl.acm.org
This paper introduces hierarchical overlapped tiling, a transformation that applies loop tiling and fusion to conventional loops. Overlapped tiling is a useful transformation to reduce communication overhead, but it may also generate a significant amount of redundant computation. Hierarchical overlapped tiling performs overlapped tiling hierarchically to balance communication overhead and redundant computation, and thus has the potential to provide better performance.
In this paper, we describe the hierarchical overlapped tiling optimization and its implementation in an OpenCL compiler. We also evaluate the effectiveness of this optimization using 8 programs that implement different forms of stencil computation. Our results show that hierarchical overlapped tiling achieves an average 37% speedup over traditional tiling on a 32-core workstation.
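The trade-off between communication and redundant computation can be illustrated with a minimal sketch of plain (single-level) overlapped tiling. This is not the paper's OpenCL implementation or its hierarchical scheme: the 1D three-point averaging stencil, the fixed boundary cells, and the names `stencil_step`, `reference`, and `overlapped_tiled` are all assumptions chosen for illustration. Each tile copies its own range plus a halo of `steps` cells per side, runs every time step on that private copy without exchanging data with its neighbors, and keeps only its own range at the end.

```python
def stencil_step(a):
    """One 3-point averaging step; the first and last cells are held fixed."""
    out = a[:]
    for i in range(1, len(a) - 1):
        out[i] = (a[i - 1] + a[i] + a[i + 1]) / 3.0
    return out


def reference(a, steps):
    """Untiled baseline: apply the stencil to the whole array `steps` times."""
    for _ in range(steps):
        a = stencil_step(a)
    return a


def overlapped_tiled(a, steps, tile):
    """Overlapped tiling (illustrative): each tile works on a private copy
    widened by `steps` halo cells per side, so no data is exchanged between
    time steps. Returns the result and the number of cell updates performed."""
    n = len(a)
    result = a[:]
    updates = 0
    for start in range(0, n, tile):
        end = min(start + tile, n)
        lo = max(start - steps, 0)   # halo, clamped at the array edge
        hi = min(end + steps, n)
        local = a[lo:hi]             # single read of tile plus halo
        for _ in range(steps):
            local = stencil_step(local)
            updates += max(len(local) - 2, 0)
        # Stale values creep in one cell per step from each cut edge, so
        # after `steps` steps only the halo is invalid; keep the tile itself.
        result[start:end] = local[start - lo:end - lo]
    return result, updates
```

Comparing `updates` with the `steps * (n - 2)` updates of the untiled baseline gives the redundant work spent on halo cells. That overhead grows as more time steps are fused and as tiles shrink, which is the cost that applying the tiling hierarchically is meant to balance against communication.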