Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Optimal semi-oblique tiling

Published: 01 September 2003 Publication History
  • Get Citation Alerts
  • Abstract

    For 2D iteration space tiling, we address the problem of determining the tile parameters that minimize the total execution time on a parallel machine. We consider uniform dependency computations tiled so that (at least) one of the tile boundaries is parallel to the domain boundaries. We determine the optimal tile size as a closed form solution. In addition, we determine the optimal number of processors and also the optimal slope of the oblique tile boundary. Our results are based on the BSP model, which assures the portability of the results. Our predictions are justified on a sequence global alignment problem specialized to similar sequences using Fickett's k-band algorithm, for which our optimal semi-oblique tiling yields an improvement of a factor of 2.5 over orthogonal tiling. Our optimal solution requires a block-cyclic distribution of tiles to processors. The best one can obtain with only block distribution (as many authors require) is three times slower. Furthermore, our best running time is within 10 percent of the "predicted theoretical peak" performance of the machine!.

    Cited By

    View all
    • (2024)Stencil Computation with Vector Outer ProductProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656611(247-258)Online publication date: 30-May-2024
    • (2017)Tessellating stencilsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3126908.3126920(1-13)Online publication date: 12-Nov-2017
    • (2015)Locality aware concurrent start for stencil applicationsProceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization10.5555/2738600.2738620(157-166)Online publication date: 7-Feb-2015
    • Show More Cited By

    Index Terms

    1. Optimal semi-oblique tiling
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Parallel and Distributed Systems
      IEEE Transactions on Parallel and Distributed Systems  Volume 14, Issue 9
      September 2003
      144 pages

      Publisher

      IEEE Press

      Publication History

      Published: 01 September 2003

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Stencil Computation with Vector Outer ProductProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656611(247-258)Online publication date: 30-May-2024
      • (2017)Tessellating stencilsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3126908.3126920(1-13)Online publication date: 12-Nov-2017
      • (2015)Locality aware concurrent start for stencil applicationsProceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization10.5555/2738600.2738620(157-166)Online publication date: 7-Feb-2015
      • (2009)Efficient hybrid parallelisation of tiled algorithms on SMP clustersInternational Journal of Computational Science and Engineering10.1504/IJCSE.2009.0270034:2(112-126)Online publication date: 1-Jul-2009
      • (2008)Positivity, posynomials and tile size selectionProceedings of the 2008 ACM/IEEE conference on Supercomputing10.5555/1413370.1413426(1-12)Online publication date: 15-Nov-2008
      • (2008)A practical automatic polyhedral parallelizer and locality optimizerACM SIGPLAN Notices10.1145/1379022.137559543:6(101-113)Online publication date: 7-Jun-2008
      • (2008)A practical automatic polyhedral parallelizer and locality optimizerProceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/1375581.1375595(101-113)Online publication date: 7-Jun-2008
      • (2008)Global Tiling for Communication Minimal Parallelization on Distributed Memory SystemsProceedings of the 14th international Euro-Par conference on Parallel Processing10.1007/978-3-540-85451-7_41(382-391)Online publication date: 26-Aug-2008
      • (2007)Effective automatic parallelization of stencil computationsACM SIGPLAN Notices10.1145/1273442.125076142:6(235-244)Online publication date: 10-Jun-2007
      • (2007)Effective automatic parallelization of stencil computationsProceedings of the 28th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/1250734.1250761(235-244)Online publication date: 15-Jun-2007
      • Show More Cited By

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media