Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/IPDPSW.2012.12guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Partitioning for Parallel Matrix-Matrix Multiplication with Heterogeneous Processors: The Optimal Solution

Published: 21 May 2012 Publication History

Abstract

The problem of matrix partitioning for parallel matrix-matrix multiplication on heterogeneous processors has been extensively studied since the mid 1990s. During this time, previous research focused mainly on the design of efficient partitioning algorithms, optimally or sub-optimally partitioning matrices into rectangles. The optimality of the rectangular partitioning shape itself has never been studied or even seriously questioned. The accepted approach is that consideration of non-rectangular shapes will not significantly improve the optimality of the solution, but can significantly complicate the partitioning problem, which is already NP-complete even for the restricted case of rectangular shapes. There is no published research, however, supporting this approach. The shape of the globally optimal partitioning, and how the best rectangular partitioning compares with this global optimum, are still wide open problems. Solution of these problems will decide if new partitioning algorithms searching for truly optimal, and not necessarily rectangular, solutions are needed. This paper presents the first results of our research on the problem of optimal partitioning shapes for parallel matrix-matrix multiplication on heterogeneous processors. Namely, the case of two interconnected processors is comprehensively studied. We prove that, depending on performance characteristics of the processors and the communication link, the globally optimal partitioning will have one of just two well-specified shapes, one of which is rectangular and the other is non-rectangular. The theoretical analysis is conducted using an original mathematical technique proposed in the paper. It is shown that the technique can also be applied in the case of arbitrary numbers of processors. While comprehensive analysis of the cases of three and more processors is more complicated and the subject for future work, the paper does prove the optimality of some particular non-rectangular partitioning shapes for some combinations of performance characteristics of heterogeneous processors and communication links. The paper also presents experimental results demonstrating that the optimal non-rectangular partitioning can significantly outperform the optimal rectangular one on real-life heterogeneous HPC platforms.

Cited By

View all
  • (2016)Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platformsConcurrency and Computation: Practice & Experience10.1002/cpe.360928:3(802-821)Online publication date: 10-Mar-2016
  • (2014)Heterogeneous parallel computingSupercomputing Frontiers and Innovations: an International Journal10.14529/jsfi1403041:3(70-87)Online publication date: 12-Oct-2014
  • (2014)Optimal Data Partitioning Shape for Matrix Multiplication on Three Fully Connected Heterogeneous ProcessorsRevised Selected Papers, Part I, of the Euro-Par 2014 International Workshops on Parallel Processing - Volume 880510.1007/978-3-319-14325-5_18(201-214)Online publication date: 25-Aug-2014

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IPDPSW '12: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum
May 2012
2554 pages
ISBN:9780769546766

Publisher

IEEE Computer Society

United States

Publication History

Published: 21 May 2012

Author Tags

  1. Heterogeneous Computing
  2. High Performance Computing
  3. Matrix Partitioning
  4. Parallel Matrix Multiplication

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platformsConcurrency and Computation: Practice & Experience10.1002/cpe.360928:3(802-821)Online publication date: 10-Mar-2016
  • (2014)Heterogeneous parallel computingSupercomputing Frontiers and Innovations: an International Journal10.14529/jsfi1403041:3(70-87)Online publication date: 12-Oct-2014
  • (2014)Optimal Data Partitioning Shape for Matrix Multiplication on Three Fully Connected Heterogeneous ProcessorsRevised Selected Papers, Part I, of the Euro-Par 2014 International Workshops on Parallel Processing - Volume 880510.1007/978-3-319-14325-5_18(201-214)Online publication date: 25-Aug-2014

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media