Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2723372.2723712acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Exploiting Matrix Dependency for Efficient Distributed Matrix Computation

Published: 27 May 2015 Publication History

Abstract

Distributed matrix computation is a popular approach for many large-scale data analysis and machine learning tasks. However existing distributed matrix computation systems generally incur heavy communication cost during the runtime, which degrades the overall performance. In this paper, we propose a novel matrix computation system, named DMac, which exploits the matrix dependencies in matrix programs for efficient matrix computation in the distributed environment. We decompose each matrix program into a sequence of operations, and reveal the matrix dependencies between operations in the program. We next design a dependency-oriented cost model to select an optimal execution strategy for each operation, and generate a communication efficient execution plan for the matrix computation program. To facilitate the matrix computation in distributed systems, we further divide the execution plan into multiple un-interleaved stages which can run in a distributed cluster with efficient local execution strategy on each worker. The DMac system has been implemented on a popular general-purpose data processing framework, Spark. The experimental results demonstrate that our techniques can significantly improve the performance of a wide range of matrix programs.

References

[1]
BLAS. http://www.netlib.org/blas/.
[2]
Hadoop. http://hadoop.apache.org/.
[3]
LAPACK. http://www.netlib.org/lapack/.
[4]
Mahout. https://mahout.apache.org/.
[5]
ScaLAPACK. http://www.netlib.org/scalapack/.
[6]
J. Bennett and S. Lanning. The netflix prize. In KDD Cup'2007, pages 35--35.
[7]
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In UAI'1998, pages 43--52.
[8]
P. G. Brown. Overview of scidb: large scale array storage, processing and analysis. In SIGMOD'2010, pages 963--968.
[9]
Y. Bu, B. Howe, M. Balazinska, and M. D. Ernst. Haloop: Efficient iterative data processing on large clusters. VLDB'2010, 3(1--2):285--296.
[10]
T. A. Davis. Direct Methods for Sparse Linear Systems (Fundamentals of Algorithms 2). 2006.
[11]
J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Comm. ACM, 51(1):107--113, 2008.
[12]
M. L. Eaton and I. Olkin. Best equivariant estimators of a cholesky decomposition. The Annals of Statistics, 15(4):1639--1650, 12 1987.
[13]
L. C. Freeman. A Set of Measures of Centrality Based on Betweenness. Sociometry, 40(1):35--41, Mar. 1977.
[14]
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan. Systemml: Declarative machine learning on mapreduce. In ICDE'2011, pages 231--242.
[15]
U. Kang, C. E. Tsourakakis, and C. Faloutsos. Pegasus: A peta-scale graph mining system implementation and observations. In ICDM'2009, pages 229--238.
[16]
D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In Advances in neural information processing systems, pages 556--562, 2001.
[17]
H. Lim, H. Herodotou, and S. Babu. Stubby: A transformation-based optimizer for mapreduce workflows. VLDB'2012, 5(11):1196--1207.
[18]
B. Matthias, T. Shirish, R. Berthold, S. Prithviraj, T. Yuanyuan, R. B. Douglas, and V. Shivakumar. Hybrid parallelization strategies for large-scale machine learning in systemml. In PVLDB'2014, pages 553--564.
[19]
T. Nykiel, M. Potamias, C. Mishra, G. Kollios, and N. Koudas. Mrshare: sharing across multiple queries in mapreduce. VLDB'2010, 3(1--2):494--505.
[20]
Z. Qian, X. Chen, N. Kang, M. Chen, Y. Yu, T. Moscibroda, and Z. Zhang. Madlinq: large-scale distributed matrix computation for the cloud. In EuroSys'12, pages 197--210, 2012.
[21]
S. Seo, E. J. Yoon, J. Kim, S. Jin, J.-S. Kim, and S. Maeng. Hama: An efficient matrix computation with the mapreduce framework. In CloudCom' 2010, pages 721--726, 2010.
[22]
Y. Yu, M. Isard, D. Fetterly, M. Budiu, Ú. Erlingsson, P. K. Gunda, and J. Currey. Dryadlinq: A system for general-purpose distributed data-parallel computing using a high-level language. In OSDI'2008, volume 8, pages 1--14.
[23]
M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica. Spark: Cluster computing with working sets. In HotCloud'2010, pages 10--10.
[24]
J. Zhou, P.-A. Larson, and R. Chaiken. Incorporating partitioning and parallel plans into the scope optimizer. In ICDE'2010, pages 1060--1071.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
May 2015
2110 pages
ISBN:9781450327589
DOI:10.1145/2723372
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dependency analysis
  2. distributed system
  3. matrix computing

Qualifiers

  • Research-article

Funding Sources

Conference

SIGMOD/PODS'15
Sponsor:
SIGMOD/PODS'15: International Conference on Management of Data
May 31 - June 4, 2015
Victoria, Melbourne, Australia

Acceptance Rates

SIGMOD '15 Paper Acceptance Rate 106 of 415 submissions, 26%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)2
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)UPLIFTProceedings of the VLDB Endowment10.14778/3551793.355184215:11(2929-2938)Online publication date: Jul-2022
  • (2022)Redundancy Elimination in Distributed Matrix ComputationProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517877(573-586)Online publication date: 10-Jun-2022
  • (2022)Efficient Matrix Computation for SGD-Based Algorithms on Apache SparkDatabase Systems for Advanced Applications10.1007/978-3-031-00123-9_25(309-324)Online publication date: 11-Apr-2022
  • (2022)Data Management in Machine Learning SystemsundefinedOnline publication date: 26-Feb-2022
  • (2021)HyMACProceedings of the VLDB Endowment10.14778/3476311.347632314:12(2699-2702)Online publication date: 28-Oct-2021
  • (2021)Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear AlgebraProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457317(1222-1234)Online publication date: 9-Jun-2021
  • (2021)Hybrid Evaluation for Distributed Iterative Matrix ComputationProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452843(300-312)Online publication date: 9-Jun-2021
  • (2021)Spangle: A Distributed In-Memory Processing System for Large-Scale Arrays2021 IEEE 37th International Conference on Data Engineering (ICDE)10.1109/ICDE51399.2021.00158(1799-1810)Online publication date: Apr-2021
  • (2020)C olumnSGD: A Column-oriented Framework for Distributed Stochastic Gradient Descent2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00134(1513-1524)Online publication date: Apr-2020
  • (2019)Data Management in Machine Learning SystemsSynthesis Lectures on Data Management10.2200/S00895ED1V01Y201901DTM05714:1(1-173)Online publication date: 25-Feb-2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media