Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2487788.2487984acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Towards highly scalable pregel-based graph processing platform with x10

Published: 13 May 2013 Publication History

Abstract

Many practical computing problems concern large graph. Standard problems include web graph analysis and social networks analysis like Facebook, Twitter. The scale of these graph poses challenge to their efficient processing. To efficiently process large-scale graph, we create X-Pregel, a graph processing system based on Google's Computing Pregel model [1], by using the state-of-the-art PGAS programming language X10. We do not purely implement Google Pregel by using X10 language, but we also introduce two new features that do not exists in the original model to optimize the performance: (1) an optimization to reduce the number of messages which is exchanged among workers, (2) a dynamic re-partitioning scheme that effectively reassign vertices to different workers during the computation. Our performance evaluation demonstrates that our optimization method of sending messages achieves up to 200% speed up on Pagerank by reducing the network I/O to 10 times in comparison with the default method of sending messages when processing SCALE20 Kronecker graph [2](vertices = 1,048,576, edges = 33,554,432). It also demonstrates that our system processes large graph faster than prior implementation of Pregel such as GPS [3](stands for graph processing system) and Giraph [4].

References

[1]
Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. Pregel: a system for large-scale graph processing. In Proceedings of the 2010 international conference on Management of data, SIGMOD '10, pages 135--146. ACM, 2010.
[2]
Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutos, and Zoubin Ghahramani. Kronecker graphs: An approach to modeling networks. pages 985--1042, 2010.
[3]
Semih Salihoglu and Jennifer Widom. Gps: A graph processing system. Technical report, Stanford University.
[4]
The Apache Software Foundation. Apache incubator giraph. http://incubator.apache.org/giraph/.
[5]
Sanjay Ghemawat Jeffrey Dean. Mapreduce: simplified data processing on large clusters. In Proceedings of the 6th conference on Symposium on Operation Systems Design & Implementation - Volume 6, OSDI'04, pages 10--10.
[6]
Yahoo. Apache hadoop. http://hadoop.apache.org.
[7]
Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. In COMPUTER NETWORKS AND ISDN SYSTEMS, pages 107--117. Elsevier Science Publishers B. V., 1998.
[8]
C. Grothoff, V. Saraswat, C. Donawa, A. Kielstra, K. Ebcioglu, C. von Praun, and V. Sarkar. X10: an object-oriented approach to non-uniform cluster computing. pages 519--538, 2005.
[9]
V. Saraswat, B. Bloom, I. Peshansky, O. Tardieu, and D. Grove. The x10 reference manual. 2010.
[10]
U. Kang, Charalampos E. Tsourakakis, and Christos Faloutsos. Pegasus: A peta-scale graph mining system - implementation and observations, 2009.
[11]
Leslie G. Valiant. A bridging model for parallel computation, 1990.
[12]
The Apache Software Foundation. Apache zookeeper. http://zookeeper.apache.org/.
[13]
The Apache Software Foundation. Apache mina. http://mina.apache.org/.
[14]
Graph 500 Steering Committee. Graph500. http://www.graph500.org/.

Cited By

View all
  • (2024)How to Fit the SCC Algorithm Efficiently into Distributed Graph Iterative Computation2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC)10.1109/COMPSAC61105.2024.00043(254-263)Online publication date: 2-Jul-2024
  • (2023)PATRIC: A high performance parallel urban transport simulation framework based on traffic clusteringSimulation Modelling Practice and Theory10.1016/j.simpat.2023.102775126(102775)Online publication date: Jul-2023
  • (2022)GGraph: An Efficient Structure-Aware Approach for Iterative Graph ProcessingIEEE Transactions on Big Data10.1109/TBDATA.2020.30196418:5(1182-1194)Online publication date: 1-Oct-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web
May 2013
1636 pages
ISBN:9781450320382
DOI:10.1145/2487788

Sponsors

  • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
  • CGIBR: Comite Gestor da Internet no Brazil

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. distributed computing
  2. graph analysis system
  3. parallel graph processing system

Qualifiers

  • Research-article

Conference

WWW '13
Sponsor:
  • NICBR
  • CGIBR
WWW '13: 22nd International World Wide Web Conference
May 13 - 17, 2013
Rio de Janeiro, Brazil

Acceptance Rates

WWW '13 Companion Paper Acceptance Rate 831 of 1,250 submissions, 66%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)How to Fit the SCC Algorithm Efficiently into Distributed Graph Iterative Computation2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC)10.1109/COMPSAC61105.2024.00043(254-263)Online publication date: 2-Jul-2024
  • (2023)PATRIC: A high performance parallel urban transport simulation framework based on traffic clusteringSimulation Modelling Practice and Theory10.1016/j.simpat.2023.102775126(102775)Online publication date: Jul-2023
  • (2022)GGraph: An Efficient Structure-Aware Approach for Iterative Graph ProcessingIEEE Transactions on Big Data10.1109/TBDATA.2020.30196418:5(1182-1194)Online publication date: 1-Oct-2022
  • (2022)Graph Computing Systems and Partitioning Techniques: A SurveyIEEE Access10.1109/ACCESS.2022.321942210(118523-118550)Online publication date: 2022
  • (2021)An analysis of the graph processing landscapeJournal of Big Data10.1186/s40537-021-00443-98:1Online publication date: 9-Apr-2021
  • (2020)Large-scale graph processing systems: a surveyFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.190012721:3(384-404)Online publication date: 1-Apr-2020
  • (2020)Efficient Graph Query Processing over Geo-Distributed DatacentersProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401157(619-628)Online publication date: 25-Jul-2020
  • (2020)An investigation of big graph partitioning methods for distribution of graphs in vertex-centric systemsDistributed and Parallel Databases10.1007/s10619-019-07256-z38:1(1-29)Online publication date: 1-Mar-2020
  • (2019)SPFC: An Effective Optimization for Vertex-Centric Graph Processing SystemsIEEE Transactions on Sustainable Computing10.1109/TSUSC.2017.27803204:1(118-131)Online publication date: 1-Jan-2019
  • (2019)Composing Optimization Techniques for Vertex-Centric Graph Processing via Communication Channels2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2019.00053(428-438)Online publication date: May-2019
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media