Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-642-40270-8_3guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

BC-BSP: A BSP-Based Parallel Iterative Processing System for Big Data on Cloud Architecture

Published: 22 April 2013 Publication History

Abstract

Many applications in real life can produce and collect large amount of data and many of them can be modeled by Graph. The number of vertexes of a graph could be several hundreds of millions to billions and the number of edges could be ten or more times of the number of its vertexes. A BSP-based system for large-scale data especially graph data parallel and iterative processing is discussed in this paper. The system has the ability to flexible configuration and the extendibility for functions and strategies such as adjusting the parameters according to the volume of data and supporting multiple aggregation functions at the same time, to process large-scale data, to tolerate faults, to balance load, and to run clustering or classification algorithms on metric datasets. Lots of experiments are done to evaluate the extendibility of the system implemented in the paper, and the comparison between BC-BSP-based applications and MapReduce-based ones are made. The experimental results show that BSP-based applications have higher efficiency than that of MapReduce-based applications when the volume of data can be put in the memory during the course of processing; on the contrary the latter are better than the former, and the performance of BC-BSP platform outperforms Hama and Giraph.

References

[1]
Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters. In: Proc. of 6th USENIX Symp. on Operating Syst. Design and Impl., pp. 137---150 2004
[2]
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems 301-7 1998
[3]
Malewicz, G., Austern, M.H., Bik, A.J.C., Dehnert, J.C., Horn, I., Leiser, N., Czajkowski, G.: Pregel: A System for Large-Scale Graph Processing. SIGMOD 2010
[4]
Welcome to Hama Project, http://incubator.apache.org/hama/
[5]
Snoek, J.: Computing PageRank using MapReduce. Technical Report, Report No. CSC2544. University of Toronto, Toronto 2008
[6]
Ching, A., Kunz, C.: Giraph: Large-scale graph processing infrastructure on Hadoop, Hadoop Summit 2011

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
Proceedings of the 18th International Conference on Database Systems for Advanced Applications - Volume 7827
April 2013
243 pages
ISBN:9783642402692
  • Editors:
  • Bonghee Hong,
  • Xiaofeng Meng,
  • Lei Chen,
  • Werner Winiwarter,
  • Wei Song

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 22 April 2013

Author Tags

  1. BSP
  2. Big data
  3. Disk Cache
  4. Graph Processing
  5. MapReduce

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Feb 2025

Other Metrics

Citations

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media