Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3225058.3225136acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

C-Graph: A Highly Efficient Concurrent Graph Reachability Query Framework

Published: 13 August 2018 Publication History

Abstract

Many big data analytics applications explore a set of related entities, which are naturally modeled as graph. However, graph processing is notorious for its performance challenges due to random data access patterns, especially for large data volumes. Solving these challenges is critical to the performance of industry-scale applications. In contrast to most prior works, which focus on accelerating a single graph processing task, in industrial practice we consider multiple graph processing tasks running concurrently, such as a group of queries issued simultaneously to the same graph. In this paper, we present an edge-set based graph traversal framework called C-Graph (i.e. Concurrent Graph), running on a distributed infrastructure, that achieves both high concurrency and efficiency for k-hop reachability queries. The proposed framework maintains global vertex states to facilitate graph traversals, and supports both synchronous and asynchronous communication. In this study, we decompose a set of graph processing tasks into local traversals and analyze their performance on C-Graph. More specifically, we optimize the organization of the physical edge-set and explore the shared subgraphs. We experimentally show that our proposed framework outperforms several baseline methods.

References

[1]
2011. How Response Times Impact Business? https://calendar.perfplanet.com/2011/how-response-times-impact-business/. (2011).
[2]
2014. Apache Giraph. https://giraph.apache.org/. (2014).
[3]
2014. Titan Distributed Graph Database. http://thinkaurelius.github.io/titan/. (2014).
[4]
2017. JanusGraph Distributed Graph Database. https://github.com/JanusGraph/janusgraph. (2017).
[5]
James Cheng, Zechao Shang, Hong Cheng, Haixun Wang, and Jeffrey Xu Yu. 2012. K-reach: who is in your small world. Proceedings of the VLDB Endowment 5, 11 (2012), 1292--1303.
[6]
Yuze Chi, Guohao Dai, Yu Wang, Guangyu Sun, Guoliang Li, and Huazhong Yang. 2016. NXgraph: an efficient graph processing system on a single machine. In Data Engineering (ICDE), 2016 IEEE 32nd International Conference on. IEEE, 409--420.
[7]
Ayush Dubey, Greg D Hill, Robert Escriva, and Emin Gün Sirer. 2016. Weaver: a high-performance, transactional graph database based on refinable timestamps. Proceedings of the VLDB Endowment 9, 11 (2016), 852--863.
[8]
Karthi Duraisamy, Hao Lu, Partha Pratim Pande, and Ananth Kalyanaraman. 2016. High-Performance and Energy-Efficient Network-on-Chip Architectures for Graph Analytics. ACM Trans. Embed. Comput. Syst. 15, 4 (2016), 66:1--66:26.
[9]
Ioanna Filippidou and Yannis Kotidis. 2015. Online and On-demand Partitioning of Streaming Graphs. In Proceedings of the 2015 IEEE International Conference on Big Data (Big Data). 4--13.
[10]
Joseph E Gonzalez, Reynold S Xin, Ankur Dave, Daniel Crankshaw, Michael J Franklin, and Ion Stoica. 2014. GraphX: Graph Processing in a Distributed Dataflow Framework. In OSDI, Vol. 14. 599--613.
[11]
Aditya Grover and Jure Leskovec. 2016. Node2Vec: Scalable Feature Learning for Networks. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 855--864.
[12]
William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. (2017). arXiv:arXiv:1706.02216
[13]
Ma hias Hauck, Marcus Paradies, and Holger Fröning. 2017. Can Modern Graph Processing Engines Run Concurrent eries E iciently? (2017).
[14]
Borislav Iordanov. 2010. HyperGraphDB: a generalized graph database. Web-Age information management (2010), 25--36.
[15]
Jérôme Kunegis. 2013. Konect: the koblenz network collection. In Proceedings of the 22nd International Conference on World Wide Web. ACM, 1343--1350.
[16]
Aapo Kyrola, Guy Blelloch, and Carlos Guestrin. 2012. Graphchi: Large-scale graph computation on just a pc. In Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Vol. 8. 31--46.
[17]
Jure Leskovec and Rok Sosič. 2016. SNAP: A General-Purpose Network Analysis and Graph-Mining Library. ACM Trans. Intell. Syst. Technol. 8, 1 (2016), 1:1--1:20.
[18]
Hang Liu, H Howie Huang, and Yang Hu. 2016. iBFS: Concurrent breadth-first search on gpus. In Proceedings of the 2016 International Conference on Management of Data. ACM, 403--416.
[19]
Peter Macko, Daniel Margo, and Margo Seltzer. 2013. Performance introspection of graph databases. In Proceedings of the 6th International Systems and Storage Conference. ACM, 18.
[20]
Lifeng Nai, Yinglong Xia, Ilie G. Tanase, Hyesoon Kim, and Ching-Yung Lin. 2015. GraphBIG: Understanding Graph Computing in the Context of Industrial Solutions. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'15).
[21]
Peitian Pan and Chao Li. 2017. Congra: Towards Efficient Processing of Concurrent Graph Queries on Shared-Memory Machines. In 2017 IEEE 35th International Conference on Computer Design (ICCD). IEEE, 217--224.
[22]
I. Robinson, J. Webber, and E. Eifrem. 2013. Graph Databases. O'Reilly Media, Incorporated. http://books.google.com/books?id=RTvAmQEACAAJ
[23]
Jiaxin Shi, Youyang Yao, Rong Chen, Haibo Chen, and Feifei Li. 2016. Fast and Concurrent RDF Queries with RDMA-Based Distributed Graph Exploration. In OSDI. 317--332.
[24]
Ben Shneiderman. 1984. Response Time and Display Rate in Human Performance with Computers. ACM Comput. Surv. 16, 3 (1984), 265--285.
[25]
Yogesh Simmhan, Alok Kumbhare, Charith Wickramaarachchi, Soonil Nagarkar, Santosh Ravi, Cauligi Raghavendra, and Viktor Prasanna. 2014. Goffish: A subgraph centric framework for large-scale graph analytics. In European Conference on Parallel Processing. Springer, 451--462.
[26]
Manuel Then, Moritz Kaufmann, Fernando Chirigati, Tuan-Anh Hoang-Vu, Kien Pham, Alfons Kemper, Thomas Neumann, and Huy T Vo. 2014. The more the merrier: Efficient multi-source graph traversal. Proceedings of the VLDB Endowment 8, 4 (2014), 449--460.
[27]
Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, Shirish Tatikonda, and John McPherson. 2013. From think like a vertex to think like a graph. Proceedings of the VLDB Endowment 7, 3 (2013), 193--204.
[28]
Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, Shirish Tatikonda, and John McPherson. 2013. From think like a vertex to think like a graph. Proceedings of the VLDB Endowment 7, 3 (2013), 193--204.
[29]
Jim Webber. 2012. A programmatic introduction to neo4j. In Proceedings of the 3rd annual conference on Systems, programming, and applications: software for humanity. ACM, 217--218.
[30]
Huanhuan Wu, James Cheng, Yi Lu, Yiping Ke, Yuzhen Huang, Da Yan, and Hejun Wu. 2015. Core decomposition in large temporal graphs. In IEEE Big Data. 649--658.
[31]
Yinglong Xia, Ilie G. Tanase, Lifeng Nai, Wei Tan, Yanbin G. Liu, Jason Crawford, and C-Y. Lin. 2014. Explore Efficient Data Organization for Large Scale Graph Analytics and Storage. In IEEE Big Data. 942--951.
[32]
Jilong Xue, Zhi Yang, Shian Hou, and Yafei Dai. 2017. Processing Concurrent Graph Analytics with Decoupled Computation Model. IEEE Trans. Comput. 66, 5 (2017), 876--890.
[33]
Jilong Xue, Zhi Yang, Zhi Qu, Shian Hou, and Yafei Dai. 2014. Seraph: an efficient, low-cost system for concurrent graph processing. In Proceedings of the 23rd international symposium on High-performance parallel and distributed computing. ACM, 227--238.
[34]
Da Yan, James Cheng, Yi Lu, and Wilfred Ng. 2014. Blogel: A block-centric framework for distributed computation on real-world graphs. Proceedings of the VLDB Endowment 7, 14 (2014), 1981--1992.
[35]
Da Yan, James Cheng, Yi Lu, and Wilfred Ng. 2014. Blogel: A block-centric framework for distributed computation on real-world graphs. Proceedings of the VLDB Endowment 7, 14 (2014), 1981--1992.
[36]
Da Yan, James Cheng, M Tamer Özsu, Fan Yang, Yi Lu, John Lui, Qizhen Zhang, and Wilfred Ng. 2016. A general-purpose query-centric framework for querying big graphs. Proceedings of the VLDB Endowment 9, 7 (2016), 564--575.
[37]
Long Yuan, Lu Qin, Xuemin Lin, Lijun Chang, and Wenjie Zhang. 2016. I/O Efficient ECC Graph Decomposition via Graph Reduction. In PVLDB. 516--527.
[38]
Hongyi Zeng, Shidong Zhang, Fei Ye, Vimalkumar Jeyakumar, Mickey Ju, Junda Liu, Nick McKeown, and Amin Vahdat. 2014. Libra: Divide and Conquer to Verify Forwarding Tables in Huge Networks. In Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation. 87--99.
[39]
Wen Zhang. 2017. Knowledge Graph Embedding with Diversity of Structures. In Proceedings of the 26th International Conference on World Wide Web Companion. 747--753.
[40]
Xiaowei Zhu, Wenguang Chen, Weimin Zheng, and Xiaosong Ma. 2016. Gemini: A computation-centric distributed graph processing system. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)(Savannah, GA.
[41]
Xiaowei Zhu, Wentao Han, and Wenguang Chen. 2015. GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning. In USENIX Annual Technical Conference. 375--386.

Cited By

View all
  • (2024)A Survey on Concurrent Processing of Graph Analytical Queries: Systems and AlgorithmsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.339393636:11(5508-5528)Online publication date: Nov-2024
  • (2024)A disk I/O optimized system for concurrent graph processing jobsFrontiers of Computer Science10.1007/s11704-023-2361-018:3Online publication date: 22-Jan-2024
  • (2024)Distributed k-Hop Query Powered by an Asynchronous FrameworkWeb Information Systems Engineering – WISE 202410.1007/978-981-96-0579-8_22(304-319)Online publication date: 29-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '18: Proceedings of the 47th International Conference on Parallel Processing
August 2018
945 pages
ISBN:9781450365109
DOI:10.1145/3225058
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • University of Oregon: University of Oregon

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Concurrent Queries
  2. Distributed System
  3. Graph Processing
  4. K-Hop Reachability

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICPP 2018

Acceptance Rates

ICPP '18 Paper Acceptance Rate 91 of 313 submissions, 29%;
Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)2
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Survey on Concurrent Processing of Graph Analytical Queries: Systems and AlgorithmsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.339393636:11(5508-5528)Online publication date: Nov-2024
  • (2024)A disk I/O optimized system for concurrent graph processing jobsFrontiers of Computer Science10.1007/s11704-023-2361-018:3Online publication date: 22-Jan-2024
  • (2024)Distributed k-Hop Query Powered by an Asynchronous FrameworkWeb Information Systems Engineering – WISE 202410.1007/978-981-96-0579-8_22(304-319)Online publication date: 29-Nov-2024
  • (2022)Indexing the extended Dyck-CFL reachability for context-sensitive program analysisProceedings of the ACM on Programming Languages10.1145/35633396:OOPSLA2(1438-1468)Online publication date: 31-Oct-2022
  • (2021)GraphCP: An I/O-Efficient Concurrent Graph Processing Framework2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS)10.1109/IWQOS52092.2021.9521293(1-10)Online publication date: 25-Jun-2021
  • (2021)ESTI: Efficient k-Hop Reachability Querying over Large General Directed GraphsDatabase Systems for Advanced Applications. DASFAA 2021 International Workshops10.1007/978-3-030-73216-5_6(71-89)Online publication date: 11-Apr-2021
  • (2020)CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUsProceedings of the 49th International Conference on Parallel Processing10.1145/3404397.3404400(1-11)Online publication date: 17-Aug-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media