Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3183713.3196917acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

TurboFlux: A Fast Continuous Subgraph Matching System for Streaming Graph Data

Published: 27 May 2018 Publication History

Abstract

A dynamic graph is defined by an initial graph and a graph update stream consisting of edge insertions and deletions. Identifying and monitoring critical patterns in the dynamic graph is important in various application domains such as fraud detection, cyber security, and emergency response. Given a dynamic data graph and a query graph, a continuous subgraph matching system reports positive matches for an edge insertion and reports negative matches for an edge deletion. Previous systems show significantly low throughput due to either repeated subgraph matching for each edge update or expensive overheads in maintaining enormous intermediate results. We present a fast continuous subgraph matching system called TurboFlux which provides high throughput over a fast graph update stream. TurboFlux employs a concise representation of intermediate results, and its execution model allows fast incremental maintenance. Our empirical evaluation shows that TurboFlux significantly outperforms existing competitors by up to six orders of magnitude.

References

[1]
Ehab Abdelhamid, Mustafa Canim, Mohammad Sadoghi, Bishwaranjan Bhattacharjee, Yuan-Chi Chang, and Panos Kalnis. 2017. Incremental Frequent Subgraph Mining on Large Evolving Graphs. IEEE Transactions on Knowledge and Data Engineering Vol. 29, 12 (2017), 2710--2723.
[2]
Christopher R Aberger, Susan Tu, Kunle Olukotun, and Christopher Ré. 2016 a. Emptyheaded: A relational engine for graph processing Proceedings of the 2016 International Conference on Management of Data. ACM, 431--446.
[3]
Christopher R Aberger, Susan Tu, Kunle Olukotun, and Christopher Ré. 2016 b. Old techniques for new join algorithms: A case study in RDF processing Data Engineering Workshops (ICDEW), 2016 IEEE 32nd International Conference on. IEEE, 97--102.
[4]
Hal Berenson, Phil Bernstein, Jim Gray, Jim Melton, Elizabeth O'Neil, and Patrick O'Neil. 1995. A critique of ANSI SQL isolation levels. In ACM SIGMOD Record, Vol. Vol. 24. ACM, 1--10.
[5]
Fei Bi, Lijun Chang, Xuemin Lin, Lu Qin, and Wenjie Zhang. 2016. Efficient Subgraph Matching by Postponing Cartesian Products Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data. ACM, 1199--1214.
[6]
Sutanay Choudhury, Lawrence Holder, George Chin, Abhik Ray, Sherman Beus, and John Feo. 2013. StreamWorks: a system for dynamic graph search. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, 1101--1104.
[7]
Sutanay Choudhury, Lawrence B. Holder, George Chin Jr., Khushbu Agarwal, and John Feo. 2015. A Selectivity based approach to Continuous Pattern Detection in Streaming Graphs Proceedings of the 18th International Conference on Extending Database Technology, EDBT 2015, Brussels, Belgium, March 23--27, 2015. 157--168.
[8]
Luigi P. Cordella, Pasquale Foggia, Carlo Sansone, and Mario Vento. 2004. A (Sub)Graph Isomorphism Algorithm for Matching Large Graphs. IEEE PAMI Vol. 26, 10 (2004), 1367--1372.
[9]
Roberto De Virgilio. 2017. Smart RDF Data storage in Graph Databases. In Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE Press, 872--881.
[10]
Wenfei Fan, Jianzhong Li, Jizhou Luo, Zijing Tan, Xin Wang, and Yinghui Wu. 2011. Incremental graph pattern matching. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of data. ACM, 925--936.
[11]
Wenfei Fan, Xin Wang, and Yinghui Wu. 2013. Incremental graph pattern matching. ACM Transactions on Database Systems (TODS) Vol. 38, 3 (2013), 18.
[12]
Wenfei Fan, Xin Wang, and Yinghui Wu. 2014. Querying big graphs within bounded resources. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 301--312.
[13]
Jun Gao, Chang Zhou, and Jeffrey Xu Yu. 2016. Toward continuous pattern detection over evolving large graph with snapshot isolation. The VLDB Journal - The International Journal on Very Large Data Bases Vol. 25, 2 (2016), 269--290.
[14]
Wook-Shin Han, Jinsoo Lee, and Jeong-Hoon Lee. 2013. Turbo iso: towards ultrafast and robust subgraph isomorphism search in large graph databases. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, 337--348.
[15]
Huahai He and Ambuj K. Singh. 2008. Graphs-at-a-time: query language and access methods for graph databases SIGMOD. 405--418.
[16]
Chathura Kankanamge, Siddhartha Sahu, Amine Mhedbhi, Jeremy Chen, and Semih Salihoglu. 2017. Graphflow: An Active Graph Database. In Proceedings of the 2017 ACM International Conference on Management of Data. ACM, 1695--1698.
[17]
Jinha Kim, Hyungyu Shin, Wook-Shin Han, Sungpack Hong, and Hassan Chafi. 2015. Taming subgraph isomorphism for RDF query processing. Proceedings of the VLDB Endowment Vol. 8, 11 (2015), 1238--1249.
[18]
Jinsoo Lee, Wook-Shin Han, Romans Kasperovics, and Jeong-Hoon Lee. 2013, http://www-db.knu.ac.kr/vldb13.pdf. An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases. PVLDB Vol. 6, 2 (2013, http://www-db.knu.ac.kr/vldb13.pdf).
[19]
Juchang Lee, Hyungyu Shin, Chang Gyoo Park, Seongyun Ko, Jaeyun Noh, Yongjae Chuh, Wolfgang Stephan, and Wook-Shin Han. 2016. Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA Proceedings of the 2016 International Conference on Management of Data. ACM, 1307--1318.
[20]
Shuai Ma, Yang Cao, Wenfei Fan, Jinpeng Huai, and Tianyu Wo. 2014. Strong simulation: Capturing topology in graph pattern matching. ACM Transactions on Database Systems (TODS) Vol. 39, 1 (2014), 4.
[21]
Thomas Neumann and Gerhard Weikum. 2008. RDF-3X: a RISC-style engine for RDF. Proceedings of the VLDB Endowment Vol. 1, 1 (2008), 647--659.
[22]
Hung Q Ngo, Christopher Ré, and Atri Rudra. 2014. Skew strikes back: New developments in the theory of join algorithms. ACM SIGMOD Record Vol. 42, 4 (2014), 5--16.
[23]
Andrea Pugliese, Matthias Bröcheler, VS Subrahmanian, and Michael Ovelgönne. 2014. Efficient multiview maintenance under insertion in huge social networks. ACM Transactions on the Web (TWEB) Vol. 8, 2 (2014), 10.
[24]
Xuguang Ren and Junhu Wang. 2015. Exploiting vertex relationships in speeding up subgraph isomorphism over large graphs. Proceedings of the VLDB Endowment Vol. 8, 5 (2015), 617--628.
[25]
G Sadowski and Philip Rathle. 2014. Fraud detection: Discovering connections with graph databases. White Paper (Neo4J) (2014).
[26]
Haichuan Shang, Ying Zhang, Xuemin Lin, and Jeffrey Xu Yu. 2008. Taming verification hardness: an efficient algorithm for testing subgraph isomorphism. PVLDB Vol. 1, 1 (2008), 364--375.
[27]
Chunyao Song, Tingjian Ge, Cindy Chen, and Jie Wang. 2014. Event pattern matching over graph streams. Proceedings of the VLDB Endowment Vol. 8, 4 (2014), 413--424.
[28]
Johan Ugander, Brian Karrer, Lars Backstrom, and Cameron Marlow. 2011. The anatomy of the facebook social graph. arXiv preprint arXiv:1111.4503 (2011).
[29]
J. R. Ullmann. 1976. An Algorithm for Subgraph Isomorphism. J. ACM Vol. 23 (January. 1976), 31--42. Issue 1.
[30]
Changliang Wang and Lei Chen. 2009. Continuous subgraph pattern search over graph streams 2009 IEEE 25th International Conference on Data Engineering. IEEE, 393--404.
[31]
Kai Zeng, Jiacheng Yang, Haixun Wang, Bin Shao, and Zhongyuan Wang. 2013. A distributed graph engine for web scale RDF data. In Proceedings of the VLDB Endowment, Vol. Vol. 6. VLDB Endowment, 265--276.
[32]
Peixiang Zhao and Jiawei Han. 2010. On Graph Query Optimization in Large Networks. PVLDB Vol. 3, 1 (2010), 340--351.
[33]
Lei Zou, Jinghui Mo, Lei Chen, M Tamer Özsu, and Dongyan Zhao. 2011. gStore: answering SPARQL queries via subgraph matching. Proceedings of the VLDB Endowment Vol. 4, 8 (2011), 482--493.

Cited By

View all
  • (2024)Understanding High-Performance Subgraph Pattern Matching: A Systems PerspectiveProceedings of the 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3661304.3661897(1-12)Online publication date: 14-Jun-2024
  • (2024)GCSM: GPU-Accelerated Continuous Subgraph Matching for Large Graphs2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00097(1046-1057)Online publication date: 27-May-2024
  • (2024)IVE: Accelerating Enumeration-Based Subgraph Matching via Exploring Isolated Vertices2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00321(4208-4221)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '18: Proceedings of the 2018 International Conference on Management of Data
May 2018
1874 pages
ISBN:9781450347037
DOI:10.1145/3183713
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. continuous subgraph matching
  2. data-centric graph
  3. dynamic graph
  4. edge transition model

Qualifiers

  • Research-article

Funding Sources

  • Oracle Labs

Conference

SIGMOD/PODS '18
Sponsor:

Acceptance Rates

SIGMOD '18 Paper Acceptance Rate 90 of 461 submissions, 20%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)187
  • Downloads (Last 6 weeks)18
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Understanding High-Performance Subgraph Pattern Matching: A Systems PerspectiveProceedings of the 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3661304.3661897(1-12)Online publication date: 14-Jun-2024
  • (2024)GCSM: GPU-Accelerated Continuous Subgraph Matching for Large Graphs2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00097(1046-1057)Online publication date: 27-May-2024
  • (2024)IVE: Accelerating Enumeration-Based Subgraph Matching via Exploring Isolated Vertices2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00321(4208-4221)Online publication date: 13-May-2024
  • (2024)Time-Constrained Continuous Subgraph Matching Using Temporal Information for Filtering and Backtracking2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00252(3257-3269)Online publication date: 13-May-2024
  • (2024)Efficient Multi-Query Oriented Continuous Subgraph Matching2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00250(3230-3243)Online publication date: 13-May-2024
  • (2024)GPU-Accelerated Batch-Dynamic Subgraph Matching2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00248(3204-3216)Online publication date: 13-May-2024
  • (2024)CSM-TopK: Continuous Subgraph Matching with TopK Density Constraints2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00239(3084-3097)Online publication date: 13-May-2024
  • (2024)Ingress: an automated incremental graph processing systemThe VLDB Journal10.1007/s00778-024-00838-z33:3(781-806)Online publication date: 20-Feb-2024
  • (2024)Temporal graph patterns by timed automataThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00795-z33:1(25-47)Online publication date: 1-Jan-2024
  • (2023)Efficient Continuous Subgraph Matching Scheme Based on Trie Indexing for Graph Stream ProcessingApplied Sciences10.3390/app1308513713:8(5137)Online publication date: 20-Apr-2023
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media