Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3488560.3498446acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Efficient Reachability Query with Extreme Labeling Filter

Published: 15 February 2022 Publication History

Abstract

Being a fundamental graph operator, reachability query has been widely studied by the data mining community in the past decades. In a directed acyclic graph (DAG), one vertex is reachable by another if there exists a chain of directed edges connecting the two vertexes. The state-of-the-art (SOTA) reachability query methods mostly first index all the vertexes in the underlying DAG and assign them with different labels, and then use these indexes and/or labels to efficiently filter out as many unreachable queries as possible. Thus, because a large portion of unreachable queries can be identified without evoking any tedious path-finding process, the overall time taken by a huge number of queries is much shortened with a tolerable compensation on the additional index and/or label preprocessing time and space. In this paper, we propose the Extreme Labeling Filter (ELF), which is a novel generic filter that can be applied to existing reachability query methods to additionally identify a large number of unreachable queries. Based on the analysis of the given DAG in a systematic and autonomous manner, ELF first determines whether to use predecessors or successors to label the vertexes. Based on such self-determined labels, ELF is then able to identify a large number of unreachable queries with a low time complexity of O(1). To evaluate the performance of ELF, we apply it on 4 reachability query methods (1 conventional and 3 SOTA, all designated for reachability query in DAGs) and conduct experiments on 17 datasets of different sizes. The experimental results show that by applying ELF, all methods significantly shorten the query time.

Supplementary Material

MP4 File (10.1145 3488560.3498446 Efficient Reachability Query with Extreme Labeling Filter.mp4)
Being a fundamental graph operator, reachability query has been widely studied by the data mining community in the past decades. In a directed acyclic graph (DAG), one vertex is reachable by another if there exists a chain of directed edges connecting the two vertexes. In this paper, we propose the Extreme Labeling Filter (ELF), which is a novel generic filter that can be applied to existing reachability query methods to additionally identify a large number of unreachable queries. Based on the analysis of the given DAG in a systematic and autonomous manner, ELF first determines whether to use predecessors or successors to label the vertexes. Based on such self-determined labels, ELF is then able to identify a large number of unreachable queries with a low time complexity. To evaluate the performance of ELF, we apply it on 4 reachability query methods (1 conventional and 3 SOTA, all designated for reachability query in DAGs) and conduct experiments on 17 datasets of different sizes.

References

[1]
Yangjun Chen and Yibin Chen. 2008. An efficient algorithm for answering graph reachability queries. In Proceedings of IEEE International Conference on Data Engineering. 893--902.
[2]
James Cheng, Silu Huang, Huanhuan Wu, and Ada Wai-Chee Fu. 2013. TF-label: A topological-folding labeling scheme for reachability querying in a large graph. In Proceedings of ACM SIGMOD International Conference on Management of Data. 193--204.
[3]
Theodoros Chondrogiannis, Mario A. Nascimento, and Panagiotis Bouros. 2019. Relative reachability analysis as a tool for urban mobility planning: Position paper. In Proceedings of the 12th ACM SIGSPATIAL International Workshop on Computational Transportation Science (IWCTS'19). Article 1, bibinfonumpages4 pages.
[4]
Edith Cohen, Eran Halperin, Haim Kaplan, and Uri Zwick. 2003. Reachability and distance queries via 2-hop labels. Society for Industrial and Applied Mathematics Journal on Computing, Vol. 32, 5 (2003), 1338--1355.
[5]
Yi Cui, Jian Pei, Guanting Tang, Wo-Shun Luk, Daxin Jiang, and Ming Hua. 2013. Finding email correspondents in online social networks. World Wide Web, Vol. 16, 2 (2013), 195--218.
[6]
Ruoming Jin and Guan Wang. 2013. Simple, fast, and scalable reachability oracle. arXiv preprint arXiv:1305.0502 (2013).
[7]
Ruoming Jin, Yang Xiang, Ning Ruan, and David Fuhry. 2009. 3-hop: A high-compression indexing scheme for reachability query. In Proceedings of ACM SIGMOD International Conference on Management of Data. 813--826.
[8]
Ruoming Jin, Yang Xiang, Ning Ruan, and Haixun Wang. 2008. Efficiently answering reachability queries on very large directed graphs. In Proceedings of ACM SIGMOD International Conference on Management of Data . 595--608.
[9]
Tianze Luo, Budhitama Subagdja, Di Wang, and Ah-Hwee Tan. 2019. Multi-agent collaborative exploration through graph-based deep reinforcement learning. In IEEE International Conference on Agents. IEEE, 2--7.
[10]
Stephan Seufert, Avishek Anand, Srikanta Bedathur, and Gerhard Weikum. 2013. Ferrari: Flexible and efficient reachability range assignment for graph indexing. In Proceedings of IEEE International Conference on Data Engineering . 1009--1020.
[11]
Garlik Steve Harris. 2010. Sparql 1.1 Query Language. http://www.w3.org/TR/sparql11-query/#propertypaths
[12]
Jiao Su, Qing Zhu, Hao Wei, and Jeffrey Xu Yu. 2016. Reachability querying: Can it be even faster? IEEE Transactions on Knowledge and Data Engineering, Vol. 29, 3 (2016), 683--697.
[13]
Renê Rodrigues Veloso, Lo"ic Cerf, Wagner Meira Jr, and Mohammed J Zaki. 2014. Reachability queries in very large graphs: A fast refined online search approach. In Proceedings of International Conference on Extending Database Technology . 511--522.
[14]
Hao Wei, Jeffrey Xu Yu, Can Lu, and Ruoming Jin. 2014. Reachability querying: An independent permutation labeling approach. Proceedings of the VLDB Endowment, Vol. 7, 12 (2014), 1191--1202.
[15]
Hilmi Yildirim, Vineet Chaoji, and Mohammed J Zaki. 2010. Grail: Scalable reachability index for large graphs. Proceedings of the VLDB Endowment, Vol. 3, 1--2 (2010), 276--284.
[16]
Hilmi Yildirim, Vineet Chaoji, and Mohammed J Zaki. 2012. GRAIL: A scalable index for reachability queries in very large graphs. The VLDB Journal, Vol. 21, 4 (2012), 509--534.
[17]
Pingpeng Yuan, Yujie You, Shuang Zhou, Hai Jin, and Ling Liu. 2021, in press. Providing fast reachability query services with MGTag: A multi-dimensional graph labeling method. IEEE Transactions on Services Computing (2021, in press).
[18]
Yaping Zheng, Xiaofeng Zhang, Shiyi Chen, Xinni Zhang, Xiaofei Yang, and Di Wang. 2021. When Convolutional Network Meets Temporal Heterogeneous Graphs: An Effective Community Detection Method. IEEE Transactions on Knowledge and Data Engineering (2021).
[19]
Shuang Zhou, Pingpeng Yuan, Ling Liu, and Hai Jin. 2018. MGTag: A multi-dimensional graph labeling scheme for fast reachability queries. In Proceedings of IEEE International Conference on Data Engineering . 1372--1375.
[20]
Andy Diwen Zhu, Wenqing Lin, Sibo Wang, and Xiaokui Xiao. 2014. Reachability queries on large dynamic graphs: A total order approach. In Proceedings of ACM SIGMOD International Conference on Management of Data . 1323--1334.

Cited By

View all
  • (2024)BL: An Efficient Index for Reachability Queries on Large GraphsIEEE Transactions on Big Data10.1109/TBDATA.2023.332721510:2(108-121)Online publication date: Apr-2024
  • (2023)Incorporating Surprisingly Popular Algorithm and Euclidean distance-based adaptive topology into PSOSwarm and Evolutionary Computation10.1016/j.swevo.2022.10122276(101222)Online publication date: Feb-2023

Index Terms

  1. Efficient Reachability Query with Extreme Labeling Filter

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN:9781450391320
    DOI:10.1145/3488560
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. directed acyclic graph
    2. query time
    3. reachability query

    Qualifiers

    • Research-article

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)BL: An Efficient Index for Reachability Queries on Large GraphsIEEE Transactions on Big Data10.1109/TBDATA.2023.332721510:2(108-121)Online publication date: Apr-2024
    • (2023)Incorporating Surprisingly Popular Algorithm and Euclidean distance-based adaptive topology into PSOSwarm and Evolutionary Computation10.1016/j.swevo.2022.10122276(101222)Online publication date: Feb-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media