Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

Efficient k-Clique Listing: An Edge-Oriented Branching Strategy

Published: 26 March 2024 Publication History
  • Get Citation Alerts
  • Abstract

    k-clique listing is a vital graph mining operator with diverse applications in various networks. The state-of-the-art algorithms all adopt a branch-and-bound (BB) framework with a vertex-oriented branching strategy (called VBBkC), which forms a sub-branch by expanding a partial k-clique with a vertex. These algorithms have the time complexity of O(k · m · (δ/2)k-2 ), where m is the number of edges in the graph and δ is the degeneracy of the graph. In this paper, we propose a BB framework with a new edge-oriented branching (called EBBkC), which forms a sub-branch by expanding a partial k-clique with two vertices that connect each other (which correspond to an edge ). We explore various edge orderings for EBBkC such that it achieves a time complexity of O( m · δ + k · m · (τ/2)k-2 ), where τ is an integer related to the maximum truss number of the graph and we have τ < δ. The time complexity of EBBkC is better than that of VBBkC algorithms for k>3 since both O(m · δ) and O(k · m · (τ/2)k-2 ) are bounded by O(k · m · (δ/2)k-2 ). Furthermore, we develop specialized algorithms for sub-branches on dense graphs so that we can early-terminate them and apply the specialized algorithms. We conduct extensive experiments on 19 real graphs, and the results show that our newly developed EBBkC based algorithms with the early termination technique consistently and largely outperform the state-of-the-art (VBBkC based) algorithms.

    References

    [1]
    Real Graphs. http://lcs.ios.ac.cn/~caisw/Resource/realworld%20graphs.tar.gz.
    [2]
    Balázs Adamcsek, Gergely Palla, Illés J Farkas, Imre Derényi, and Tamás Vicsek. 2006. CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 22, 8 (2006), 1021--1023.
    [3]
    Albert Angel, Nick Koudas, Nikos Sarkas, Divesh Srivastava, Michael Svendsen, and Srikanta Tirthapura. 2014. Dense subgraph maintenance under streaming edge weight updates for real-time story identification. The VLDB journal 23, 2 (2014), 175--199.
    [4]
    Vladimir Batagelj and Matjaz Zaversnik. 2003. An O (m) algorithm for cores decomposition of networks. arXiv preprint cs/0310049 (2003).
    [5]
    Austin R Benson, David F Gleich, and Jure Leskovec. 2016. Higher-order organization of complex networks. Science 353, 6295 (2016), 163--166.
    [6]
    Coen Bron and Joep Kerbosch. 1973. Algorithm 457: finding all cliques of an undirected graph. Commun. ACM 16, 9 (1973), 575--577.
    [7]
    Lijun Chang. 2019. Efficient maximum clique computation over large sparse graphs. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 529--538.
    [8]
    Lijun Chang. 2020. Efficient maximum clique computation and enumeration over large sparse graphs. The VLDB Journal 29, 5 (2020), 999--1022.
    [9]
    Yulin Che, Zhuohang Lai, Shixuan Sun, Yue Wang, and Qiong Luo. 2020. Accelerating truss decomposition on heterogeneous processors. Proceedings of the VLDB Endowment 13, 10 (2020), 1751--1764.
    [10]
    Norishige Chiba and Takao Nishizeki. 1985. Arboricity and subgraph listing algorithms. SIAM Journal on computing 14, 1 (1985), 210--223.
    [11]
    Jonathan Cohen. 2008. Trusses: Cohesive subgraphs for social network analysis. National security agency technical report 16, 3.1 (2008).
    [12]
    Maximilien Danisch, Oana Balalau, and Mauro Sozio. 2018. Listing k-cliques in sparse real-world graphs. In Proceedings of the 2018 World Wide Web Conference. 589--598.
    [13]
    David Eppstein, Maarten Löffler, and Darren Strash. 2010. Listing all maximal cliques in sparse graphs in near-optimal time. In Algorithms and Computation: 21st International Symposium, ISAAC 2010, Jeju Island, Korea, December 15--17, 2010, Proceedings, Part I 21. Springer, 403--414.
    [14]
    Paul Erdös and George Szekeres. 1935. A combinatorial problem in geometry. Compositio mathematica 2 (1935), 463--470.
    [15]
    Yixiang Fang, Kaiqiang Yu, Reynold Cheng, Laks VS Lakshmanan, and Xuemin Lin. 2019. Efficient algorithms for densest subgraph discovery. Proceedings of the VLDB Endowment 12, 11 (2019), 1719--1732.
    [16]
    Irene Finocchi, Marco Finocchi, and Emanuele G Fusco. 2015. Clique counting in mapreduce: Algorithms and experiments. Journal of Experimental Algorithmics (JEA) 20 (2015), 1--20.
    [17]
    Enrico Gregori, Luciano Lenzini, and Simone Mainardi. 2012. Parallel k-clique community detection on large-scale networks. IEEE Transactions on Parallel and Distributed Systems 24, 8 (2012), 1651--1660.
    [18]
    William Hasenplaugh, Tim Kaler, Tao B Schardl, and Charles E Leiserson. 2014. Ordering heuristics for parallel graph coloring. In Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures. 166--177.
    [19]
    Pan Hui and Jon Crowcroft. 2008. Human mobility models and opportunistic communications system design. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 366, 1872 (2008), 2005--2016.
    [20]
    SK Jayanthi. 2012. Clique-attacks detection in web search engine for spamdexing using k-clique percolation technique. International Journal of Machine Learning and Computing 2, 5 (2012), 648.
    [21]
    Yan Jin, Bowen Xiong, Kun He, Yangming Zhou, and Yi Zhou. 2022. On fast enumeration of maximal cliques in large graphs. Expert Systems with Applications 187 (2022), 115915.
    [22]
    Richard M Karp. 2010. Reducibility among combinatorial problems. Springer.
    [23]
    Matthieu Latapy. 2008. Main-memory triangle computations for very large (sparse (power-law)) graphs. Theoretical computer science 407, 1--3 (2008), 458--473.
    [24]
    Ronghua Li, Sen Gao, Lu Qin, Guoren Wang, Weihua Yang, and Jeffrey Xu Yu. 2020. Ordering Heuristics for k-clique Listing. Proc. VLDB Endow. (2020).
    [25]
    Can Lu, Jeffrey Xu Yu, Hao Wei, and Yikai Zhang. 2017. Finding the maximum clique in massive graphs. Proceedings of the VLDB Endowment 10, 11 (2017), 1538--1549.
    [26]
    Kazuhisa Makino and Takeaki Uno. 2004. New algorithms for enumerating all maximal cliques. In Scandinavian workshop on algorithm theory. Springer, 260--272.
    [27]
    Kevin A Naudé. 2016. Refined pivot selection for maximal clique enumeration in graphs. Theoretical Computer Science 613 (2016), 28--37.
    [28]
    Mark Ortmann and Ulrik Brandes. 2014. Triangle listing algorithms: Back from the diversion. In 2014 Proceedings of the Sixteenth Workshop on Algorithm Engineering and Experiments (ALENEX). SIAM, 1--8.
    [29]
    Patric RJ Östergård. 2002. A fast algorithm for the maximum clique problem. Discrete Applied Mathematics 120, 1--3 (2002), 197--207.
    [30]
    Gergely Palla, Imre Derényi, Illés Farkas, and Tamás Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. nature 435, 7043 (2005), 814--818.
    [31]
    Gergely Palla, Imre Derényi, Illés Farkas, and Tamás Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. nature 435, 7043 (2005), 814--818.
    [32]
    Bharath Pattabiraman, Md Mostofa Ali Patwary, Assefaw H Gebremedhin, Wei-keng Liao, and Alok Choudhary. 2015. Fast algorithms for the maximum clique problem on massive graphs with applications to overlapping community detection. Internet Mathematics 11, 4--5 (2015), 421--448.
    [33]
    Ryan Rossi and Nesreen Ahmed. 2015. The network data repository with interactive graph analytics and visualization. In Twenty-Ninth AAAI Conference on Artificial Intelligence.
    [34]
    Hiroo Saito, Masashi Toyoda, Masaru Kitsuregawa, and Kazuyuki Aihara. 2007. A large-scale study of link spam detection by graph algorithms. In Proceedings of the 3rd international workshop on Adversarial information retrieval on the web. 45--48.
    [35]
    Ahmet Erdem Sariyuce, C Seshadhri, Ali Pinar, and Umit V Catalyurek. 2015. Finding the hierarchy of dense subgraphs using nucleus decompositions. In Proceedings of the 24th International Conference on World Wide Web. 927--937.
    [36]
    Matthew C Schmidt, Nagiza F Samatova, Kevin Thomas, and Byung-Hoon Park. 2009. A scalable, parallel algorithm for maximal clique enumeration. Journal of parallel and distributed computing 69, 4 (2009), 417--428.
    [37]
    UNO Takeaki. 2012. Implementation issues of clique enumeration algorithm. Special issue: Theoretical computer science and discrete mathematics, Progress in Informatics 9 (2012), 25--30.
    [38]
    Etsuji Tomita, Akira Tanaka, and Haruhisa Takahashi. 2006. The worst-case time complexity for generating all maximal cliques and computational experiments. Theoretical computer science 363, 1 (2006), 28--42.
    [39]
    Charalampos Tsourakakis. 2015. The k-clique densest subgraph problem. In Proceedings of the 24th international conference on world wide web. 1122--1132.
    [40]
    Charalampos Tsourakakis, Francesco Bonchi, Aristides Gionis, Francesco Gullo, and Maria Tsiarli. 2013. Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. 104--112.
    [41]
    Jia Wang and James Cheng. 2012. Truss decomposition in massive networks. Proceedings of the VLDB Endowment 5, 9 (2012), 812--823.
    [42]
    Kaixin Wang, Kaiqiang Yu, and Cheng Long. 2023. Efficient k-Clique Listing: An Edge-Oriented Branching Strategy (Technical Report). https://github.com/wangkaixin219/EBBkC/blob/main/EBBkC_TR.pdf.
    [43]
    Jun Wu, Chu-Min Li, Lu Jiang, Junping Zhou, and Minghao Yin. 2020. Local search for diversified top-k clique search problem. Computers & Operations Research 116 (2020), 104867.
    [44]
    Long Yuan, Lu Qin, Xuemin Lin, Lijun Chang, and Wenjie Zhang. 2016. Diversified top-k clique search. The VLDB Journal 25, 2 (2016), 171--196.
    [45]
    Long Yuan, Lu Qin, Xuemin Lin, Lijun Chang, and Wenjie Zhang. 2017. Effective and efficient dynamic graph coloring. Proceedings of the VLDB Endowment 11, 3 (2017), 338--351.
    [46]
    Zhirong Yuan, You Peng, Peng Cheng, Li Han, Xuemin Lin, Lei Chen, and Wenjie Zhang. 2022. Efficient k-clique listing with set intersection speedup. ICDE. IEEE (2022).
    [47]
    Xiao Zhou and Takao Nishizeki. 1994. Edge-coloring and f-coloring for various classes of graphs. In Algorithms and Computation: 5th International Symposium, ISAAC'94 Beijing, PR China, August 25--27, 1994 Proceedings 5. Springer, 199--207.

    Cited By

    View all
    • (2024)On Searching Maximum Directed $(k, \ell)$-Plex2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00202(2570-2583)Online publication date: 13-May-2024

    Index Terms

    1. Efficient k-Clique Listing: An Edge-Oriented Branching Strategy

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Proceedings of the ACM on Management of Data
      Proceedings of the ACM on Management of Data  Volume 2, Issue 1
      SIGMOD
      February 2024
      1874 pages
      EISSN:2836-6573
      DOI:10.1145/3654807
      Issue’s Table of Contents
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 26 March 2024
      Published in PACMMOD Volume 2, Issue 1

      Author Tags

      1. branch-and-bound
      2. graph mining
      3. k-clique listing

      Qualifiers

      • Research-article

      Funding Sources

      • Ministry of Education, Singapore

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)153
      • Downloads (Last 6 weeks)53
      Reflects downloads up to 27 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)On Searching Maximum Directed $(k, \ell)$-Plex2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00202(2570-2583)Online publication date: 13-May-2024

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Full Access

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media