Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Maximal Defective Clique Enumeration

Published: 30 May 2023 Publication History

Abstract

Maximal clique enumeration is a fundamental operator in graph analysis. The model of clique, however, is typically too restrictive for real-world applications as it requires an edge for every pair of vertices. To remedy this restriction, practical graph analysis applications often resort to find relaxed cliques as alternatives. In this work, we investigate a notable relaxed clique model, called s-defective clique, which allows at most s edges to be missing. Similar to the complexity of maximal clique enumeration, the problem of enumerating all maximal s-defective cliques is also NP-hard. To solve this problem, we first develop a new polynomial-delay algorithm based on a carefully-designed reverse search technique, which can output two consecutive results within polynomial time. To achieve better practical efficiency, we propose a branch-and-bound algorithm with a novel pivoting technique. We prove that the time complexity of this algorithm depends only on O(α_sn) or O(αsδ) when using a degeneracy ordering optimization, where αs is a positive real number strictly less than 2, and δ (δ <n) is the degeneracy of the graph. To our knowledge, this is the first algorithm that can break the O(2n) time complexity to enumerate all maximal s-defective cliques (s>0). We also develop several new pruning techniques to further improve the efficiency of our branch-and-bound algorithm to enumerate all relatively-large maximal s-defective cliques. In addition, we further generalize our pivot-based branch-and-bound algorithm to enumerate all maximal subgraphs satisfying a hereditary property. Here we call a graph meeting the hereditary property if all its subgraphs have the same property as itself. Finally, extensive experiments on 11 datasets demonstrate the efficiency, effectiveness, and scalability of the proposed solutions.

Supplemental Material

MP4 File
Presentation video

References

[1]
Aman Abidi, Rui Zhou, Lu Chen, and Chengfei Liu. 2020. Pivot-based Maximal Biclique Enumeration. In IJCAI. 3558--3564.
[2]
David Avis and Komei Fukuda. 1996. Reverse Search for Enumeration. Discret. Appl. Math., Vol. 65, 1--3 (1996), 21--46.
[3]
Gary D. Bader and Christopher WV Hogue. 2002. Analyzing yeast protein--protein interaction data obtained from different sources. Nature biotechnology, Vol. 20, 10 (2002), 991--997.
[4]
Vladimir Batagelj and Matjaz Zaversnik. 2003. An O(m) Algorithm for Cores Decomposition of Networks. CoRR, Vol. cs.DS/0310049 (2003).
[5]
Punam Bedi and Chhavi Sharma. 2016. Community detection in social networks. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Vol. 6, 3 (2016), 115--135.
[6]
Rachel Behar and Sara Cohen. 2018. Finding All Maximal Connected s-Cliques in Social Networks. In EDBT. 61--72.
[7]
Devora Berlowitz, Sara Cohen, and Benny Kimelfeld. 2015. Efficient Enumeration of Maximal k-Plexes. In SIGMOD. 431--444.
[8]
Vladimir Boginski, Sergiy Butenko, and Panos M Pardalos. 2005. Statistical analysis of financial networks. Computational statistics & data analysis, Vol. 48, 2 (2005), 431--443.
[9]
Vladimir Boginski, Sergiy Butenko, and Panos M Pardalos. 2006. Mining market data: A network approach. Computers & Operations Research, Vol. 33, 11 (2006), 3171--3184.
[10]
Coenraad Bron and Joep Kerbosch. 1973. Finding All Cliques of an Undirected Graph (Algorithm 457). Commun. ACM, Vol. 16, 9 (1973), 575--576.
[11]
Sergiy Butenko and Wilbert E. Wilhelm. 2006. Clique-detection models in computational biochemistry and genomics. Eur. J. Oper. Res., Vol. 173, 1 (2006), 1--17.
[12]
Frédéric Cazals and Chinmay Karande. 2006. Reporting maximal cliques: new insights into an old problem. Ph.,D. Dissertation. INRIA.
[13]
Xiaoyu Chen, Yi Zhou, Jin-Kao Hao, and Mingyu Xiao. 2021. Computing maximum k-defective cliques in massive graphs. Comput. Oper. Res., Vol. 127 (2021), 105131.
[14]
Zi Chen, Long Yuan, Xuemin Lin, Lu Qin, and Jianye Yang. 2020. Efficient Maximal Balanced Clique Enumeration in Signed Networks. In WWW. 339--349.
[15]
Sara Cohen, Benny Kimelfeld, and Yehoshua Sagiv. 2008. Generating all maximal induced subgraphs for hereditary and connected-hereditary graph properties. J. Comput. Syst. Sci., Vol. 74, 7 (2008), 1147--1159.
[16]
Alessio Conte, Roberto Grossi, Andrea Marino, and Luca Versari. 2019. Listing Maximal Subgraphs Satisfying Strongly Accessible Properties. SIAM J. Discret. Math., Vol. 33, 2 (2019), 587--613.
[17]
Alessio Conte, Tiziano De Matteis, Daniele De Sensi, Roberto Grossi, Andrea Marino, and Luca Versari. 2018. D2K: Scalable Community Detection in Massive Networks via Small-Diameter k-Plexes. In KDD. 1272--1281.
[18]
Qiangqiang Dai, Rong-Hua Li, Meihao Liao, and Guoren Wang. 2023. Maximal Defective Clique Enumeration (full version). https://github.com/qq-dai/DefectiveClique/blob/main/DefectiveClique-full.pdf
[19]
Naga Shailaja Dasari, Desh Ranjan, and Mohammad Zubair. 2014. pbitMCE: A bit-based approach for maximal clique enumeration on multicore processors. In ICPADS. 478--485.
[20]
Nan Du, Bin Wu, Xin Pei, Bai Wang, and Liutong Xu. 2007. Community detection in large-scale social networks. In WebKDD workshop. 16--25.
[21]
David Eppstein, Maarten Lö ffler, and Darren Strash. 2010. Listing All Maximal Cliques in Sparse Graphs in Near-Optimal Time. In ISAAC, Vol. 6506. 403--414.
[22]
V. Fomin Fedor and Kratsch Dieter. 2010. Exact Exponential Algorithms. Springer.
[23]
Anne-Claude Gavin, Markus Bösche, Roland Krause, Paola Grandi, Martina Marzioch, Andreas Bauer, Jörg Schultz, Jens M. Rick, Anne-Marie Michon, Cristina-Maria Cruciat, et al. 2002. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature, Vol. 415, 6868 (2002), 141--147.
[24]
Eric Harley, Anthony Bonner, and Nathan Goodman. 2001. Uniform integration of genome mapping data using intersection graphs. Bioinformatics, Vol. 17, 6 (2001), 487--494.
[25]
Anne-Sophie Himmel, Hendrik Molter, Rolf Niedermeier, and Manuel Sorge. 2016. Enumerating maximal cliques in temporal graphs. In ASONAM. 337--344.
[26]
George Kollios, Michalis Potamias, and Evimaria Terzi. 2013. Clustering Large Probabilistic Graphs. IEEE Trans. Knowl. Data Eng., Vol. 25, 2 (2013), 325--336.
[27]
Andrea Lancichinetti and Santo Fortunato. 2009. Community detection algorithms: a comparative analysis. Physical review E., Vol. 80, 5 (2009), 056117.
[28]
Jinyan Li, Guimei Liu, Haiquan Li, and Limsoon Wong. 2007. Maximal Biclique Subgraphs and Closed Pattern Pairs of the Adjacency Matrix: A One-to-One Correspondence and Mining Algorithms. IEEE Trans. Knowl. Data Eng., Vol. 19, 12 (2007), 1625--1637.
[29]
Rong-Hua Li, Qiangqiang Dai, Lu Qin, Guoren Wang, Xiaokui Xiao, Jeffrey Xu Yu, and Shaojie Qiao. 2018. Efficient Signed Clique Search in Signed Networks. In ICDE. 245--256.
[30]
Rong-Hua Li, Qiangqiang Dai, Guoren Wang, Zhong Ming, Lu Qin, and Jeffrey Xu Yu. 2019. Improved Algorithms for Maximal Clique Search in Uncertain Networks. In ICDE. 1178--1189.
[31]
Rong-Hua Li, Qiushuo Song, Xiaokui Xiao, Lu Qin, Guoren Wang, Jeffrey Xu Yu, and Rui Mao. 2022. I/O-Efficient Algorithms for Degeneracy Computation on Massive Networks. IEEE Trans. Knowl. Data Eng., Vol. 34, 7 (2022), 3335--3348.
[32]
Don R. Lick and Arthur T. White. 1970. k-Degenerate graphs. Canadian Journal of Mathematics, Vol. 22, 5 (1970), 1082--1096.
[33]
Longlong Lin, Pingpeng Yuan, Rong-Hua Li, Jifei Wang, Ling Liu, and Hai Jin. 2022. Mining Stable Quasi-Cliques on Temporal Networks. IEEE Trans. Syst. Man Cybern. Syst., Vol. 52, 6 (2022), 3731--3745.
[34]
Guimei Liu and Limsoon Wong. 2008. Effective Pruning Techniques for Mining Quasi-Cliques. In ECML/PKDD, Vol. 5212. 33--49.
[35]
R. Duncan Luce. 1950. Connectivity and generalized cliques in sociometric group structure. Psychometrika, Vol. 15, 2 (1950), 169--190.
[36]
John W. Moon and Leo Moser. 1965. On cliques in graphs. Israel journal of Mathematics, Vol. 3, 1 (1965), 23--28.
[37]
Arko Mukherjee, Pan Xu, and Srikanta Tirthapura. 2017. Enumeration of Maximal Cliques from an Uncertain Graph. IEEE Trans. Knowl. Data Eng., Vol. 29, 3 (2017), 543--555.
[38]
Kevin A. Naudé. 2016. Refined pivot selection for maximal clique enumeration in graphs. Theor. Comput. Sci., Vol. 613 (2016), 28--37.
[39]
Gergely Palla, Imre Derényi, Illés Farkas, and Tamás Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. nature, Vol. 435, 7043 (2005), 814--818.
[40]
Jeffrey Pattillo, Nataly Youssef, and Sergiy Butenko. 2013. On clique relaxation models in network analysis. Eur. J. Oper. Res., Vol. 226, 1 (2013), 9--18.
[41]
Jian Pei, Daxin Jiang, and Aidong Zhang. 2005. On mining cross-graph quasi-cliques. In KDD. 228--238.
[42]
Hongchao Qin, Rong-Hua Li, Guoren Wang, Lu Qin, Yurong Cheng, and Ye Yuan. 2019. Mining Periodic Cliques in Temporal Networks. In ICDE. 1130--1141.
[43]
Pablo San Segundo, Jorge Artieda, and Darren Strash. 2018. Efficiently enumerating all maximal cliques with bit-parallelism. Comput. Oper. Res., Vol. 92 (2018), 37--46.
[44]
Stephen B. Seidman and Brian L. Foster. 1978. A graph?theoretic generalization of the clique concept. Journal of Mathematical Sociology, Vol. 6, 1 (1978), 139--154.
[45]
Uno Takeaki. 2003. Two general methods to reduce delay and change of enumeration algorithms. Technical Report. Technical Report.
[46]
Etsuji Tomita, Tatsuya Akutsu, and Tsutomu Matsunaga. 2011. Efficient algorithms for finding maximum and maximal cliques: Effective tools for bioinformatics. IntechOpen.
[47]
Etsuji Tomita, Akira Tanaka, and Haruhisa Takahashi. 2006. The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci., Vol. 363, 1 (2006), 28--42.
[48]
Svyatoslav Trukhanov, Chitra Balasubramaniam, Balabhaskar Balasundaram, and Sergiy Butenko. 2013. Algorithms for detecting optimal hereditary structures in graphs, with application to clique relaxations. Comput. Optim. Appl., Vol. 56, 1 (2013), 113--130.
[49]
Zhengren Wang, Yi Zhou, Mingyu Xiao, and Bakhadyr Khoussainov. 2022. Listing Maximal k-Plexes in Large Real-World Graphs. In WWW.
[50]
Haiyuan Yu, Alberto Paccanaro, Valery Trifonov, and Mark Gerstein. 2006. Predicting interactions in protein networks by completing defective cliques. Bioinform., Vol. 22, 7 (2006), 823--829.
[51]
Kaiqiang Yu, Cheng Long, Shengxin Liu, and Da Yan. 2022. Efficient Algorithms for Maximal k-Biplex Enumeration. In SIGMOD. 860--873.
[52]
Yun Zhang, Charles A. Phillips, Gary L. Rogers, Erich J. Baker, Elissa J. Chesler, and Michael A. Langston. 2014. On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types. BMC Bioinform., Vol. 15 (2014), 110.
[53]
Yi Zhou, Weibo Lin, Jin-Kao Hao, Mingyu Xiao, and Yan Jin. 2022. An effective branch-and-bound algorithm for the maximum s-bundle problem. Eur. J. Oper. Res., Vol. 297, 1 (2022), 27--39.
[54]
Yi Zhou, Jingwei Xu, Zhenyu Guo, Mingyu Xiao, and Yan Jin. 2020. Enumerating Maximal k-Plexes with Worst-Case Time Guarantee. In AAAI. 2442--2449.
[55]
Zhaonian Zou, Jianzhong Li, Hong Gao, and Shuo Zhang. 2010. Finding top-k maximal cliques in an uncertain graph. In ICDE. 649--652.

Cited By

View all
  • (2024)Efficient Algorithms for Density Decomposition on Large Static and Dynamic GraphsProceedings of the VLDB Endowment10.14778/3681954.368197417:11(2933-2945)Online publication date: 30-Aug-2024
  • (2024)Theoretically and Practically Efficient Maximum Defective Clique SearchProceedings of the ACM on Management of Data10.1145/36771422:4(1-27)Online publication date: 30-Sep-2024
  • (2024)SpeedCore: Space-efficient and Dependency-aware GPU Parallel Framework for Core DecompositionProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673111(555-564)Online publication date: 12-Aug-2024
  • Show More Cited By

Index Terms

  1. Maximal Defective Clique Enumeration

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Management of Data
    Proceedings of the ACM on Management of Data  Volume 1, Issue 1
    PACMMOD
    May 2023
    2807 pages
    EISSN:2836-6573
    DOI:10.1145/3603164
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 May 2023
    Published in PACMMOD Volume 1, Issue 1

    Permissions

    Request permissions for this article.

    Author Tags

    1. branch-and-bound
    2. cohesive subgraph
    3. reverse search
    4. s-defective clique

    Qualifiers

    • Research-article

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)160
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 12 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Efficient Algorithms for Density Decomposition on Large Static and Dynamic GraphsProceedings of the VLDB Endowment10.14778/3681954.368197417:11(2933-2945)Online publication date: 30-Aug-2024
    • (2024)Theoretically and Practically Efficient Maximum Defective Clique SearchProceedings of the ACM on Management of Data10.1145/36771422:4(1-27)Online publication date: 30-Sep-2024
    • (2024)SpeedCore: Space-efficient and Dependency-aware GPU Parallel Framework for Core DecompositionProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673111(555-564)Online publication date: 12-Aug-2024
    • (2024)On Searching Maximum Directed $(k, \ell)$-Plex2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00202(2570-2583)Online publication date: 13-May-2024
    • (2024)A two-phase approach for enumeration of maximal $$(\Delta , \gamma )$$-cliques of a temporal networkSocial Network Analysis and Mining10.1007/s13278-024-01207-y14:1Online publication date: 8-Mar-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media