Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3637528.3671615acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

DAG: Deep Adaptive and Generative K-Free Community Detection on Attributed Graphs

Published: 24 August 2024 Publication History

Abstract

Community detection on attributed graphs with rich semantic and topological information offers great potential for real-world network analysis, especially user matching in online games. Graph Neural Networks (GNNs) have recently enabled Deep Graph Clustering (DGC) methods to learn cluster assignments from semantic and topological information. However, their success depends on the prior knowledge related to the number of communities K, which is unrealistic due to the high costs and privacy issues of acquisition. In this paper, we investigate the community detection problem without prior K, referred to as K-Free Community Detection problem. To address this problem, we propose a novel Deep Adaptive and Generative model~(DAG) for community detection without specifying the prior K. DAG consists of three key components, i.e., a node representation learning module with masked attribute reconstruction, a community affiliation readout module, and a community number search module with group sparsity. These components enable DAG to convert the process of non-differentiable grid search for the community number, i.e., a discrete hyperparameter in existing DGC methods, into a differentiable learning process. In such a way, DAG can simultaneously perform community detection and community number search end-to-end. To alleviate the cost of acquiring community labels in real-world applications, we design a new metric, EDGE, to evaluate community detection methods even when the labels are not feasible. Extensive offline experiments on five public datasets and a real-world online mobile game dataset demonstrate the superiority of our DAG over the existing state-of-the-art (SOTA) methods. DAG has a relative increase of 7.35% in teams in a Tencent online game compared with the best competitor.

Supplemental Material

MP4 File - DAG: Deep Adaptive and Generative K-Free Community Detection on Attributed Graphs
Promo Video of DAG: Deep Adaptive and Generative K-Free Community Detection on Attributed Graphs

References

[1]
Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg, and Eric P. Xing. 2008. Mixed Membership Stochastic Blockmodels. In NIPS. Curran Associates, Inc., 33--40.
[2]
Wendong Bi, Bingbing Xu, Xiaoqian Sun, Zidong Wang, Huawei Shen, and Xueqi Cheng. 2022. Company-as-Tribe: Company Financial Risk Assessment on Tribe-Style Graph with Hierarchical Graph Neural Networks. In KDD. ACM, 2712--2720.
[3]
Wendong Bi, Bingbing Xu, Xiaoqian Sun, Easton Li Xu, Huawei Shen, and Xueqi Cheng. 2023. Predicting the Silent Majority on Graphs: Knowledge Transferable Graph Neural Network. In WWW. ACM, 274--285.
[4]
Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre. 2008. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, Vol. 2008 (2008), 10008.
[5]
Ronald L Breiger. 1974. The duality of persons and groups. Social forces, Vol. 53, 2 (1974), 181--190.
[6]
Tadeusz Cali'nski and Jerzy Harabasz. 1974. A dendrite method for cluster analysis. Communications in Statistics-theory and Methods, Vol. 3, 1 (1974), 1--27.
[7]
Tanmoy Chakraborty, Ayushi Dalmia, Animesh Mukherjee, and Niloy Ganguly. 2017. Metrics for Community Analysis: A Survey. ACM Comput. Surv., Vol. 50, 4 (2017), 54:1--54:37.
[8]
Aaron Clauset, Mark E. J. Newman, and Cristopher Moore. 2004. Finding community structure in very large networks. Physical review. E, Statistical, nonlinear, and soft matter physics, Vol. 70 6 Pt 2 (2004), 066111.
[9]
Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu. 1996. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In KDD. AAAI Press, 226--231.
[10]
Adil Fahad, Najlaa Alshatri, Zahir Tari, Abdullah Alamri, Ibrahim Khalil, Albert Y. Zomaya, Sebti Foufou, and Abdelaziz Bouras. 2014. A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis. IEEE Trans. Emerg. Top. Comput., Vol. 2, 3 (2014), 267--279.
[11]
Javier O. Garcia, Arian Ashourvan, Sarah Muldoon, Jean M. Vettel, and Danielle S. Bassett. 2018. Applications of Community Detection Techniques to Brain Graphs: Algorithmic Considerations and Implications for Neural Function. Proc. IEEE, Vol. 106, 5 (2018), 846--867.
[12]
Michelle Girvan and Mark E. J. Newman. 2001. Community structure in social and biological networks. Proceedings of the National Academy of Sciences of the United States of America, Vol. 99 (2001), 7821 -- 7826.
[13]
Xiaotian Han, Zhimeng Jiang, Ninghao Liu, Qingquan Song, Jundong Li, and Xia Hu. 2022. Geometric Graph Representation Learning via Maximizing Rate Reduction. (2022), 1226--1237.
[14]
John A. Hartigan and M. Anthony. Wong. 1979. A k-means clustering algorithm.
[15]
Trevor Hastie, Robert Tibshirani, et al. 2009. The elements of statistical learning: data mining, inference, and prediction.
[16]
Dongxiao He, Yue Song, Di Jin, Zhiyong Feng, Binbin Zhang, Zhizhi Yu, and Weixiong Zhang. 2020. Community-Centric Graph Convolutional Network for Unsupervised Community Detection. In IJCAI. ijcai.org, 3515--3521.
[17]
Geoffrey E. Hinton, Simon Osindero, and Yee Whye Teh. 2006. A Fast Learning Algorithm for Deep Belief Nets. Neural Comput., Vol. 18, 7 (2006), 1527--1554.
[18]
Qirong Ho, Wenqing Lin, Eran Shaham, Shonali Krishnaswamy, The Anh Dang, Jingxuan Wang, Isabel Choo Zhongyan, and Amy She-Nash. 2016. A Distributed Graph Algorithm for Discovering Unique Behavioral Groups from Large-Scale Telco Data. In CIKM. ACM, 1353--1362.
[19]
Paul W Holland, Kathryn Blackmond Laskey, and Samuel Leinhardt. 1983. Stochastic blockmodels: First steps. Social networks, Vol. 5, 2 (1983), 109--137.
[20]
Zhenyu Hou, Yufei He, Yukuo Cen, Xiao Liu, Yuxiao Dong, Evgeny Kharlamov, and Jie Tang. 2023. GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner. In WWW. ACM, 737--746.
[21]
Zhenyu Hou, Xiao Liu, Yukuo Cen, Yuxiao Dong, Hongxia Yang, Chunjie Wang, and Jie Tang. 2022. GraphMAE: Self-Supervised Masked Graph Autoencoders. In KDD. ACM, 594--604.
[22]
Shixun Huang, Wenqing Lin, Zhifeng Bao, and Jiachen Sun. 2022. Influence Maximization in Real-World Closed Social Networks. Proc. VLDB Endow., Vol. 16, 2 (2022), 180--192.
[23]
Di Jin, Meng Ge, Liang Yang, Dongxiao He, Longbiao Wang, and Weixiong Zhang. 2018. Integrative Network Embedding via Deep Joint Reconstruction. In IJCAI. ijcai.org, 3407--3413.
[24]
Di Jin, Zhizhi Yu, Pengfei Jiao, Shirui Pan, et al. 2021. A Survey of Community Detection Approaches: From Statistical Modeling to Deep Learning. IEEE Transactions on Knowledge and Data Engineering, Vol. 35 (2021), 1149--1170.
[25]
Mohammad Reza Keyvanpour, Mehrnoush Barani Shirzad, and Maryam Ghaderi. 2020. AD-C: a new node anomaly detection based on community detection in social networks. Int. J. Electron. Bus., Vol. 15, 3 (2020), 199--222.
[26]
Silvio Lattanzi and D. Sivakumar. 2009. Affiliation networks. In STOC. ACM, 427--434.
[27]
Ian X. Y. Leung, Pan Hui, Pietro Lio', and Jon A. Crowcroft. 2008. Towards real-time community detection in large networks. Physical review. E, Statistical, nonlinear, and soft matter physics, Vol. 79 6 Pt 2 (2008), 066107.
[28]
Peiyan Li, Honglian Wang, Jianyun Lu, Qinli Yang, and Junming Shao. 2020. Community Detection with Local Metric Learning. (2020), 312--321.
[29]
Wenqing Lin. 2019. Distributed Algorithms for Fully Personalized PageRank on Large Graphs. In WWW. ACM, 1084--1094.
[30]
Wenqing Lin. 2021. Large-Scale Network Embedding in Apache Spark. In KDD. ACM, 3271--3279.
[31]
Wenqing Lin, Feng He, Faqiang Zhang, Xu Cheng, and Hongyun Cai. 2020. Initialization for Network Embedding: A Graph Partition Approach. In WSDM. ACM, 367--374.
[32]
Fanzhen Liu, Shan Xue, Jia Wu, Chuan Zhou, Wenbin Hu, Cécile Paris, Surya Nepal, Jian Yang, and Philip S. Yu. 2020. Deep Learning for Community Detection: Progress, Challenges and Opportunities. In IJCAI.
[33]
Jun Liu, Shuiwang Ji, and Jieping Ye. 2009. Multi-Task Feature Learning Via Efficient l2, 1-Norm Minimization. In UAI. 339--348.
[34]
Yue Liu, Jun Xia, Sihang Zhou, Siwei Wang, Xifeng Guo, Xihong Yang, Ke Liang, Wenxuan Tu, Stan Z. Li, and Xinwang Liu. 2022. A Survey of Deep Graph Clustering: Taxonomy, Challenge, and Application. (2022).
[35]
Yue Liu, Xihong Yang, Sihang Zhou, Xinwang Liu, Zhen Wang, Ke Liang, Wenxuan Tu, Liang Li, Jingcan Duan, and Cancan Chen. 2023. Hard Sample Aware Network for Contrastive Deep Graph Clustering. In AAAI. AAAI Press, 8914--8922.
[36]
Warren S. McCulloch and Walter Pitts. 1990. A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biology (1990), 99--115.
[37]
Miller McPherson, Lynn Smith-Lovin, and James M. Cook. 2001. Birds of a Feather: Homophily in Social Networks. Review of Sociology (2001), 415--444.
[38]
Namyong Park, Ryan A. Rossi, Eunyee Koh, Iftikhar Ahamath Burhanuddin, Sungchul Kim, Fan Du, Nesreen K. Ahmed, and Christos Faloutsos. 2022. CGC: Contrastive Graph Clustering for Community Detection and Tracking. CoRR, Vol. abs/2204.08504 (2022).
[39]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. (2017).
[40]
Zhihao Peng, Hui Liu, Yuheng Jia, and Junhui Hou. 2021. Attention-driven Graph Clustering Network. In ACM Multimedia. ACM, 935--943.
[41]
Chenyang Qiu, Zhaoci Huang, Wenzhe Xu, and Huijia Li. 2022. VGAER: graph neural network reconstruction based community detection. CoRR, Vol. abs/2201.04066 (2022).
[42]
Usha Nandini Raghavan, Réka Albert, et al. 2007. Near linear time algorithm to detect community structures in large-scale networks. Physical review. E, Statistical, nonlinear, and soft matter physics, Vol. 76 3 Pt 2 (2007), 036106.
[43]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In UAI, Jeff A. Bilmes and Andrew Y. Ng (Eds.).
[44]
Venu Satuluri, Yao Wu, Xun Zheng, Yilei Qian, Brian Wichers, Qieyun Dai, Gui Ming Tang, Jerry Jiang, and Jimmy Lin. 2020. SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter. In KDD. ACM, 3183--3193.
[45]
Erich Schubert, Jörg Sander, Martin Ester, Hans-Peter Kriegel, and Xiaowei Xu. 2017. DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN. ACM Trans. Database Syst., Vol. 42, 3 (2017), 19:1--19:21.
[46]
Prithviraj Sen, Galileo Namata, Mustafa Bilgic, Lise Getoor, et al. 2008. Collective Classification in Network Data. In The AI Magazine.
[47]
Oleksandr Shchur, Maximilian Mumme, Aleksandar Bojchevski, and Stephan Günnemann. 2018. Pitfalls of Graph Neural Network Evaluation. CoRR, Vol. abs/1811.05868 (2018).
[48]
Alexander Strehl and Joydeep Ghosh. 2002. Cluster Ensembles - A Knowledge Reuse Framework for Combining Multiple Partitions. J. Mach. Learn. Res. (2002).
[49]
Xing Su, Shan Xue, Fanzhen Liu, Jia Wu, Jian Yang, Chuan Zhou, et al. 2021. A Comprehensive Survey on Community Detection with Deep Learning. IEEE transactions on neural networks and learning systems, Vol. PP (2021).
[50]
Fei Tian, Bin Gao, Qing Cui, Enhong Chen, and Tie-Yan Liu. 2014. Learning Deep Representations for Graph Clustering. In AAAI. AAAI Press, 1293--1299.
[51]
Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research (2008).
[52]
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2017. Graph Attention Networks. CoRR, Vol. abs/1710.10903 (2017).
[53]
Chun Wang, Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Attributed Graph Clustering: A Deep Attentional Embedding Approach. In IJCAI. ijcai.org, 3670--3676.
[54]
Jing Wang and Ioannis Ch. Paschalidis. 2017. Botnet Detection Based on Anomaly and Community Detection. IEEE Trans. Control. Netw. Syst., Vol. 4, 2 (2017), 392--404.
[55]
Xin Xin, Chaokun Wang, Xiang Ying, and Bo-Hung Wang. 2017. Deep community detection in topologically incomplete networks. Physica A-statistical Mechanics and Its Applications, Vol. 469 (2017), 342--352.
[56]
Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y. Chang. 2015. Network Representation Learning with Rich Text Information. In IJCAI. AAAI Press, 2111--2117.
[57]
Xihong Yang, Yue Liu, Sihang Zhou, Siwei Wang, Wenxuan Tu, Qun Zheng, Xinwang Liu, Liming Fang, and En Zhu. 2023. Cluster-Guided Contrastive Graph Clustering Network. In AAAI. AAAI Press, 10834--10842.
[58]
Shiqi Zhang, Yiqian Huang, Jiachen Sun, Wenqing Lin, Xiaokui Xiao, and Bo Tang. 2023. Capacity Constrained Influence Maximization in Social Networks. In KDD. ACM, 3376--3385.
[59]
Shiqi Zhang, Jiachen Sun, Wenqing Lin, Xiaokui Xiao, Yiqian Huang, and Bo Tang. 2024. Information Diffusion Meets Invitation Mechanism. In WWW. ACM, 383--392.
[60]
Shiqi Zhang, Jiachen Sun, Wenqing Lin, Xiaokui Xiao, and Bo Tang. 2022. Measuring Friendship Closeness: A Perspective of Social Identity Theory. In CIKM. ACM, 3664--3673.
[61]
Tianqi Zhang, Yun Xiong, Jiawei Zhang, Yao Zhang, Yizhu Jiao, and Yangyong Zhu. 2020. CommDGI: Community Detection Oriented Deep Graph Infomax. In CIKM. ACM, 1843--1852.
[62]
Xingyi Zhang, Shuliang Xu, Wenqing Lin, and Sibo Wang. 2023. Constrained Social Community Recommendation. In KDD. ACM, 5586--5596.
[63]
Zhao Zhang, Fanzhang Li, Mingbo Zhao, Li Zhang, and Shuicheng Yan. 2017. Robust neighborhood preserving projection by nuclear/L2, 1-norm regularization for image feature extraction. IEEE Trans. Image Process., Vol. 26, 4 (2017).
[64]
Elena Zheleva, Hossam Sharara, and Lise Getoor. 2009. Co-evolution of social and affiliation networks. In KDD. ACM, 1007--1016.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2024
6901 pages
ISBN:9798400704901
DOI:10.1145/3637528
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. community detection
  2. graph neural networks
  3. social network
  4. unsupervised learning

Qualifiers

  • Research-article

Funding Sources

  • National Nature Science Foundation of China
  • Shanghai Municipal Science and Technology Major Project

Conference

KDD '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 191
    Total Downloads
  • Downloads (Last 12 months)191
  • Downloads (Last 6 weeks)40
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media