Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3583780.3614915acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Homophily-enhanced Structure Learning for Graph Clustering

Published: 21 October 2023 Publication History

Abstract

Graph clustering is a fundamental task in graph analysis, and recent advances in utilizing graph neural networks (GNNs) have shown impressive results. Despite the success of existing GNN-based graph clustering methods, they often overlook the quality of graph structure, which is inherent in real-world graphs due to their sparse and multifarious nature, leading to subpar performance. Graph structure learning allows refining the input graph by adding missing links and removing spurious connections. However, previous endeavors in graph structure learning have predominantly centered around supervised settings, and cannot be directly applied to our specific clustering tasks due to the absence of ground-truth labels. To bridge the gap, we propose a novel method called homophily-enhanced structure learning for graph clustering (HoLe). Our motivation stems from the observation that subtly enhancing the degree of homophily within the graph structure can significantly improve GNNs and clustering outcomes. To realize this objective, we develop two clustering-oriented structure learning modules, i.e., hierarchical correlation estimation and cluster-aware sparsification. The former module enables a more accurate estimation of pairwise node relationships by leveraging guidance from latent and clustering spaces, while the latter one generates a sparsified structure based on the similarity matrix and clustering assignments. Additionally, we devise a joint optimization approach alternating between training the homophily-enhanced structure learning and GNN-based clustering, thereby enforcing their reciprocal effects. Extensive experiments on seven benchmark datasets of various types and scales, across a range of clustering metrics, demonstrate the superiority of HoLe against state-of-the-art baselines.

Supplementary Material

MP4 File (0471-video.mp4)
In this video we present our paper "Homophily-enhanced Structure Learning for Graph Clustering (HoLe)". First we show that our proposal is motivated by the empirical observation that subtle enhancements to the degree of homophily within the graph structure can lead to significant improvements in GNN-based graph clustering. Then we present the HoLe Method in detail. Specifically, our approach incorporates hierarchical correlation estimation and cluster-aware sparsification modules to accurately estimate the relationships between pairs of nodes. These modules are guided by both the latent space and the clustering space, allowing us to generate a sparsified structure that enhances homophily. Finally, through joint optimization, we demonstrate the superiority of HoLe over state-of-the-art baselines on 7 benchmark datasets.

References

[1]
Imtiaz Ahmed, Travis Galoppo, Xia Hu, and Yu Ding. 2021. Graph regularized autoencoder and its application in unsupervised anomaly detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, 8 (2021), 4110--4124.
[2]
Sambaran Bandyopadhyay and Vishal Peter. 2021. Unsupervised constrained community detection via self-expressive graph neural network. In Uncertainty in Artificial Intelligence. PMLR, 1078--1088.
[3]
Deyu Bo, Xiao Wang, Chuan Shi, Meiqi Zhu, Emiao Lu, and Peng Cui. 2020. Structural deep clustering network. In Proceedings of The Web Conference 2020. 1400--1410.
[4]
Yu Chen, Lingfei Wu, and Mohammed Zaki. 2020. Iterative deep graph learning for graph neural networks: Better and robust node embeddings. Advances in neural information processing systems, Vol. 33 (2020), 19314--19326.
[5]
Michele Coscia and Frank MH Neffke. 2017. Network backboning with noisy data. In 2017 IEEE 33rd international conference on data engineering (ICDE). IEEE, 425--436.
[6]
Ganqu Cui, Jie Zhou, Cheng Yang, and Zhiyuan Liu. 2020. Adaptive graph encoder for attributed graph embedding. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 976--985.
[7]
Hande Dong, Jiawei Chen, Fuli Feng, Xiangnan He, Shuxian Bi, Zhaolin Ding, and Peng Cui. 2021. On the equivalence of decoupled graph convolution network and label propagation. In Proceedings of the Web Conference 2021. 3651--3662.
[8]
Bahare Fatemi, Layla El Asri, and Seyed Mehran Kazemi. 2021. SLAPS: Self-supervision improves structure learning for graph neural networks. Advances in Neural Information Processing Systems, Vol. 34 (2021), 22667--22681.
[9]
Chakib Fettal, Lazhar Labiod, and Mohamed Nadif. 2022. Efficient Graph Convolution for Joint Node Representation Learning and Clustering. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 289--297.
[10]
Luca Franceschi, Mathias Niepert, Massimiliano Pontil, and Xiao He. 2019. Learning discrete structures for graph neural networks. In International conference on machine learning. PMLR, 1972--1982.
[11]
Kun FU, Jinhui GAO, Xiaomeng ZHAO, and Jianing LI. 2022. Topology optimization based graph convolutional network combining with global structural information. Journal of Computer Applications, Vol. 42, 2 (2022), 357.
[12]
Guojun Gan, Chaoqun Ma, and Jianhong Wu. 2020. Data clustering: theory, algorithms, and applications. SIAM.
[13]
Lei Gong, Sihang Zhou, Xinwang Liu, and Wenxuan Tu. 2022. Attributed graph clustering with dual redundancy reduction. In Proc. of IJCAI.
[14]
Gabriele Grunig, Nedim Durmus, Yian Zhang, Yuting Lu, Sultan Pehlivan, Yuyan Wang, Kathleen Doo, Maria L Cotrina-Vidal, Roberta Goldring, Kenneth I Berger, et al. 2022. Molecular clustering analysis of blood biomarkers in world trade center exposed community members with persistent lower respiratory symptoms. International journal of environmental research and public health, Vol. 19, 13 (2022), 8102.
[15]
Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. Advances in neural information processing systems, Vol. 30 (2017).
[16]
Kaveh Hassani and Amir Hosein Khasahmadi. 2020. Contrastive multi-view representation learning on graphs. In International Conference on Machine Learning. PMLR, 4116--4126.
[17]
Wei Jin, Yao Ma, Xiaorui Liu, Xianfeng Tang, Suhang Wang, and Jiliang Tang. 2020. Graph structure learning for robust graph neural networks. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining. 66--74.
[18]
P Kingma Diederik and Jimmy Ba Adam. 2014. A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[19]
Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[20]
Solomon Kullback and Richard A Leibler. 1951. On information and sufficiency. The annals of mathematical statistics, Vol. 22, 1 (1951), 79--86.
[21]
ACM Digital Library. 2023 [Online]. https://dl.acm.org/. Accessed: 2023-01--16.
[22]
Yixin Liu, Ming Jin, Shirui Pan, Chuan Zhou, Yu Zheng, Feng Xia, and Philip Yu. 2022a. Graph self-supervised learning: A survey. IEEE Transactions on Knowledge and Data Engineering (2022).
[23]
Yixin Liu, Zhao Li, Shirui Pan, Chen Gong, Chuan Zhou, and George Karypis. 2021. Anomaly Detection on Attributed Networks via Contrastive Self-Supervised Learning. IEEE Transactions on Neural Networks and Learning Systems (2021).
[24]
Yue Liu, Wenxuan Tu, Sihang Zhou, Xinwang Liu, Linxuan Song, Xihong Yang, and En Zhu. 2022b. Deep Graph Clustering via Dual Correlation Reduction. In Proc. of AAAI.
[25]
Yixin Liu, Yu Zheng, Daokun Zhang, Hongxu Chen, Hao Peng, and Shirui Pan. 2022c. Towards unsupervised deep graph structure learning. In Proceedings of the ACM Web Conference 2022. 1392--1403.
[26]
Stuart Lloyd. 1982. Least squares quantization in PCM. IEEE transactions on information theory, Vol. 28, 2 (1982), 129--137.
[27]
Zhen Peng, Wenbing Huang, Minnan Luo, Qinghua Zheng, Yu Rong, Tingyang Xu, and Junzhou Huang. 2020. Graph representation learning via graphical mutual information maximization. In Proceedings of The Web Conference 2020. 259--270.
[28]
Zhihao Peng, Hui Liu, Yuheng Jia, and Junhui Hou. 2021. Attention-driven graph clustering network. In Proceedings of the 29th ACM International Conference on Multimedia. 935--943.
[29]
Satu Elisa Schaeffer. 2007. Graph clustering. Computer science review, Vol. 1, 1 (2007), 27--64.
[30]
Prithviraj Sen, Galileo Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi-Rad. 2008. Collective classification in network data. AI magazine, Vol. 29, 3 (2008), 93--93.
[31]
Wenxuan Tu, Sihang Zhou, Xinwang Liu, Xifeng Guo, Zhiping Cai, En Zhu, and Jieren Cheng. 2021. Deep fusion clustering network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 9978--9987.
[32]
Laurens Van Der Maaten. 2014. Accelerating t-SNE using tree-based algorithms. The journal of machine learning research, Vol. 15, 1 (2014), 3221--3245.
[33]
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, Yoshua Bengio, et al. 2017. Graph attention networks. stat, Vol. 1050, 20 (2017), 10--48550.
[34]
Petar Velickovic, William Fedus, William L Hamilton, Pietro Liò, Yoshua Bengio, and R Devon Hjelm. 2019. Deep Graph Infomax. ICLR (Poster), Vol. 2, 3 (2019), 4.
[35]
Chun Wang, Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019a. Attributed graph clustering: A deep attentional embedding approach. arXiv preprint arXiv:1906.06532 (2019).
[36]
Junshan Wang, Ziyao Li, Qingqing Long, Weiyu Zhang, Guojie Song, and Chuan Shi. 2020. Learning node representations from noisy graph structures. In 2020 IEEE international conference on data mining (ICDM). IEEE, 1310--1315.
[37]
Ruijia Wang, Shuai Mou, Xiao Wang, Wanpeng Xiao, Qi Ju, Chuan Shi, and Xing Xie. 2021. Graph structure estimation neural networks. In Proceedings of the Web Conference 2021. 342--353.
[38]
Zhongdao Wang, Liang Zheng, Yali Li, and Shengjin Wang. 2019b. Linkage based face clustering via graph convolution network. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1117--1125.
[39]
Felix Wu, Amauri Souza, Tianyi Zhang, Christopher Fifty, Tao Yu, and Kilian Weinberger. 2019. Simplifying graph convolutional networks. In International conference on machine learning. PMLR, 6861--6871.
[40]
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, Vol. 32, 1 (2020), 4--24.
[41]
Junyuan Xie, Ross Girshick, and Ali Farhadi. 2016. Unsupervised deep embedding for clustering analysis. In International conference on machine learning. PMLR, 478--487.
[42]
Liu Yue, Xia Jun, Zhou Sihang, Wang Siwei, Guo Xifeng, Yang Xihong, Liang Ke, Tu Wenxuan, Liu Xin Wang, et al. 2022. A survey of deep graph clustering: Taxonomy, challenge, and application. arXiv preprint arXiv:2211.12875 (2022).
[43]
Xiaotong Zhang, Han Liu, Qimai Li, and Xiao-Ming Wu. 2019. Attributed graph clustering via adaptive graph convolution. arXiv preprint arXiv:1906.01210 (2019).
[44]
Xiaotong Zhang, Han Liu, Xiao-Ming Wu, Xianchao Zhang, and Xinyue Liu. 2021. Spectral embedding network for attributed graph clustering. Neural Networks, Vol. 142 (2021), 388--396.
[45]
Zhen Zhang, Hongxia Yang, Jiajun Bu, Sheng Zhou, Pinggang Yu, Jianwei Zhang, Martin Ester, and Can Wang. 2018. ANRL: attributed network representation learning via deep neural networks. In Ijcai, Vol. 18. 3155--3161.
[46]
Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. 2020a. Graph neural networks: A review of methods and applications. AI open, Vol. 1 (2020), 57--81.
[47]
Sheng Zhou, Xin Wang, Jiajun Bu, Martin Ester, Pinggang Yu, Jiawei Chen, Qihao Shi, and Can Wang. 2020b. DGE: Deep generative network embedding based on commonality and individuality. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 6949--6956.
[48]
Sheng Zhou, Hongjia Xu, Zhuonan Zheng, Jiawei Chen, Jiajun Bu, Jia Wu, Xin Wang, Wenwu Zhu, Martin Ester, et al. 2022. A comprehensive survey on deep clustering: Taxonomy, challenges, and future directions. arXiv preprint arXiv:2206.07579 (2022).
[49]
Zhiyao Zhou, Sheng Zhou, Bochao Mao, Xuanyi Zhou, Jiawei Chen, Qiaoyu Tan, Daochen Zha, Can Wang, Yan Feng, and Chun Chen. 2023. OpenGSL: A Comprehensive Benchmark for Graph Structure Learning. arXiv preprint arXiv:2306.10280 (2023).
[50]
Jiong Zhu, Yujun Yan, Lingxiao Zhao, Mark Heimann, Leman Akoglu, and Danai Koutra. 2020. Beyond homophily in graph neural networks: Current limitations and effective designs. Advances in Neural Information Processing Systems, Vol. 33 (2020), 7793--7804.
[51]
Yanqiao Zhu, Weizhi Xu, Jinghao Zhang, Qiang Liu, Shu Wu, and Liang Wang. 2021. Deep graph structure learning for robust representations: A survey. arXiv preprint arXiv:2103.03036 (2021).

Cited By

View all
  • (2025)Enhancing federated learning-based social recommendations with graph attention networksNeurocomputing10.1016/j.neucom.2024.129045617(129045)Online publication date: Feb-2025
  • (2024)ACDM: An Effective and Scalable Active Clustering with Pairwise ConstraintProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679601(643-652)Online publication date: 21-Oct-2024
  • (2024)DSLR: Diversity Enhancement and Structure Learning for Rehearsal-based Graph Continual LearningProceedings of the ACM Web Conference 202410.1145/3589334.3645561(733-744)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
October 2023
5508 pages
ISBN:9798400701245
DOI:10.1145/3583780
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph clustering
  2. graph neural networks
  3. graphs structure learning

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)165
  • Downloads (Last 6 weeks)15
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Enhancing federated learning-based social recommendations with graph attention networksNeurocomputing10.1016/j.neucom.2024.129045617(129045)Online publication date: Feb-2025
  • (2024)ACDM: An Effective and Scalable Active Clustering with Pairwise ConstraintProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679601(643-652)Online publication date: 21-Oct-2024
  • (2024)DSLR: Diversity Enhancement and Structure Learning for Rehearsal-based Graph Continual LearningProceedings of the ACM Web Conference 202410.1145/3589334.3645561(733-744)Online publication date: 13-May-2024
  • (2024)Denoised Graph Collaborative Filtering via Neighborhood Similarity and Dynamic ThresholdingIEEE Transactions on Big Data10.1109/TBDATA.2024.345376510:6(683-693)Online publication date: Dec-2024
  • (2024)RAIN: Reconstructed-aware in-context enhancement with graph denoising for session-based recommendationNeural Networks10.1016/j.neunet.2024.107056(107056)Online publication date: Dec-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media