Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3488560.3498467acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article
Public Access

GAGE: Geometry Preserving Attributed Graph Embeddings

Published: 15 February 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Node embedding is the task of extracting concise and informative representations of certain entities that are connected in a network. Various real-world networks include information about both node connectivity and certain node attributes, in the form of features or time-series data. Modern representation learning techniques employ both the connectivity and attribute information of the nodes to produce embeddings in an unsupervised manner. In this context, deriving embeddings that preserve the geometry of the network and the attribute vectors would be highly desirable, as they would reflect both the topological neighborhood structure and proximity in feature space. While this is fairly straightforward to maintain when only observing the connectivity or attribute information of the network, preserving the geometry of both types of information is challenging. A novel tensor factorization approach for node embedding in attributed networks is proposed in this paper, that preserves the distances of both the connections and the attributes. Furthermore, an effective and lightweight algorithm is developed to tackle the learning task and judicious experiments with multiple state-of-the-art baselines suggest that the proposed algorithm offers significant performance improvements in downstream tasks.

    Supplementary Material

    MP4 File (WSDM22-fp500.mp4)
    This video presents the paper titled "GAGE: Geometry Preserving Attributed Graph Embeddings". The objective of the video is twofold: First, to present the problem of node embedding in attributed graphs and discuss it's importance. Second, to propose GAGE, a novel node embedding approach that preserves the geometry of the network. In particular, GAGE leverages ideas from tensors and multi-dimensional scaling and produces low dimensional representations that provably respect the proximities associated with both the graph and the attributes. In addition, the proposed algorithm is scalable, can work with missing connectivity and attribute information, and produces embeddings that are unique and permutation invariant. The proposed GAGE algorithm shows state-of-the-art performance in the task of node classification (which is presented in the video) and link prediction.

    References

    [1]
    Amr Ahmed, Nino Shervashidze, Shravan Narayanamurthy, Vanja Josifovski, and Alexander J Smola. 2013. Distributed large-scale natural graph factorization. In Proceedings of the 22nd international conference on World Wide Web. 37--48.
    [2]
    Saba A Al-Sayouri, Ekta Gujral, Danai Koutra, Evangelos E Papalexakis, and Sarah S Lam. 2018. t-PNE: tensor-based predictable node embeddings. In 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, 491--494.
    [3]
    Albert-László Barabási et al. 2016. Network science. Cambridge university press.
    [4]
    Dimitris Berberidis and Georgios B Giannakis. 2019. Node embedding with adaptive similarities for scalable learning over graphs. IEEE Transactions on Knowledge and Data Engineering (2019).
    [5]
    Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. Grarep: Learning graph representations with global structural information. In Proceedings of the 24th ACM international on conference on information and knowledge management . 891--900.
    [6]
    Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2016. Deep neural networks for learning graph representations. In AAAI, Vol. 16. 1145--1152.
    [7]
    J Douglas Carroll and Jih-Jie Chang. 1970. Analysis of individual differences in multidimensional scaling via an N-way generalization of "Eckart-Young" decomposition. Psychometrika, Vol. 35, 3 (1970), 283--319.
    [8]
    Luca Chiantini and Giorgio Ottaviani. 2012. On generic identifiability of 3-tensors of small rank. SIAM J. Matrix Anal. Appl., Vol. 33, 3 (2012), 1018--1037.
    [9]
    Michael AA Cox and Trevor F Cox. 2008. Multidimensional scaling. In Handbook of data visualization . Springer, 315--347.
    [10]
    Ganqu Cui, Jie Zhou, Cheng Yang, and Zhiyuan Liu. 2020. Adaptive Graph Encoder for Attributed Graph Embedding. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . 976--985.
    [11]
    Ignat Domanov and Lieven De Lathauwer. 2014. Canonical polyadic decomposition of third-order tensors: reduction to generalized eigenvalue decomposition. SIAM J. Matrix Anal. Appl., Vol. 35, 2 (2014), 636--660.
    [12]
    David Easley, Jon Kleinberg, et al. 2010. Networks, crowds, and markets . Vol. 8. Cambridge university press Cambridge.
    [13]
    Hongchang Gao and Heng Huang. 2018. Deep attributed network embedding. In Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI)) .
    [14]
    Lise Getoor. 2005. Link-based classification. In Advanced methods for knowledge discovery from complex data. Springer, 189--207.
    [15]
    GH Golub and CF Van Loan. 2013. Matrix Computations 4th Edition The Johns Hopkins University Press. Baltimore, MD (2013).
    [16]
    Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining . 855--864.
    [17]
    Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in neural information processing systems. 1024--1034.
    [18]
    Richard A Harshman, Margaret E Lundy, et al. 1994. PARAFAC: Parallel factor analysis. Computational Statistics and Data Analysis, Vol. 18, 1 (1994), 39--72.
    [19]
    Xiao Huang, Jundong Li, and Xia Hu. 2017. Label informed attributed network embedding. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 731--739.
    [20]
    Charilaos I Kanatsoulis and Nicholas D Sidiropoulos. 2021. TeX-Graph: Coupled tensor-matrix knowledge-graph embedding for COVID-19 drug repurposing. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM). SIAM, 603--611.
    [21]
    Thomas N Kipf and Max Welling. 2016a. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
    [22]
    Thomas N Kipf and Max Welling. 2016b. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016).
    [23]
    Tamara G Kolda and Brett W Bader. 2009. Tensor decompositions and applications. SIAM review, Vol. 51, 3 (2009), 455--500.
    [24]
    Joseph B Kruskal. 1978. Multidimensional scaling . Number 11. Sage.
    [25]
    Mark Newman. 2018. Networks .Oxford university press.
    [26]
    Mingdong Ou, Peng Cui, Jian Pei, Ziwei Zhang, and Wenwu Zhu. 2016. Asymmetric transitivity preserving graph embedding. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining . 1105--1114.
    [27]
    Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining . 701--710.
    [28]
    Leif E Peterson. 2009. K-nearest neighbor. Scholarpedia, Vol. 4, 2 (2009), 1883.
    [29]
    Jiezhong Qiu, Yuxiao Dong, Hao Ma, Jian Li, Kuansan Wang, and Jie Tang. 2018. Network embedding as matrix factorization: Unifying deepwalk, line, pte, and node2vec. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining . 459--467.
    [30]
    Eugenio Sanchez and Bruce R Kowalski. 1990. Tensorial resolution: a direct trilinear decomposition. Journal of Chemometrics, Vol. 4, 1 (1990), 29--45.
    [31]
    Susan S Schiffman, M Lance Reynolds, and Forrest W Young. 1981. Introduction to multidimensional scaling .Academic press New York.
    [32]
    Blake Shaw and Tony Jebara. 2009. Structure preserving embedding. In Proceedings of the 26th Annual International Conference on Machine Learning. 937--944.
    [33]
    Nicholas D Sidiropoulos, Lieven De Lathauwer, Xiao Fu, Kejun Huang, Evangelos E Papalexakis, and Christos Faloutsos. 2017. Tensor decomposition for signal processing and machine learning. IEEE Transactions on Signal Processing, Vol. 65, 13 (2017), 3551--3582.
    [34]
    Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th international conference on world wide web. 1067--1077.
    [35]
    Warren S Torgerson. 1952. Multidimensional scaling: I. Theory and method. Psychometrika, Vol. 17, 4 (1952), 401--419.
    [36]
    Anton Tsitsulin, Davide Mottin, Panagiotis Karras, and Emmanuel Müller. 2018. Verse: Versatile graph embeddings from similarity measures. In Proceedings of the 2018 World Wide Web Conference. 539--548.
    [37]
    Petar Velickovic, William Fedus, William L Hamilton, Pietro Liò, Yoshua Bengio, and R Devon Hjelm. 2019. Deep Graph Infomax. ICLR (Poster), Vol. 2, 3 (2019), 4.
    [38]
    Daixin Wang, Peng Cui, and Wenwu Zhu. 2016. Structural deep network embedding. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining . 1225--1234.
    [39]
    Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y Chang. 2015. Network representation learning with rich text information. In Proceedings of the 24th International Conference on Artificial Intelligence . 2111--2117.

    Cited By

    View all
    • (2024)Community detection in attributed social networks using deep learningThe Journal of Supercomputing10.1007/s11227-024-06436-8Online publication date: 16-Aug-2024
    • (2023)PANE: scalable and effective attributed network embeddingThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00790-432:6(1237-1262)Online publication date: 24-Mar-2023

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN:9781450391320
    DOI:10.1145/3488560
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. embedding
    2. graphs
    3. multi dimensional scaling
    4. networks
    5. representation learning
    6. tensors

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)51
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 12 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Community detection in attributed social networks using deep learningThe Journal of Supercomputing10.1007/s11227-024-06436-8Online publication date: 16-Aug-2024
    • (2023)PANE: scalable and effective attributed network embeddingThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00790-432:6(1237-1262)Online publication date: 24-Mar-2023

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media