Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

Incomplete Network Alignment: Problem Definitions and Fast Solutions

Published: 30 May 2020 Publication History

Abstract

Networks are prevalent in many areas and are often collected from multiple sources. However, due to the veracity characteristics, more often than not, networks are incomplete. Network alignment and network completion have become two fundamental cornerstones behind a wealth of high-impact graph mining applications. The state-of-the-art have been addressing these two tasks in parallel. That is, most of the existing network alignment methods have implicitly assumed that the topology of the input networks for alignment are perfectly known a priori, whereas the existing network completion methods admit either a single network (i.e., matrix completion) or multiple aligned networks (e.g., tensor completion). In this article, we argue that network alignment and completion are inherently complementary with each other, and hence propose to jointly address them so that the two tasks can mutually benefit from each other. We formulate the problem from the optimization perspective, and propose an effective algorithm (iNeAt) to solve it. The proposed method offers two distinctive advantages. First (Alignment accuracy), our method benefits from the higher-quality input networks while mitigates the effect of the incorrectly inferred links introduced by the completion task itself. Second (Alignment efficiency), thanks to the low-rank structure of the complete networks and the alignment matrix, the alignment process can be significantly accelerated. We perform extensive experiments which show that (1) the network completion can significantly improve the alignment accuracy, i.e., up to 30% over the baseline methods; (2) the network alignment can in turn help recover more missing edges than the baseline methods; and (3) our method achieves a good balance between the running time and the accuracy, and scales with a provable linear complexity in both time and space.

References

[1]
Reid Andersen, Fan Chung, and Kevin Lang. 2006. Local graph partitioning using pagerank vectors. InProceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science, (FOCS’06). IEEE, 475--486.
[2]
Nicola Barbieri, Francesco Bonchi, and Giuseppe Manco. 2014. Who to follow and why: Link prediction with explanations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1266--1275.
[3]
Mohsen Bayati, David F. Gleich, Amin Saberi, and Ying Wang. 2013. Message-passing algorithms for sparse network alignment. ACM Transactions on Knowledge Discovery from Data 7, 1 (2013), 3.
[4]
Johannes Berg and Michael Lässig. 2004. Local graph alignment and motif search in biological networks. Proceedings of the National Academy of Sciences of the United States of America 101, 41 (2004), 14689--14694.
[5]
Nicolas Boumal and Pierre-antoine Absil. 2011. RTRMC: A riemannian trust-region method for low-rank matrix completion. In Proceedings of the Advances in Neural Information Processing Systems. 406--414.
[6]
Ulrik Brandes. 2008. On variants of shortest-path betweenness centrality and their generic computation. Social Networks 30, 2 (2008), 136--145.
[7]
Jian-Feng Cai, Emmanuel J. Candès, and Zuowei Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4 (2010), 1956--1982.
[8]
Zheng Chen, Xinli Yu, Bo Song, Jianliang Gao, Xiaohua Hu, and Wei-Shih Yang. 2017. Community-based network alignment for large attributed network. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. ACM, 587--596.
[9]
Chris Ding, Xiaofeng He, and Horst D. Simon. 2005. On the equivalence of nonnegative matrix factorization and spectral clustering. In Proceedings of the 2005 SIAM International Conference on Data Mining. SIAM, 606--610.
[10]
Boxin Du and Hanghang Tong. 2018. FASTEN: Fast sylvester equation solver for graph mining. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery 8 Data Mining. 1339--1347.
[11]
Boxin Du and Hanghang Tong. 2019. MrMine: Multi-resolution Multi-network embedding. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 479--488.
[12]
Mohammed El-Kebir, Jaap Heringa, and Gunnar W. Klau. 2015. Natalie 2.0: Sparse global network alignment as a special case of quadratic assignment. Algorithms 8, 4 (2015), 1035--1051.
[13]
Somaye Hashemifar and Jinbo Xu. 2014. Hubalign: An accurate and efficient method for global alignment of protein–protein interaction networks. Bioinformatics 30, 17 (2014), i438–i444.
[14]
Mark Heimann, Haoming Shen, and Danai Koutra. 2018. Node representation learning for multiple networks: The case of graph alignment. Arxiv Preprint Arxiv:1802.06257 (2018).
[15]
Xiangnan Kong, Jiawei Zhang, and Philip S. Yu. 2013. Inferring anchor links across multiple heterogeneous social networks. In Proceedings of the 22nd ACM International Conference on Information 8 Knowledge Management. ACM, 179--188.
[16]
Danai Koutra, Hanghang Tong, and David Lubensky. 2013. Big-align: Fast bipartite graph alignment. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining. IEEE, 389--398.
[17]
Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutsos, and Zoubin Ghahramani. 2010. Kronecker graphs: An approach to modeling networks. Journal of Machine Learning Research 11, Feb (2010), 985--1042.
[18]
Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. 2007. Graph evolution: Densification and shrinking diameters. ACM Transactions on Knowledge Discovery from Data 1, 1 (2007), 2.
[19]
Jure Leskovec and Julian J. Mcauley. 2012. Learning to discover social circles in ego networks. In Proceedings of the Advances in Neural Information Processing Systems. 539--547.
[20]
Chung-Shou Liao, Kanghao Lu, Michael Baym, Rohit Singh, and Bonnie Berger. 2009. IsoRankN: Spectral methods for global alignment of multiple protein networks. Bioinformatics 25, 12 (2009), i253–i258.
[21]
David Liben-Nowell and Jon Kleinberg. 2007. The link-prediction problem for social networks. Journal of the Association for Information Science and Technology 58, 7 (2007), 1019--1031.
[22]
Ji Liu, Przemyslaw Musialski, Peter Wonka, and Jieping Ye. 2013. Tensor completion for estimating missing values in visual data. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 1 (2013), 208--220.
[23]
Li Liu, William K. Cheung, Xin Li, and Lejian Liao. 2016. Aligning users across social networks using network embedding. In Proceedings of the 25th International Joint Conference on Artificial Intelligence. 1774--1780.
[24]
Yuanyuan Liu, Fanhua Shang, Hong Cheng, James Cheng, and Hanghang Tong. 2014. Factor matrix trace norm minimization for low-rank tensor completion. In Proceedings of the 2014 SIAM International Conference on Data Mining. SIAM, 866--874.
[25]
Noël Malod-Dognin and Nataša Pržulj. 2015. L-GRAAL: Lagrangian graphlet-based network aligner. Bioinformatics 31, 13 (2015), 2182--2189.
[26]
Hazel N. Manners, Ahed Elmsallati, Pietro H. Guzzi, Swarup Roy, and Jugal K. Kalita. 2017. Performing local network alignment by ensembling global aligners. In Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM’17). IEEE, 1316--1323.
[27]
Farzan Masrour, Iman Barjesteh, Rana Forsati, Abdol-Hossein Esfahanian, and Hayder Radha. 2015. Network completion with node similarity: A matrix completion approach with provable guarantees. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’15). IEEE, 302--307.
[28]
Aditya Krishna Menon and Charles Elkan. 2011. Link prediction via matrix factorization. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 437--452.
[29]
Kurt Miller, Michael I. Jordan, and Thomas L. Griffiths. 2009. Nonparametric latent feature models for link prediction. In Proceedings of the Advances in Neural Information Processing Systems. 1276--1284.
[30]
Marco Mina and Pietro Hiram Guzzi. 2014. Improving the robustness of local network alignment: Design and extensive assessment of a markov clustering-based approach. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11, 3 (2014), 561--572.
[31]
K. B. Petersen, M. S. Pedersen, and others. 2008. The matrix cookbook, vol 7. Technical University of Denmark 15 (2008).
[32]
Benjamin Recht. 2011. A simpler approach to matrix completion. Journal of Machine Learning Research 12, Dec (2011), 3413--3430.
[33]
Jasson D. M. Rennie and Nathan Srebro. 2005. Fast maximum margin matrix factorization for collaborative prediction. In Proceedings of the 22nd International Conference on Machine Learning. ACM, 713--719.
[34]
Rohit Singh, Jinbo Xu, and Bonnie Berger. 2008. Global alignment of multiple protein interaction networks with application to functional orthology detection. Proceedings of the National Academy of Sciences 105, 35 (2008), 12763--12768.
[35]
Sucheta Soundarajan, Tina Eliassi-Rad, Brian Gallagher, and Ali Pinar. 2016. MaxReach: Reducing network incompleteness through node probes. In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’16). IEEE, 152--157.
[36]
Kim-Chuan Toh and Sangwoon Yun. 2010. An accelerated proximal gradient algorithm for nuclear norm regularized linear least squares problems. Pacific Journal of Optimization 6, 615–640 (2010), 15.
[37]
Rianne van den Berg, Thomas N. Kipf, and Max Welling. 2017. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263.
[38]
Vipin Vijayan, Vikram Saraph, and T. Milenković. 2015. MAGNA++: Maximizing accuracy in global network alignment via both node and edge conservation. Bioinformatics 31, 14 (2015), 2409--2411.
[39]
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems 42, 1 (2015), 181--213.
[40]
Reza Zafarani and Huan Liu. 2013. Connecting users across social media sites: A behavioral-modeling approach. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 41--49.
[41]
Jiawei Zhang, Jianhui Chen, Shi Zhi, Yi Chang, S. Yu Philip, and Jiawei Han. 2017. Link prediction across aligned networks with sparse and low rank matrix estimation. In Proceedings of the 2017 IEEE 33rd International Conference on Data Engineering (ICDE’17). IEEE, 971--982.
[42]
Jiawei Zhang and S. Yu Philip. 2015. Multiple anonymized social networks alignment. In Proceedings of the 2015 IEEE International Conference onData Mining (ICDM’15). IEEE, 599--608.
[43]
Si Zhang and Hanghang Tong. 2016. Final: Fast attributed network alignment. In Proceedings of the 22th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM.
[44]
Si Zhang and Hanghang Tong. 2018. Attributed network alignment: Problem definitions and fast solutions. IEEE Transactions on Knowledge and Data Engineering 31, 9 (2018), 1680–1692.
[45]
Si Zhang, Hanghang Tong, Jiejun Xu, Yifan Hu, and Ross Maciejewski. 2019. Origin: Non-rigid network alignment. In Proceedings of the 2019 IEEE International Conference on Big Data. IEEE.
[46]
Yutao Zhang, Jie Tang, Zhilin Yang, Jian Pei, and Philip S Yu. 2015. Cosnet: Connecting heterogeneous social networks with local and global consistency. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1485--1494.

Cited By

View all
  • (2024)WL-Align: Weisfeiler-Lehman Relabeling for Aligning Users Across Networks via Regularized Representation LearningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327784336:1(445-458)Online publication date: Jan-2024
  • (2023)Everything Evolves in Personalized PageRankProceedings of the ACM Web Conference 202310.1145/3543507.3583474(3342-3352)Online publication date: 30-Apr-2023
  • (2023)PARROT: Position-Aware Regularized Optimal Transport for Network AlignmentProceedings of the ACM Web Conference 202310.1145/3543507.3583357(372-382)Online publication date: 30-Apr-2023
  • Show More Cited By

Index Terms

  1. Incomplete Network Alignment: Problem Definitions and Fast Solutions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 14, Issue 4
    August 2020
    316 pages
    ISSN:1556-4681
    EISSN:1556-472X
    DOI:10.1145/3403605
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 May 2020
    Online AM: 07 May 2020
    Accepted: 01 February 2020
    Revised: 01 November 2019
    Received: 01 May 2018
    Published in TKDD Volume 14, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Incomplete network alignment
    2. low rank
    3. network completion

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)176
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 18 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)WL-Align: Weisfeiler-Lehman Relabeling for Aligning Users Across Networks via Regularized Representation LearningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327784336:1(445-458)Online publication date: Jan-2024
    • (2023)Everything Evolves in Personalized PageRankProceedings of the ACM Web Conference 202310.1145/3543507.3583474(3342-3352)Online publication date: 30-Apr-2023
    • (2023)PARROT: Position-Aware Regularized Optimal Transport for Network AlignmentProceedings of the ACM Web Conference 202310.1145/3543507.3583357(372-382)Online publication date: 30-Apr-2023
    • (2023)Identifying Users Across Social Media Networks for Interpretable Fine-Grained Neighborhood Matching by Adaptive GATIEEE Transactions on Services Computing10.1109/TSC.2023.328887216:5(3453-3466)Online publication date: Sep-2023
    • (2023)Aligning Users across Social Networks via Integrating Structural Similarity and Graph Representation Learning2023 9th International Conference on Big Data and Information Analytics (BigDIA)10.1109/BigDIA60676.2023.10429339(549-556)Online publication date: 15-Dec-2023
    • (2022)Geometry interaction network alignmentNeurocomputing10.1016/j.neucom.2022.06.077501(618-628)Online publication date: Aug-2022
    • (2020)CANON: Complex Analytics of Network of Networks for Modeling Adversarial Activities2020 IEEE International Conference on Big Data (Big Data)10.1109/BigData50022.2020.9378258(1634-1643)Online publication date: 10-Dec-2020

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Get Access

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media