Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

Reinforcement Learning–based Collective Entity Alignment with Adaptive Features

Published: 05 May 2021 Publication History

Abstract

Entity alignment (EA) is the task of identifying the entities that refer to the same real-world object but are located in different knowledge graphs (KGs). For entities to be aligned, existing EA solutions treat them separately and generate alignment results as ranked lists of entities on the other side. Nevertheless, this decision-making paradigm fails to take into account the interdependence among entities. Although some recent efforts mitigate this issue by imposing the 1-to-1 constraint on the alignment process, they still cannot adequately model the underlying interdependence and the results tend to be sub-optimal.
To fill in this gap, in this work, we delve into the dynamics of the decision-making process, and offer a reinforcement learning (RL)–based model to align entities collectively. Under the RL framework, we devise the coherence and exclusiveness constraints to characterize the interdependence and restrict collective alignment. Additionally, to generate more precise inputs to the RL framework, we employ representative features to capture different aspects of the similarity between entities in heterogeneous KGs, which are integrated by an adaptive feature fusion strategy. Our proposal is evaluated on both cross-lingual and mono-lingual EA benchmarks and compared against state-of-the-art solutions. The empirical results verify its effectiveness and superiority.

References

[1]
Yasser Altowim, Dmitri V. Kalashnikov, and Sharad Mehrotra. 2014. Progressive approach to relational entity resolution. Proc. Endow. Very Large Data Base 7, 11 (2014), 999–1010.
[2]
Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. 2007. DBpedia: A nucleus for a web of open data. In Proceedings of ISWC. 722–735.
[3]
Indrajit Bhattacharya and Lise Getoor. 2006. A latent dirichlet model for unsupervised entity resolution. In Proceedings of ICDM. 47–58.
[4]
Indrajit Bhattacharya and Lise Getoor. 2007. Collective entity resolution in relational data. Trans. Knowl. Discov. Data 1, 1 (2007), 5.
[5]
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5 (2017), 135–146.
[6]
Kurt D. Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A collaboratively created graph database for structuring human knowledge. In Proceedings of SIGMOD, Jason Tsong-Li Wang (Ed.). ACM, 1247–1250.
[7]
Antoine Bordes, Nicolas Usunier, Alberto García-Durán, Jason Weston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In Proceedings of NIPS. 2787–2795.
[8]
J. Roger Bray and John T. Curtis. 1957. An ordination of the upland forest communities of southern Wisconsin. Ecol. Monogr. 27, 4 (1957), 325–349.
[9]
Christopher J. C. Burges. 2010. From RankNet to LambdaRank to LambdaMART: An Overview. Technical Report. Microsoft Research. Retrieved from http://research.microsoft.com/en-us/um/people/cburges/tech_reports/MSR-TR-2010-82.pdf.
[10]
Yixin Cao, Zhiyuan Liu, Chengjiang Li, Zhiyuan Liu, Juanzi Li, and Tat-Seng Chua. 2019. Multi-channel graph neural network for entity alignment. In Proceedings of ACL. 1452–1461.
[11]
Yixin Cao, Xiang Wang, Xiangnan He, Zikun Hu, and Tat-Seng Chua. 2019. Unifying knowledge graph learning and recommendation: Towards a better understanding of user preferences. In Proceedings of WWW. 151–161.
[12]
Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, and Carlo Zaniolo. 2018. Co-training Embeddings of knowledge graphs and entity descriptions for cross-lingual entity alignment. In Proceedings of IJCAI. 3998–4004.
[13]
Muhao Chen, Yingtao Tian, Mohan Yang, and Carlo Zaniolo. 2017. Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. In Proceedings of IJCAI. 1511–1517.
[14]
Peter Christen. 2012. A survey of indexing techniques for scalable record linkage and deduplication. IEEE Trans. Knowl. Data Eng. 24, 9 (2012), 1537–1555.
[15]
Kevin Clark and Christopher D. Manning. 2016. Deep reinforcement learning for mention-ranking coreference models. In Proceedings of EMNLP. 2256–2262.
[16]
Sanjib Das, Paul Suganthan G. C., AnHai Doan, Jeffrey F. Naughton, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra, and Youngchoon Park. 2017. Falcon: Scaling up hands-off crowdsourced entity matching to build cloud services. In Proceedings of SIGMOD. 1431–1446.
[17]
Jack Doerner, David Evans, and Abhi Shelat. 2016. Secure stable matching at scale. In Proceedings of SIGSAC. 1602–1613.
[18]
Zheng Fang, Yanan Cao, Qian Li, Dongjie Zhang, Zhenyu Zhang, and Yanbing Liu. 2019. Joint entity linking with deep reinforcement learning. In Proceedings of WWW. 438–447.
[19]
Jun Feng, Minlie Huang, Li Zhao, Yang Yang, and Xiaoyan Zhu. 2018. Reinforcement learning for relation classification from noisy data. In Proceedings of AAAI, IAAI, and EAAI. 5779–5786.
[20]
Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. End-to-end multi-perspective matching for entity resolution. In Proceedings of IJCAI. 4961–4967.
[21]
David Gale and Lloyd S. Shapley. 1962. College admissions and the stability of marriage. Amer. Math. Month. 69, 1 (1962), 9–15.
[22]
Marko Gulic, Boris Vrdoljak, and Marko Banek. 2016. CroMatcher: An ontology matching system based on automated weighted aggregation and iterative final alignment. J. Web Semant. 41 (2016), 50–71.
[23]
Lingbing Guo, Zequn Sun, and Wei Hu. 2019. Learning to exploit long-term relational dependencies in knowledge graphs. In Proceedings of ICML. 2505–2514.
[24]
Ben Hixon, Peter Clark, and Hannaneh Hajishirzi. 2015. Learning knowledge graphs for question answering through conversational dialog. In Proceedings of NAACL-HLT. 851–861.
[25]
Thomas N. Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In Proceedings of ICLR.
[26]
Pigi Kouki, Jay Pujara, Christopher Marcum, Laura M. Koehly, and Lise Getoor. 2019. Collective entity resolution in multi-relational familial networks. Knowl. Info. Syst. 61, 3 (2019), 1547–1581.
[27]
Harold W. Kuhn. 1955. The hungarian method for the assignment problem. Naval Res. Logist. Quart. 2, 1--2 (1955), 83–97.
[28]
Simon Lacoste-Julien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, and Zoubin Ghahramani. 2013. SIGMa: Simple greedy matching for aligning large knowledge bases. In Proceedings of KDD. 572–580.
[29]
Vladimir I. Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet Phys. Doklady, Vol. 10. 707–710.
[30]
Chengjiang Li, Yixin Cao, Lei Hou, Jiaxin Shi, Juanzi Li, and Tat-Seng Chua. 2019. Semi-supervised entity alignment via joint knowledge embedding model and cross-graph model. In Proceedings of EMNLP-IJCNLP. 2723–2732.
[31]
Xin Mao, Wenting Wang, Huimin Xu, Man Lan, and Yuanbin Wu. 2020. MRAEA: An efficient and robust entity alignment approach for cross-lingual knowledge graph. In Proceedings of WSDM. 420–428.
[32]
Andrew McCallum and Ben Wellner. 2004. Conditional models of identity uncertainty with application to noun coreference. In Proceedings of NIPS. 905–912.
[33]
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In Proceedings of ICML. 1928–1937.
[34]
Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, and Vijay Raghavendra. 2018. Deep learning for entity matching: A design space exploration. In Proceedings of SIGMOD. 19–34.
[35]
Hao Nie, Xianpei Han, Ben He, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. Deep sequence-to-sequence entity matching for heterogeneous entity resolution. In Proceedings of CIKM. 629–638.
[36]
Ning Pang, Weixin Zeng, Jiuyang Tang, Zhen Tan, and Xiang Zhao. 2019. Iterative entity alignment with improved neural attribute embedding. In Proceedings of DL4KG@ESWC. 41–46.
[37]
Heiko Paulheim. 2017. Knowledge graph refinement: A survey of approaches and evaluation methods. Semant. Web 8, 3 (2017), 489–508.
[38]
Shichao Pei, Lu Yu, Robert Hoehndorf, and Xiangliang Zhang. 2019. Semi-supervised entity alignment via knowledge graph embedding with awareness of degree difference. In Proceedings of WWW. 3130–3136.
[39]
Alvin E. Roth. 2008. Deferred acceptance algorithms: History, theory, practice, and open questions. Int. J. Game Theory 36, 3--4 (2008), 537–569.
[40]
Wei Shen, Jianyong Wang, and Jiawei Han. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27, 2 (2015), 443–460.
[41]
Parag Singla and Pedro M. Domingos. 2006. Entity resolution with markov logic. In Proceedings of the ICDM. 572–582.
[42]
Fabian M. Suchanek, Serge Abiteboul, and Pierre Senellart. 2011. PARIS: Probabilistic alignment of relations, instances, and schema. Proc. Endow. Very Large Data Base 5, 3 (2011), 157–168.
[43]
Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. Yago: A core of semantic knowledge. In Proceedings of WWW. 697–706.
[44]
Zequn Sun, Wei Hu, and Chengkai Li. 2017. Cross-lingual entity alignment via joint attribute-preserving embedding. In Proceedings of ISWC. 628–644.
[45]
Zequn Sun, Wei Hu, Qingheng Zhang, and Yuzhong Qu. 2018. Bootstrapping entity alignment with knowledge graph embedding. In Proceedings of IJCAI. 4396–4402.
[46]
Zequn Sun, JiaCheng Huang, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. TransEdge: Translating relation-contextualized embeddings for knowledge graphs. In Proceedings of ISWC. 612–629.
[47]
Zequn Sun, Chengming Wang, Wei Hu, Muhao Chen, Jian Dai, Wei Zhang, and Yuzhong Qu. 2020. Knowledge graph alignment network with gated multi-hop neighborhood aggregation. In Proceedings of EAAI. 222–229.
[48]
Bayu Distiawan Trisedya, Jianzhong Qi, and Rui Zhang. 2019. Entity alignment between knowledge graphs using attribute embeddings. In Proceedings of AAAI. 297–304.
[49]
Denny Vrandecic and Markus Krötzsch. 2014. Wikidata: A free collaborative knowledgebase. Commun. ACM 57, 10 (2014), 78–85.
[50]
Zhichun Wang, Qingsong Lv, Xiaohan Lan, and Yu Zhang. 2018. Cross-lingual knowledge graph alignment via graph convolutional networks. In Proceedings of EMNLP. 349–357.
[51]
Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8 (1992), 229–256.
[52]
Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, and Dongyan Zhao. 2019. Relation-aware entity alignment for heterogeneous knowledge graphs. In Proceedings of IJCAI. 5278–5284.
[53]
Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, and Dongyan Zhao. 2019. Jointly learning entity and relation representations for entity alignment. In Proceedings of EMNLP-IJCNLP. 240–249.
[54]
Chenyan Xiong, Russell Power, and Jamie Callan. 2017. Explicit semantic ranking for academic search via knowledge graph embedding. In Proceedings of WWW. 1271–1279.
[55]
Kun Xu, Linfeng Song, Yansong Feng, Yan Song, and Dong Yu. 2020. Coordinated reasoning for cross-lingual knowledge graph alignment. In Proceedings of AAAI. 9354–9361.
[56]
Kun Xu, Liwei Wang, Mo Yu, Yansong Feng, Yan Song, Zhiguo Wang, and Dong Yu. 2019. Cross-lingual knowledge graph alignment via graph matching neural network. In Proceedings of ACL. 3156–3161.
[57]
Hsiu-Wei Yang, Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, and Xu Sun. 2019. Aligning cross-lingual entities with multi-aspect information. In Proceedings of EMNLP-IJCNLP. 4430–4440.
[58]
Weixin Zeng, Xiang Zhao, Jiuyang Tang, and Xuemin Lin. 2020. Collective entity alignment via adaptive features. In Proceedings of ICDE. IEEE, 1870–1873.
[59]
Weixin Zeng, Xiang Zhao, Wei Wang, Jiuyang Tang, and Zhen Tan. 2020. Degree-aware alignment for entities in tail. In Proceedings of SIGIR. ACM, 811–820.
[60]
Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. Multi-view knowledge graph embedding for entity alignment. In Proceedings of IJCAI. 5429–5435.
[61]
Xiang Zhao, Weixin Zeng, Jiuyang Tang, Wei Wang, and Fabian Suchanek. 2020. An experimental study of state-of-the-art entity alignment approaches. IEEE Trans. Knowl. Data Eng. (2020), 1–1. https://ieeexplore.ieee.org/document/9174835.
[62]
Hao Zhu, Ruobing Xie, Zhiyuan Liu, and Maosong Sun. 2017. Iterative entity alignment via joint knowledge embeddings. In Proceedings of IJCAI. 4258–4264.
[63]
Qiannan Zhu, Xiaofei Zhou, Jia Wu, Jianlong Tan, and Li Guo. 2019. Neighborhood-aware attentional representation for multilingual knowledge graphs. In Proceedings of IJCAI. 1943–1949.

Cited By

View all
  • (2025)CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation ModelsACM Transactions on Information Systems10.1145/3713072Online publication date: 21-Jan-2025
  • (2025)Active in-context learning for cross-domain entity resolutionInformation Fusion10.1016/j.inffus.2024.102816117(102816)Online publication date: May-2025
  • (2024)Detect-Then-Resolve: Enhancing Knowledge Graph Conflict Resolution with Large Language ModelMathematics10.3390/math1215231812:15(2318)Online publication date: 24-Jul-2024
  • Show More Cited By

Index Terms

  1. Reinforcement Learning–based Collective Entity Alignment with Adaptive Features

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Information Systems
      ACM Transactions on Information Systems  Volume 39, Issue 3
      July 2021
      432 pages
      ISSN:1046-8188
      EISSN:1558-2868
      DOI:10.1145/3450607
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 05 May 2021
      Accepted: 01 December 2020
      Revised: 01 November 2020
      Received: 01 June 2020
      Published in TOIS Volume 39, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Entity alignment
      2. reinforcement learning
      3. adaptive feature fusion

      Qualifiers

      • Research-article
      • Refereed

      Funding Sources

      • Ministry of Science and Technology of China
      • NSFC
      • NSF of Hunan Province
      • The Science and Technology Innovation Program of Hunan Province
      • Postgraduate Scientific Research Innovation Project of Hunan Province

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)595
      • Downloads (Last 6 weeks)51
      Reflects downloads up to 08 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation ModelsACM Transactions on Information Systems10.1145/3713072Online publication date: 21-Jan-2025
      • (2025)Active in-context learning for cross-domain entity resolutionInformation Fusion10.1016/j.inffus.2024.102816117(102816)Online publication date: May-2025
      • (2024)Detect-Then-Resolve: Enhancing Knowledge Graph Conflict Resolution with Large Language ModelMathematics10.3390/math1215231812:15(2318)Online publication date: 24-Jul-2024
      • (2024)Temporal Knowledge Graph Reasoning Based on Entity Relationship Similarity PerceptionElectronics10.3390/electronics1312241713:12(2417)Online publication date: 20-Jun-2024
      • (2024)Commonsense-Guided Inductive Relation Prediction with Dual Attention MechanismApplied Sciences10.3390/app1405204414:5(2044)Online publication date: 29-Feb-2024
      • (2024)Pseudo-label calibration semi-supervised multi-modal entity alignmentProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28762(9116-9124)Online publication date: 20-Feb-2024
      • (2024)XLORE 3: A Large-Scale Multilingual Knowledge Graph from Heterogeneous Wiki Knowledge ResourcesACM Transactions on Information Systems10.1145/366052142:6(1-47)Online publication date: 19-Aug-2024
      • (2024)CoBjeason: Reasoning Covered Object in Image by Multi-Agent Collaboration Based on Informed Knowledge GraphACM Transactions on Knowledge Discovery from Data10.1145/364356518:5(1-56)Online publication date: 28-Feb-2024
      • (2024)Diverse Structure-Aware Relation Representation in Cross-Lingual Entity AlignmentACM Transactions on Knowledge Discovery from Data10.1145/363877818:4(1-23)Online publication date: 13-Feb-2024
      • (2024)Zero-shot Knowledge Graph Question Generation via Multi-agent LLMs and Small Models SynthesisProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679805(3341-3351)Online publication date: 21-Oct-2024
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Login options

      Full Access

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media