research-article

Open access

Reinforcement Learning–based Collective Entity Alignment with Adaptive Features

Authors:

Paul GrothAuthors Info & Claims

ACM Transactions on Information Systems (TOIS), Volume 39, Issue 3

Article No.: 26, Pages 1 - 31

https://doi.org/10.1145/3446428

Published: 05 May 2021 Publication History

All formats PDF

Abstract

Entity alignment (EA) is the task of identifying the entities that refer to the same real-world object but are located in different knowledge graphs (KGs). For entities to be aligned, existing EA solutions treat them separately and generate alignment results as ranked lists of entities on the other side. Nevertheless, this decision-making paradigm fails to take into account the interdependence among entities. Although some recent efforts mitigate this issue by imposing the 1-to-1 constraint on the alignment process, they still cannot adequately model the underlying interdependence and the results tend to be sub-optimal.

To fill in this gap, in this work, we delve into the dynamics of the decision-making process, and offer a reinforcement learning (RL)–based model to align entities collectively. Under the RL framework, we devise the coherence and exclusiveness constraints to characterize the interdependence and restrict collective alignment. Additionally, to generate more precise inputs to the RL framework, we employ representative features to capture different aspects of the similarity between entities in heterogeneous KGs, which are integrated by an adaptive feature fusion strategy. Our proposal is evaluated on both cross-lingual and mono-lingual EA benchmarks and compared against state-of-the-art solutions. The empirical results verify its effectiveness and superiority.

References

[1]

Yasser Altowim, Dmitri V. Kalashnikov, and Sharad Mehrotra. 2014. Progressive approach to relational entity resolution. Proc. Endow. Very Large Data Base 7, 11 (2014), 999–1010.

Digital Library

[2]

Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. 2007. DBpedia: A nucleus for a web of open data. In Proceedings of ISWC. 722–735.

Digital Library

[3]

Indrajit Bhattacharya and Lise Getoor. 2006. A latent dirichlet model for unsupervised entity resolution. In Proceedings of ICDM. 47–58.

[4]

Indrajit Bhattacharya and Lise Getoor. 2007. Collective entity resolution in relational data. Trans. Knowl. Discov. Data 1, 1 (2007), 5.

Digital Library

[5]

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5 (2017), 135–146.

[6]

Kurt D. Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A collaboratively created graph database for structuring human knowledge. In Proceedings of SIGMOD, Jason Tsong-Li Wang (Ed.). ACM, 1247–1250.

Digital Library

[7]

Antoine Bordes, Nicolas Usunier, Alberto García-Durán, Jason Weston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In Proceedings of NIPS. 2787–2795.

Digital Library

[8]

J. Roger Bray and John T. Curtis. 1957. An ordination of the upland forest communities of southern Wisconsin. Ecol. Monogr. 27, 4 (1957), 325–349.

[9]

Christopher J. C. Burges. 2010. From RankNet to LambdaRank to LambdaMART: An Overview. Technical Report. Microsoft Research. Retrieved from http://research.microsoft.com/en-us/um/people/cburges/tech_reports/MSR-TR-2010-82.pdf.

[10]

Yixin Cao, Zhiyuan Liu, Chengjiang Li, Zhiyuan Liu, Juanzi Li, and Tat-Seng Chua. 2019. Multi-channel graph neural network for entity alignment. In Proceedings of ACL. 1452–1461.

[11]

Yixin Cao, Xiang Wang, Xiangnan He, Zikun Hu, and Tat-Seng Chua. 2019. Unifying knowledge graph learning and recommendation: Towards a better understanding of user preferences. In Proceedings of WWW. 151–161.

Digital Library

[12]

Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, and Carlo Zaniolo. 2018. Co-training Embeddings of knowledge graphs and entity descriptions for cross-lingual entity alignment. In Proceedings of IJCAI. 3998–4004.

Digital Library

[13]

Muhao Chen, Yingtao Tian, Mohan Yang, and Carlo Zaniolo. 2017. Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. In Proceedings of IJCAI. 1511–1517.

Digital Library

[14]

Peter Christen. 2012. A survey of indexing techniques for scalable record linkage and deduplication. IEEE Trans. Knowl. Data Eng. 24, 9 (2012), 1537–1555.

Digital Library

[15]

Kevin Clark and Christopher D. Manning. 2016. Deep reinforcement learning for mention-ranking coreference models. In Proceedings of EMNLP. 2256–2262.

[16]

Sanjib Das, Paul Suganthan G. C., AnHai Doan, Jeffrey F. Naughton, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra, and Youngchoon Park. 2017. Falcon: Scaling up hands-off crowdsourced entity matching to build cloud services. In Proceedings of SIGMOD. 1431–1446.

Digital Library

[17]

Jack Doerner, David Evans, and Abhi Shelat. 2016. Secure stable matching at scale. In Proceedings of SIGSAC. 1602–1613.

Digital Library

[18]

Zheng Fang, Yanan Cao, Qian Li, Dongjie Zhang, Zhenyu Zhang, and Yanbing Liu. 2019. Joint entity linking with deep reinforcement learning. In Proceedings of WWW. 438–447.

Digital Library

[19]

Jun Feng, Minlie Huang, Li Zhao, Yang Yang, and Xiaoyan Zhu. 2018. Reinforcement learning for relation classification from noisy data. In Proceedings of AAAI, IAAI, and EAAI. 5779–5786.

[20]

Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. End-to-end multi-perspective matching for entity resolution. In Proceedings of IJCAI. 4961–4967.

[21]

David Gale and Lloyd S. Shapley. 1962. College admissions and the stability of marriage. Amer. Math. Month. 69, 1 (1962), 9–15.

[22]

Marko Gulic, Boris Vrdoljak, and Marko Banek. 2016. CroMatcher: An ontology matching system based on automated weighted aggregation and iterative final alignment. J. Web Semant. 41 (2016), 50–71.

[23]

Lingbing Guo, Zequn Sun, and Wei Hu. 2019. Learning to exploit long-term relational dependencies in knowledge graphs. In Proceedings of ICML. 2505–2514.

[24]

Ben Hixon, Peter Clark, and Hannaneh Hajishirzi. 2015. Learning knowledge graphs for question answering through conversational dialog. In Proceedings of NAACL-HLT. 851–861.

[25]

Thomas N. Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In Proceedings of ICLR.

[26]

Pigi Kouki, Jay Pujara, Christopher Marcum, Laura M. Koehly, and Lise Getoor. 2019. Collective entity resolution in multi-relational familial networks. Knowl. Info. Syst. 61, 3 (2019), 1547–1581.

[27]

Harold W. Kuhn. 1955. The hungarian method for the assignment problem. Naval Res. Logist. Quart. 2, 1--2 (1955), 83–97.

[28]

Simon Lacoste-Julien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, and Zoubin Ghahramani. 2013. SIGMa: Simple greedy matching for aligning large knowledge bases. In Proceedings of KDD. 572–580.

Digital Library

[29]

Vladimir I. Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet Phys. Doklady, Vol. 10. 707–710.

[30]

Chengjiang Li, Yixin Cao, Lei Hou, Jiaxin Shi, Juanzi Li, and Tat-Seng Chua. 2019. Semi-supervised entity alignment via joint knowledge embedding model and cross-graph model. In Proceedings of EMNLP-IJCNLP. 2723–2732.

[31]

Xin Mao, Wenting Wang, Huimin Xu, Man Lan, and Yuanbin Wu. 2020. MRAEA: An efficient and robust entity alignment approach for cross-lingual knowledge graph. In Proceedings of WSDM. 420–428.

Digital Library

[32]

Andrew McCallum and Ben Wellner. 2004. Conditional models of identity uncertainty with application to noun coreference. In Proceedings of NIPS. 905–912.

Digital Library

[33]

Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In Proceedings of ICML. 1928–1937.

Digital Library

[34]

Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, and Vijay Raghavendra. 2018. Deep learning for entity matching: A design space exploration. In Proceedings of SIGMOD. 19–34.

Digital Library

[35]

Hao Nie, Xianpei Han, Ben He, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. Deep sequence-to-sequence entity matching for heterogeneous entity resolution. In Proceedings of CIKM. 629–638.

Digital Library

[36]

Ning Pang, Weixin Zeng, Jiuyang Tang, Zhen Tan, and Xiang Zhao. 2019. Iterative entity alignment with improved neural attribute embedding. In Proceedings of DL4KG@ESWC. 41–46.

[37]

Heiko Paulheim. 2017. Knowledge graph refinement: A survey of approaches and evaluation methods. Semant. Web 8, 3 (2017), 489–508.

Digital Library

[38]

Shichao Pei, Lu Yu, Robert Hoehndorf, and Xiangliang Zhang. 2019. Semi-supervised entity alignment via knowledge graph embedding with awareness of degree difference. In Proceedings of WWW. 3130–3136.

Digital Library

[39]

Alvin E. Roth. 2008. Deferred acceptance algorithms: History, theory, practice, and open questions. Int. J. Game Theory 36, 3--4 (2008), 537–569.

Digital Library

[40]

Wei Shen, Jianyong Wang, and Jiawei Han. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27, 2 (2015), 443–460.

[41]

Parag Singla and Pedro M. Domingos. 2006. Entity resolution with markov logic. In Proceedings of the ICDM. 572–582.

Digital Library

[42]

Fabian M. Suchanek, Serge Abiteboul, and Pierre Senellart. 2011. PARIS: Probabilistic alignment of relations, instances, and schema. Proc. Endow. Very Large Data Base 5, 3 (2011), 157–168.

Digital Library

[43]

Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. Yago: A core of semantic knowledge. In Proceedings of WWW. 697–706.

Digital Library

[44]

Zequn Sun, Wei Hu, and Chengkai Li. 2017. Cross-lingual entity alignment via joint attribute-preserving embedding. In Proceedings of ISWC. 628–644.

[45]

Zequn Sun, Wei Hu, Qingheng Zhang, and Yuzhong Qu. 2018. Bootstrapping entity alignment with knowledge graph embedding. In Proceedings of IJCAI. 4396–4402.

Digital Library

[46]

Zequn Sun, JiaCheng Huang, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. TransEdge: Translating relation-contextualized embeddings for knowledge graphs. In Proceedings of ISWC. 612–629.

[47]

Zequn Sun, Chengming Wang, Wei Hu, Muhao Chen, Jian Dai, Wei Zhang, and Yuzhong Qu. 2020. Knowledge graph alignment network with gated multi-hop neighborhood aggregation. In Proceedings of EAAI. 222–229.

[48]

Bayu Distiawan Trisedya, Jianzhong Qi, and Rui Zhang. 2019. Entity alignment between knowledge graphs using attribute embeddings. In Proceedings of AAAI. 297–304.

[49]

Denny Vrandecic and Markus Krötzsch. 2014. Wikidata: A free collaborative knowledgebase. Commun. ACM 57, 10 (2014), 78–85.

Digital Library

[50]

Zhichun Wang, Qingsong Lv, Xiaohan Lan, and Yu Zhang. 2018. Cross-lingual knowledge graph alignment via graph convolutional networks. In Proceedings of EMNLP. 349–357.

[51]

Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8 (1992), 229–256.

Digital Library

[52]

Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, and Dongyan Zhao. 2019. Relation-aware entity alignment for heterogeneous knowledge graphs. In Proceedings of IJCAI. 5278–5284.

[53]

Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, and Dongyan Zhao. 2019. Jointly learning entity and relation representations for entity alignment. In Proceedings of EMNLP-IJCNLP. 240–249.

[54]

Chenyan Xiong, Russell Power, and Jamie Callan. 2017. Explicit semantic ranking for academic search via knowledge graph embedding. In Proceedings of WWW. 1271–1279.

Digital Library

[55]

Kun Xu, Linfeng Song, Yansong Feng, Yan Song, and Dong Yu. 2020. Coordinated reasoning for cross-lingual knowledge graph alignment. In Proceedings of AAAI. 9354–9361.

[56]

Kun Xu, Liwei Wang, Mo Yu, Yansong Feng, Yan Song, Zhiguo Wang, and Dong Yu. 2019. Cross-lingual knowledge graph alignment via graph matching neural network. In Proceedings of ACL. 3156–3161.

[57]

Hsiu-Wei Yang, Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, and Xu Sun. 2019. Aligning cross-lingual entities with multi-aspect information. In Proceedings of EMNLP-IJCNLP. 4430–4440.

[58]

Weixin Zeng, Xiang Zhao, Jiuyang Tang, and Xuemin Lin. 2020. Collective entity alignment via adaptive features. In Proceedings of ICDE. IEEE, 1870–1873.

[59]

Weixin Zeng, Xiang Zhao, Wei Wang, Jiuyang Tang, and Zhen Tan. 2020. Degree-aware alignment for entities in tail. In Proceedings of SIGIR. ACM, 811–820.

Digital Library

[60]

Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. Multi-view knowledge graph embedding for entity alignment. In Proceedings of IJCAI. 5429–5435.

[61]

Xiang Zhao, Weixin Zeng, Jiuyang Tang, Wei Wang, and Fabian Suchanek. 2020. An experimental study of state-of-the-art entity alignment approaches. IEEE Trans. Knowl. Data Eng. (2020), 1–1. https://ieeexplore.ieee.org/document/9174835.

[62]

Hao Zhu, Ruobing Xie, Zhiyuan Liu, and Maosong Sun. 2017. Iterative entity alignment via joint knowledge embeddings. In Proceedings of IJCAI. 4258–4264.

Digital Library

[63]

Qiannan Zhu, Xiaofei Zhou, Jia Wu, Jianlong Tan, and Li Guo. 2019. Neighborhood-aware attentional representation for multilingual knowledge graphs. In Proceedings of IJCAI. 1943–1949.

Cited By

Liu ZZhang HChen BJiang ZZhao YTao YYang TCui B(2025)CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation ModelsACM Transactions on Information Systems10.1145/3713072Online publication date: 21-Jan-2025
https://doi.org/10.1145/3713072
Zhang ZZeng WTang JHuang HZhao X(2025)Active in-context learning for cross-domain entity resolutionInformation Fusion10.1016/j.inffus.2024.102816117(102816)Online publication date: May-2025
https://doi.org/10.1016/j.inffus.2024.102816
Peng HZhang PTang JXu HZeng W(2024)Detect-Then-Resolve: Enhancing Knowledge Graph Conflict Resolution with Large Language ModelMathematics10.3390/math1215231812:15(2318)Online publication date: 24-Jul-2024
https://doi.org/10.3390/math12152318
Show More Cited By

Index Terms

Reinforcement Learning–based Collective Entity Alignment with Adaptive Features
1. Information systems
  1. Data management systems
    1. Information integration
  2. World Wide Web
    1. Web mining
      1. Data extraction and integration

Recommendations

An Entity Alignment Method Based on Graph Attention Network with Pre-classification
Web Information Systems and Applications
Abstract
Entity alignment is the process of identifying entities that point to the same object in different knowledge graphs. Entity alignment is a key step in building knowledge graphs, and the result of entity alignment directly affects the quality of ...
Adaptive Entity Alignment for Cross-Lingual Knowledge Graph
Knowledge Science, Engineering and Management
Abstract
Entity alignment is a key step in knowledge graph (KG) fusion, which aims to match the same entity from different KGs. Currently, embedding-based entity alignment is the mainstream. It embeds entities into low-dimensional vectors and transfers ...
Entity Alignment Between Knowledge Graphs Using Entity Type Matching
Knowledge Science, Engineering and Management
Abstract
The task of entity alignment between knowledge graphs (KGs) aims to find entities in two knowledge graphs that represent the same real-world entity. Recently, embedding-based entity alignment methods get extended attention. Most of them firstly ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Information Systems

ACM Transactions on Information Systems Volume 39, Issue 3

July 2021

432 pages

ISSN:1046-8188

EISSN:1558-2868

DOI:10.1145/3450607

Editor:
Min Zhang
Tsinghua University, China

Issue’s Table of Contents

Copyright © 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 May 2021

Accepted: 01 December 2020

Revised: 01 November 2020

Received: 01 June 2020

Published in TOIS Volume 39, Issue 3

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Refereed

Funding Sources

Ministry of Science and Technology of China
NSFC
NSF of Hunan Province
The Science and Technology Innovation Program of Hunan Province
Postgraduate Scientific Research Innovation Project of Hunan Province

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

48
Total Citations
View Citations
1,818
Total Downloads

Downloads (Last 12 months)595
Downloads (Last 6 weeks)51

Reflects downloads up to 08 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Liu ZZhang HChen BJiang ZZhao YTao YYang TCui B(2025)CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation ModelsACM Transactions on Information Systems10.1145/3713072Online publication date: 21-Jan-2025
https://doi.org/10.1145/3713072
Zhang ZZeng WTang JHuang HZhao X(2025)Active in-context learning for cross-domain entity resolutionInformation Fusion10.1016/j.inffus.2024.102816117(102816)Online publication date: May-2025
https://doi.org/10.1016/j.inffus.2024.102816
Peng HZhang PTang JXu HZeng W(2024)Detect-Then-Resolve: Enhancing Knowledge Graph Conflict Resolution with Large Language ModelMathematics10.3390/math1215231812:15(2318)Online publication date: 24-Jul-2024
https://doi.org/10.3390/math12152318
Feng SZhou CLiu QJi XHuang M(2024)Temporal Knowledge Graph Reasoning Based on Entity Relationship Similarity PerceptionElectronics10.3390/electronics1312241713:12(2417)Online publication date: 20-Jun-2024
https://doi.org/10.3390/electronics13122417
Duan YTang JXu HLiu CZeng W(2024)Commonsense-Guided Inductive Relation Prediction with Dual Attention MechanismApplied Sciences10.3390/app1405204414:5(2044)Online publication date: 29-Feb-2024
https://doi.org/10.3390/app14052044
Wang LQi PBao XZhou CQin BWooldridge MDy JNatarajan S(2024)Pseudo-label calibration semi-supervised multi-modal entity alignmentProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28762(9116-9124)Online publication date: 20-Feb-2024
https://dl.acm.org/doi/10.1609/aaai.v38i8.28762
Zeng KJin HLv XZhu FHou LZhang YPang FQi YLiu DLi JFeng L(2024)XLORE 3: A Large-Scale Multilingual Knowledge Graph from Heterogeneous Wiki Knowledge ResourcesACM Transactions on Information Systems10.1145/366052142:6(1-47)Online publication date: 19-Aug-2024
https://dl.acm.org/doi/10.1145/3660521
Rong HQian MMa TJin DSheng V(2024)CoBjeason: Reasoning Covered Object in Image by Multi-Agent Collaboration Based on Informed Knowledge GraphACM Transactions on Knowledge Discovery from Data10.1145/364356518:5(1-56)Online publication date: 28-Feb-2024
https://dl.acm.org/doi/10.1145/3643565
Zhang YWu JYu KWu X(2024)Diverse Structure-Aware Relation Representation in Cross-Lingual Entity AlignmentACM Transactions on Knowledge Discovery from Data10.1145/363877818:4(1-23)Online publication date: 13-Feb-2024
https://dl.acm.org/doi/10.1145/3638778
Zhao RTang JZeng WChen ZZhao XSerra ESpezzano F(2024)Zero-shot Knowledge Graph Question Generation via Multi-agent LLMs and Small Models SynthesisProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679805(3341-3351)Online publication date: 21-Oct-2024
https://dl.acm.org/doi/10.1145/3627673.3679805
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Figures

Tables

Media

View Issue’s Table of Contents